DOI QR코드

DOI QR Code

FEROM: Feature Extraction and Refinement for Opinion Mining

  • Jeong, Ha-Na (Department of IT Research, IBK Systems Corp., Daum Communications Corp.) ;
  • Shin, Dong-Wook (Department of Computer Science and Engineering, Hanyang University) ;
  • Choi, Joong-Min (Department of Computer Science and Engineering, Hanyang University)
  • Received : 2010.10.23
  • Accepted : 2011.04.07
  • Published : 2011.10.31

Abstract

Opinion mining involves the analysis of customer opinions using product reviews and provides meaningful information including the polarity of the opinions. In opinion mining, feature extraction is important since the customers do not normally express their product opinions holistically but separately according to its individual features. However, previous research on feature-based opinion mining has not had good results due to drawbacks, such as selecting a feature considering only syntactical grammar information or treating features with similar meanings as different. To solve these problems, this paper proposes an enhanced feature extraction and refinement method called FEROM that effectively extracts correct features from review data by exploiting both grammatical properties and semantic characteristics of feature words and refines the features by recognizing and merging similar ones. A series of experiments performed on actual online review data demonstrated that FEROM is highly effective at extracting and refining features for analyzing customer review data and eventually contributes to accurate and functional opinion mining.

Keywords

References

  1. J. Willis, "What Impact Will E-Commerce Have on the U.S. Economy?" Economic Review, Federal Reserve Bank of Kansas City, vol. 89, no. 2, 2004, pp. 53-71.
  2. N. Li and Z. Ping, "Consumer Online Shopping Attitudes and Behavior: An Assessment of Research," Proc. 8th Americas Conf. Inf. Syst., 2002, pp. 508-517.
  3. B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Info. Retrieval, vol. 2, no. 1-2, 2008, pp. 1-135. https://doi.org/10.1561/1500000011
  4. A. Popescu and O. Etzioni, "Extracting Product Features and Opinions from Reviews," Proc. Conf. Human Language Technol. Empirical Methods Natural Language Process., 2005, pp. 339-346.
  5. B. Liu, M. Hu, and J. Cheng, "Opinion Observer: Analyzing and Comparing Opinions on the Web," Proc. 14th Int. Conf. World Wide Web, 2005, pp. 342-351.
  6. Y. Kim et al., "Feature Selection in Data Mining," Data Mining: Opportunities and Challenges, Idea Group Publishing, 2003, pp. 80-105.
  7. A. Kotcz, P. Vidya, and K. Jugal, "Summarization as Feature Selection for Text Categorization," Proc. 10th Intl. Conf. Inf. Knowl. Manag., 2001, pp. 365-370.
  8. B. Liu, "Opinion Mining," Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer, 2007, pp. 411- 448.
  9. B. Liu and M. Hu, "Mining Opinion Features in Customer Reviews," Proc. 19th Nat. Conf. Artificial Int., 2004, pp. 755-760.
  10. X. Ding, B. Liu, and Y. Philip, "A Holistic Lexicon-Based Approach to Opinion Mining," Proc. Int. Conf. Web Search Web Data Mining, 2008, pp. 231-240.
  11. A. Abbasi, H. Chen, and A. Salem, "Sentiment Analysis in Multiple Languages: Feature Selection for Opinion Classification," ACM Trans. Inf. Syst., vol. 26, no. 3, 2008, pp 1- 34.
  12. S. Das and M. Chen, "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Manag. Sci., vol. 53, no. 9, 2001, pp. 1375-1388.
  13. S. Aciar et al., "Informed Recommender: Basing Recommendations on Consumer Product Reviews," IEEE Intell. Syst., vol. 22, no. 3, 2007, pp. 39-47. https://doi.org/10.1109/MIS.2007.55
  14. NLProcessor-Text Analysis Toolkit, 2000. http://www. infogistics.com/textanalysis
  15. Porter's Stemming Algorithm. http://tartarus.org/-martin/ PorterStemmer/
  16. O. Schiller and A. Caramazza, "Grammatical Feature Selection in Noun Phrase Production: Evidence from German and Dutch," J. Memory and Language, vol. 48, no. 1, 2003, pp. 169-194. https://doi.org/10.1016/S0749-596X(02)00508-9
  17. G. Miller et al., "Introduction to WordNet: An On-line Lexical Database," Int. J. Lexicography, vol. 3, no. 4, 1990, pp. 235-244. https://doi.org/10.1093/ijl/3.4.235
  18. R. Beaza-Yates and B. Ribeiro-Neto, Modern Information Retrieval, Addison-Wesley, 1999.

Cited by

  1. Fuzzy Domain Ontology-based Opinion Mining for Transportation Network Monitoring and City Features Map vol.15, pp.1, 2016, https://doi.org/10.12815/kits.2016.15.1.109
  2. A review of feature selection techniques in sentiment analysis vol.23, pp.1, 2011, https://doi.org/10.3233/ida-173763
  3. Extraction Sentiment Analysis Using naive Bayes Algorithm and Reducing Noise Word applied in Indonesian Language vol.835, pp.None, 2011, https://doi.org/10.1088/1757-899x/835/1/012051
  4. CROSA: Context‐aware cloud service ranking approach using online reviews based on sentiment analysis vol.33, pp.7, 2021, https://doi.org/10.1002/cpe.5358