Browse > Article
http://dx.doi.org/10.9708/jksci.2020.25.11.041

Exploring the Feature Selection Method for Effective Opinion Mining: Emphasis on Particle Swarm Optimization Algorithms  

Eo, Kyun Sun (SKK Business School, Sungkyunkwan University)
Lee, Kun Chang (SKK Business School/SAIHST (Samsung Advanced Institute of Health Sciences & Technology), Sungkyunkwan University)
Abstract
Sentimental analysis begins with the search for words that determine the sentimentality inherent in data. Managers can understand market sentimentality by analyzing a number of relevant sentiment words which consumers usually tend to use. In this study, we propose exploring performance of feature selection methods embedded with Particle Swarm Optimization Multi Objectives Evolutionary Algorithms. The performance of the feature selection methods was benchmarked with machine learning classifiers such as Decision Tree, Naive Bayesian Network, Support Vector Machine, Random Forest, Bagging, Random Subspace, and Rotation Forest. Our empirical results of opinion mining revealed that the number of features was significantly reduced and the performance was not hurt. In specific, the Support Vector Machine showed the highest accuracy. Random subspace produced the best AUC results.
Keywords
Sentiment analysis; Feature selection; Particle Swarm Optimization; Multi Objective Evolutionary Algorithm; Machine learning;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 L. D. Vignolo, D. H. Milone, and J. Scharcanski, "Feature Selection for Face Recognition Based on Multi-Objective Evolutionary Wrappers," Expert Systems with Applications, Vol. 40, No. 13, pp. 5077-5084, Oct 2013. DOI: 10.1016/j.eswa.2013.03.032   DOI
2 U. Mlakar, I. Fister, J. Brest, and B. Potocnik, "Multi-Objective Differential Evolution for Feature Selection in Facial Expression Recognition Systems," Expert Systems with Applications, Vol. 89, pp. 129-137, Dec 2017. DOI: 10.1016/j.eswa.2017.07.037   DOI
3 K. Xu, G. Qi, J. Huang, T. Wu, and X. Fu, "Detecting Bursts in Sentiment-Aware Topics from Social Media," Knowledge-Based Systems, Vol. 141, pp. 44-54, Feb 2018. DOI: 10.1016/j.knosys.2017.11.007   DOI
4 A. Aizawa, "An Information-Theoretic Perspective of Tf-Idf Measures," Information Processing & Management, Vol. 39, No. 1, pp. 45-65, Jan 2003. DOI: 10.1016/S0306-4573(02)00021-3   DOI
5 F. Jimenez, G. Sanchez, J. M. Garcia, G. Sciavicco, and L. Miralles, "Multi-Objective Evolutionary Feature Selection for Online Sales Forecasting," Neurocomputing, Vol. 234, pp. 75-92, Apr 2017. DOI: 10.1016/j.neucom.2016.12.045   DOI
6 Y. Zhang, D. W. Gong, X. Z. Gao, T. Tian, & X. Y. Sun, "Binary Fifferential Evolution with Self-Learning for Multi-Objective Feature Selection," Information Sciences, Vol. 507, pp. 67-85, Jan 2020. DOI: 10.1016/j.ins.2019.08.040   DOI
7 S. Rosenthal, N. Farra, and P. Nakov, "SemEval-2017 Task 4: Sentiment Analysis in Twitter," arXiv preprint arXiv:1912.00741, 2019. DOI: 10.18653/v1/S17-2088   DOI
8 S. Arlot, and A. Celisse, "A Survey of Cross-Validation Procedures for Model Selection,". Statistics Surveys, Vol. 4, pp. 40-79, 2010. DOI: 10.1214/09-SS054   DOI
9 K. S. Eo, and K. C. Lee, "Exploring an Optimal Feature Selection Method for Effective Opinion Mining Tasks," Journal of the Korea Society of Computer and Information, Vol. 24, No. 2, pp. 171-177, Feb 2019. DOI: 10.9708/jksci.2019.24.02.171   DOI
10 A. Yadollahi, A. G. Shahraki, and O. R. Zaiane, "Current State of Text Sentiment Analysis from Opinion to Emotion Mining," ACM Computing Surveys (CSUR), Vol. 50, No.2, pp. 1-33, May 2017. DOI: 10.1145/3057270   DOI
11 N. F. Da Silva, E. R. Hruschka, and E. R. Hruschka Jr, "Tweet Sentiment Analysis with Classifier Ensembles," Decision Support Systems, Vol. 66, pp. 170-179, Oct 2014. DOI: 10.1016/j.dss.2014.07.003   DOI
12 M. A. Friedl, and C. E. Brodley, "Decision Tree Classification of Land Cover from Remotely Sensed Data," Remote sensing of environment, Vol. 61, No. 3, pp. 399-409, Sep 1997. DOI: 10.1016/S0034-4257(97)00049-7   DOI
13 N. Friedman, D. Geiger, and M. Goldszmidt, "Bayesian Network Classifiers," Machine Learning, Vol. 29, No. 2-3, pp. 131-163, Nov 1997. DOI: 10.1023/A:1007465528199   DOI
14 J. A. Suykens, and J. Vandewalle, "Least Squares Support Vector Machine Classifiers,". Neural Processing Letters, Vol. 9, No. 3, pp. 293-300, Jun 1999. DOI: 10.1023/A:1018628609742   DOI
15 L. Breiman, "Random Forests," Machine Learning, Vol. 45, No. 1, pp. 5-32, Oct 2001. DOI: 10.1023/A:1010933404324   DOI
16 F. Corea, "Can Twitter Proxy the Investors' Sentiment? The Case for The Technology Sector," Big Data Research, Vol. 4, pp. 70-74, Jun 2016. DOI: 10.1016/j.bdr.2016.05.001   DOI
17 L. Breiman, "Bagging Predictors," Machine Learning, Vol. 24, No. 2, pp. 123-140, Aug 1996. DOI: 10.1007/BF00058655   DOI
18 T. K. Ho, "The Random Subspace Method for Constructing Decision Forests," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 8, pp. 832-844, Aug 1998. DOI: 10.1109/34.709601   DOI
19 Q. Ye, Z. Zhang, and R. Law, "Sentiment Classification of Online Reviews to Travel Destinations by Supervised Machine Learning Approaches," Expert Systems with Applications, Vol. 36, No. 3, pp. 6527-6535, Apr 2009. DOI: 10.1016/j.eswa.2008.07.035   DOI
20 R. Moraes, J. F. Valiati, and W. P. G. Neto, "Document-Level Sentiment Classification: An Empirical Comparison Between SVM and ANN," Expert Systems with Applications, Vol. 40, No. 2, pp. 621-633, Feb 2013. DOI: 10.1016/j.eswa.2012.07.059   DOI
21 Y. Ruan, A. Durresi, and L. Alfantoukh, "Using Twitter Trust Network for Stock Market Analysis," Knowledge-Based Systems, Vol. 145, pp. 207-218, Apr 2018. DOI: 10.1016/j.knosys.2018.01.016   DOI
22 M. Ghiassi, J. Skinner, and D. Zimbra, "Twitter Brand Sentiment Analysis: A hybrid System Using N-Gram Analysis and Dynamic Artificial Neural Network," Expert Systems with Applications, Vol. 40, No. 16, pp. 6266-6282, Nov 2013. DOI: 10.1016/j.eswa.2013.05.057   DOI
23 G. Wang, J. Sun, J. Ma, K. Xu, and J. Gu, "Sentiment Classification: The Contribution of Ensemble Learning," Decision Support Systems, Vol. 57, pp. 77-93, Jan 2014. DOI: 10.1016/j.dss.2013.08.002   DOI
24 N. A. Krisshna, V. K. Deepak, K. Manikantan, and S. Ramachandran, "Face Recognition Using Transform Domain Feature Extraction And PSO-Based Feature Selection," Applied Soft Computing, Vol. 22, pp. 141-161, Sep 2014. 10.1016/j.asoc.2014.05.007   DOI
25 S. Yoo, J. Song, & O. Jeong, "Social Media Contents Based Sentiment Analysis and Prediction System," Expert Systems with Applications, Vol. 105, pp. 102-111, Sep 2018. DOI: 10.1016/j.eswa.2018.03.055   DOI
26 A. Garcia-Pablos, M. Cuadros, & G. Rigau, "W2VLDA: Almost Snsupervised System for Aspect Based Sentiment Analysis," Expert Systems with Applications, Vol. 91, pp. 127-137, Jan 2018. DOI: 10.1016/j.eswa.2017.08.049   DOI
27 M. Amoozegar, and B. Minaei-Bidgoli, "Optimizing Multi-Objec tive PSO Based Feature Selection Method Using a Feature Elitism Mechanism," Expert Systems with Applications, Vol. 113, pp. 499-514, Dec 2018. DOI: 10.1016/j.eswa.2018.07.013   DOI
28 B. Xue, M. Zhang, and W. N. Browne, "Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach," IEEE Transactions on Cybernetics, Vol. 43, No. 6, pp. 1656-1671, Dec 2012. DOI: 10.1109/TSMCB.2012.2227469.   DOI
29 L. Cervante, B. Xue, M. Zhang, and L. Shang, "Binary Particle Swarm Optimisation for Feature selection: A Filter Based Approach," In 2012 IEEE Congress on Evolutionary Computation, pp. 1-8, 2012. DOI: 10.1109/CEC.2012.6256452.   DOI
30 N. Kushwaha, and M. Pant, "Link Based BPSO for Feature Selection in Big Data Text Clustering," Future Generation Computer Systems, Vol. 82, pp. 190-199, May 2018. DOI: 10.1016/j.future.2017.12.005   DOI
31 Z. Wang, M. Li, and J. Li, "A Multi-Objective Evolutionary Algorithm for Feature Selection Based on Mutual Information with a New Redundancy Measure," Information Sciences, Vol. 307, pp. 73-88, Jun 2015. DOI: 10.1016/j.ins.2015.02.031   DOI
32 A. Gaspar-Cunha, "Feature Selection Using Multi-Objective Evolutionary Algorithms: Application to Cardiac SPECT Diagnosis," In Advances in Bioinformatics, pp. 85-92, 2010. DOI: 10.1007/978-3-642-13214-8_11   DOI
33 J. B. Park, K. S. Lee, J. R. Shin, and K. Y. Lee, "A Particle Swarm Optimization for Economic Dispatch with Nonsmooth Cost Functions," IEEE Transactions on Power Systems, Vol. 20, No. 1, pp. 34-42, Jan 2005. DOI: 10.1109/TPWRS.2004.831275.   DOI