Browse > Article
http://dx.doi.org/10.22937/IJCSNS.2022.22.9.36

A Novel Feature Selection Approach to Classify Breast Cancer Drug using Optimized Grey Wolf Algorithm  

Shobana, G. (Department of Computer Applications Madras Christian College)
Priya, N. (PG Department of Computer Science SDNB Vaishnav College for Women)
Publication Information
International Journal of Computer Science & Network Security / v.22, no.9, 2022 , pp. 258-270 More about this Journal
Abstract
Cancer has become a common disease for the past two decades throughout the globe and there is significant increase of cancer among women. Breast cancer and ovarian cancers are more prevalent among women. Majority of the patients approach the physicians only during their final stage of the disease. Early diagnosis of cancer remains a great challenge for the researchers. Although several drugs are being synthesized very often, their multi-benefits are less investigated. With millions of drugs synthesized and their data are accessible through open repositories. Drug repurposing can be done using machine learning techniques. We propose a feature selection technique in this paper, which is novel that generates multiple populations for the grey wolf algorithm and classifies breast cancer drugs efficiently. Leukemia drug dataset is also investigated and Multilayer perceptron achieved 96% prediction accuracy. Three supervised machine learning algorithms namely Random Forest classifier, Multilayer Perceptron and Support Vector Machine models were applied and Multilayer perceptron had higher accuracy rate of 97.7% for breast cancer drug classification.
Keywords
Supervised Machine learning; Grey Wolf Algorithm; Random Forest; Support Vector Machine; Multilayer Perceptron;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Dharani T and Hariprasath S, "Diagnosis of Leukemia and its types Using Digital Image Processing Tehniques", Proceedings of the International Conference on Communication and Electronics Systems (ICCES), IEEE, 2018.
2 Baptista D, Ferreira PG, Rocha M. Deep learning for drug response prediction in cancer. Brief Bioinform. 2021 Jan 18;22(1):360-379. Doi: 10.1093/bib/bbz171. PMID: 31950132.   DOI
3 G Shobana, N Priya, A New Multi-Phase Feature Selection Framework for The Prediction of Breast Cancer Drug Using Machine Learning Techniques, Journal of Algebraic Statistics 13 (2), 300-312(2022).
4 M. -Z. Tsai, P. -Y. Yang, F. -I. Chou and J. -H. Chou, "Parameters Optimization for Improved Grey Wolf Optimizer by Using Uniform Experimental Design," 2021 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), 2021, pp. 1-2, doi: 10.1109/ISPACS51563.2021.9651072.   DOI
5 S. Subbiah, K. S. M. Anbananthen, S. Thangaraj, S. Kannan and D. Chelliah, "Intrusion detection technique in wireless sensor network using grid search random forest with Boruta feature selection algorithm," in Journal of Communications and Networks, vol. 24, no. 2, pp. 264-273, April 2022, doi: 10.23919/JCN.2022.000002.   DOI
6 David Chen, Gaurav Goyal, Ronald Go, Sameer Parikh, Che Ngufor, "Predicting Time to First Treatment in Chronic Lymphocytic Leukemia using Machine Learning Survival and Classification Methods", IEEE International Conference on Healthcare Informatics, IEEE, 2018.
7 Jakkrich Laosai and Kosin Chamnongthai, "Deep-learning-Based Acute Leukemia Classification Using Imaging Flow Cytometry and Morphology", International Workshop on Smart Info-Media Systems in Asia (SISA), IEEE, 2018.
8 Borrero, Luz & Guette, Lilibeth & Lopez, Enrique & Pineda, Omar & Buelvas, Edgardo. (2020). Predicting Toxicity Properties through Machine Learning. Procedia Computer Science. 170. 1011-1016. 10.1016/j.procs.2020.03.093.   DOI
9 A. Jha, G. Verma, Y. Khan, Q. Mehmood, D. Rebholz-Schuhmann and R. Sahay, "Deep Convolution Neural Network Model to Predict Relapse in Breast Cancer," 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), 2018, pp. 351-358, Doi: 10.1109/ICMLA.2018.0005   DOI
10 http://www.swissadme.ch/
11 Mathew,T.E. (2019). A Logistic Regression with Recursive Feature Elimination Model for Breast Cancer Diagnosis. International Journal on Emerging Technologies, 10(3): 55-63.
12 https://www.genome.jp/kegg/drug
13 Qian Xu & Qiang Yang. (2011). A Survey of Transfer and Multitask Learning in Bioinformatics. Journal of Computing Science and Engineering. 5(3), 257-268.   DOI
14 Prashant Singh Rana, Harish Sharma, Mahua Bhattacharya & Anupam Shukla. (2015). Quality assessment of modelled protein structure using physicochemical properties. Journal of bioinformatics and computational biology.13(02).
15 E. A. Bayrak, P. Kirci and T. Ensari, "Comparison of Machine Learning Methods for Breast Cancer Diagnosis," 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), Istanbul, Turkey, 2019, pp. 1-3, doi: 10.1109/EBBT.2019.8741990.   DOI
16 Jiang, D., Lei, T., Wang, Z. et al. ADMET evaluation in drug discovery. 20. Prediction of breast cancer resistance protein inhibition through machine learning. J Cheminform 12, 16 (2020). https://doi.org/10.1186/s13321-020-00421-y.   DOI
17 H. Motohashi, T. Teraoka, S. Aoki and H. Ohwada, "Regression Models and Ranking Method for p53 Inhibitor Candidates Using Machine Learning," 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Madrid, Spain, 2018, pp. 708-712, doi: 10.1109/BIBM.2018.8621142.   DOI
18 Preeti Jagadev and H.G.Virani, "Detection of Leukemia and its Types using Image Processing and Machine Learning", International conference on Trends in Electronics and Informatics (ICEI), IEEE 2017.
19 Sachin Paswan and Yogesh Rathore, "Recognition and Arrangement of Blood Cancer from Microscopic Cell pictures Utilizing Support Vector Machine K-Nearest Neighbor and Deep Learning", International Conference on Communication, Computing and Internet of Things (IC3IOT), IEEE, 2018.
20 Lind AP, Anderson PC. Predicting drug activity against cancer cells by random forest models based on minimal genomic information and chemical properties. PLoS One. 2019 Jul 11;14(7): e0219774. Doi: 10.1371/journal.pone.0219774. PMID: 31295321; PMCID: PMC6622537.   DOI
21 C. Chen, L. Song, C. Bo and W. Shuo, "A Support Vector Machine with Particle Swarm Optimization Grey Wolf Optimizer for Network Intrusion Detection," 2021 International Conference on Big Data Analysis and Computer Science (BDACS), 2021, pp. 199-204, doi: 10.1109/BDACS53596.2021.00051.   DOI
22 J. Liu, X. Wei and H. Huang, "An Improved Grey Wolf Optimization Algorithm and its Application in Path Planning," in IEEE Access, vol. 9, pp. 121944-121956, 2021, doi: 10.1109/ACCESS.2021.3108973.   DOI
23 https://www.ebi.ac.uk/chembl/
24 G. Shobana and S. N. Bushra, "Classification of Myopia in Children using Machine Learning Models with Tree Based Feature Selection," 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 2020, pp. 1599-1605, doi: 10.1109/ICECA49313.2020.9297623.   DOI
25 S. Mirjalili, S. M. Mirjalili, and A. Lewis, ''Grey wolf optimizer,'' Adv. Eng. Softw., vol. 69, pp. 46-61, Mar. 2014   DOI
26 https://scikit-learn.org/stable/
27 https://www.ibm.com/in-en/products/spss-statistics
28 G. Shobana and S. N. Bushra, "Prediction of Cardiovascular Disease using Multiple Machine Learning Platforms," 2021 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES), 2021, pp. 1-7, doi: 10.1109/ICSES52305.2021.9633797.   DOI