A Novel Feature Selection Approach to Classify Breast Cancer Drug using Optimized Grey Wolf Algorithm

  • Shobana, G. (Department of Computer Applications, Madras Christian College)
  • Priya, N. (PG Department of Computer Science, SDNB Vaishnav College for Women)
  • Received : 2022.09.05
  • Published : 2022.09.30

Abstract

Cancer has become a common disease throughout the globe over the past two decades, and there has been a significant increase in cancer among women. Breast and ovarian cancers are more prevalent among women. The majority of patients approach physicians only during the final stage of the disease, so early diagnosis of cancer remains a great challenge for researchers. Although new drugs are synthesized frequently, their multiple benefits are less investigated. Millions of drugs have been synthesized, and their data are accessible through open repositories, so drug repurposing can be done using machine learning techniques. In this paper, we propose a novel feature selection technique that generates multiple populations for the grey wolf algorithm and classifies breast cancer drugs efficiently. Three supervised machine learning algorithms, namely the Random Forest classifier, Multilayer Perceptron, and Support Vector Machine, were applied, and the Multilayer Perceptron had the highest accuracy, 97.7%, for breast cancer drug classification. A leukemia drug dataset was also investigated, on which the Multilayer Perceptron achieved 96% prediction accuracy.
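
The idea of running the grey wolf algorithm over several populations to pick features can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the populations are generated or combined, so the sketch simply assumes several independently initialized packs whose best result is kept. The function names (`make_data`, `fitness`, `gwo_pack`, `multi_population_gwo`), the synthetic data, the 0.5 threshold used to turn positions into feature masks, and the nearest-centroid fitness (a dependency-free stand-in for the Random Forest, Multilayer Perceptron, and Support Vector Machine classifiers named above) are all hypothetical choices made for illustration.

```python
import random

def make_data(rng, n=120, n_informative=3, n_noise=7):
    """Synthetic two-class data (assumption): only the first columns carry signal."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(2.0 * label, 1.0) for _ in range(n_informative)]
        row += [rng.gauss(0.0, 1.0) for _ in range(n_noise)]
        X.append(row)
        y.append(label)
    return X, y

def fitness(mask, X, y, penalty=0.02):
    """Nearest-centroid training accuracy on selected columns, minus a size penalty.
    Scoring on the training data (no held-out split) is a simplification."""
    cols = [j for j, keep in enumerate(mask) if keep]
    if not cols:
        return 0.0
    cents = {}
    for c in (0, 1):
        rows = [x for x, t in zip(X, y) if t == c]
        cents[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for x, t in zip(X, y):
        dist = {c: sum((x[j] - m) ** 2 for j, m in zip(cols, cents[c]))
                for c in (0, 1)}
        correct += (min(dist, key=dist.get) == t)
    return correct / len(X) - penalty * len(cols) / len(mask)

def gwo_pack(X, y, rng, n_wolves=8, n_iter=20):
    """One population of a thresholded grey wolf optimizer."""
    dim = len(X[0])

    def to_mask(pos):
        return [p > 0.5 for p in pos]  # position above 0.5 means "keep feature"

    def score(pos):
        return fitness(to_mask(pos), X, y)

    wolves = [[rng.random() for _ in range(dim)] for _ in range(n_wolves)]
    best_pos, best_fit = wolves[0][:], score(wolves[0])
    for it in range(n_iter + 1):
        wolves.sort(key=score, reverse=True)
        leaders = [w[:] for w in wolves[:3]]  # alpha, beta, delta wolves
        top = score(leaders[0])
        if top > best_fit:
            best_pos, best_fit = leaders[0][:], top
        if it == n_iter:  # last pass only ranks the final positions
            break
        a = 2.0 * (1.0 - it / n_iter)  # decreases linearly from 2 towards 0
        for w in wolves:
            for j in range(dim):
                est = 0.0
                for lead in leaders:
                    A = 2.0 * a * rng.random() - a
                    C = 2.0 * rng.random()
                    est += lead[j] - A * abs(C * lead[j] - w[j])
                w[j] = min(1.0, max(0.0, est / 3.0))  # average, clipped to [0, 1]
    return to_mask(best_pos), best_fit

def multi_population_gwo(X, y, n_populations=4, seed=0):
    """Run several independent packs and keep the best mask (assumed scheme)."""
    rng = random.Random(seed)
    results = [gwo_pack(X, y, rng) for _ in range(n_populations)]
    return max(results, key=lambda r: r[1])

X, y = make_data(random.Random(42))
mask, fit = multi_population_gwo(X, y)
print("selected feature indices:", [j for j, keep in enumerate(mask) if keep])
print("fitness:", round(fit, 3))
```

In a real pipeline the selected columns would then be passed to the classifiers being compared, with accuracy measured on held-out data rather than on the training set used here.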
