Browse > Article
http://dx.doi.org/10.7314/APJCP.2013.14.1.595

Early Detection of Lung Cancer Risk Using Data Mining  

Ahmed, Kawsar (Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University)
Abdullah-Al-Emran, Abdullah-Al-Emran (Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University)
Jesmin, Tasnuba (Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University)
Mukti, Roushney Fatima (Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University)
Rahman, Md. Zamilur (Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University)
Ahmed, Farzana (Department of Mathematics and Natural Science, BRAC University)
Publication Information
Asian Pacific Journal of Cancer Prevention / v.14, no.1, 2013 , pp. 595-598 More about this Journal
Abstract
Background: Lung cancer is the leading cause of cancer death worldwide Therefore, identification of genetic as well as environmental factors is very important in developing novel methods of lung cancer prevention. However, this is a multi-layered problem. Therefore a lung cancer risk prediction system is here proposed which is easy, cost effective and time saving. Materials and Methods: Initially 400 cancer and non-cancer patients' data were collected from different diagnostic centres, pre-processed and clustered using a K-means clustering algorithm for identifying relevant and non-relevant data. Next significant frequent patterns are discovered using AprioriTid and a decision tree algorithm. Results: Finally using the significant pattern prediction tools for a lung cancer prediction system were developed. This lung cancer risk prediction system should prove helpful in detection of a person's predisposition for lung cancer. Conclusions: Most of people of Bangladesh do not even know they have lung cancer and the majority of cases are diagnosed at late stages when cure is impossible. Therefore early prediction of lung cancer should play a pivotal role in the diagnosis process and for an effective preventive strategy.
Keywords
Data mining; pre-processing; disease diagnosis; aprioriTid algorithm; DT algorithm; Bangladesh;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Amorim R, Mirkin B (2012). Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering. Pattern Recognition, 45,1061-75.   DOI   ScienceOn
2 Brennan P, Hainaut P, Boffetta P (2011). Genetics of lung-cancer susceptibility. Lancet Oncol, 12, 399-408.   DOI   ScienceOn
3 Ferlay J, Shin HR, Bray F, et al (2010). GLOBOCAN 2008: cancer incidence and mortality worldwide: IARC, 10, 220-7.
4 Gothwal H, Kedawat S, Kumar R (2011). Cardiac arrhythmias detection in an ECG beat signal using fast fourier transform and artificial neural network. J Bio Sci Engineering, 4, 289-96.   DOI
5 Jayalakshmi T, Santhakumaran A (2010). A novel classification method for classification of diabetes mellitus using artificial neural networks. International Conference on Data Storage and Data Engineering. 159-63
6 Lan C, Liu Y, Tang Z (2010). Improvement of aprioritid algorithm for mining frequent items[J]. Computer Applications And Software, 27, 234-6.
7 Manaswini P, Ranjit KS (2011). Predict the onset of diabetes disease using artificial neural network (ANN). Int J Computer Sci & Emerging Technologies, 2, 303-11.
8 Muhammad ASapon, Khadijah Ismail, Suehazlyn Zainudin (2011). Prediction of diabetes by using artificial neural network. 2011 International Conference on Circuits, System and Simulation, 7, 299-303.
9 Schmid K, Kuwert T, Drexler H (2010). Radon in indoor spaces: an underestimated risk factor for lung cancer in environmental medicine. Dtsch Arztebl Int, 107, 181-6.
10 Smith L, Brinton LA, Spitz MR, et al (2012) Body mass index and risk of lung cancer among never, former, and current smokers. J Natl Cancer Inst, 104, 778-89.   DOI
11 Yael Ben-Haim , Elad Tom-Tov (2010) A streaming parallel decision tree algorithm. J Machine Learning Res, 11, 849-72.