I. INTRODUCTION
India has one of the highest rates in oral cancer in the world partly attributed to high prevalence of tobaccochewing. In parallel to the increase in Oral Cancer, border-line malignant lesions which range from epithelial dysplasia to intra epithelial carcinoma have also increased in numbers [1]. In the past few years deep learning has proven to be efficient in many classification problemsespecially in medical imaging data. In the last few years alarge number of machine learning based frameworksespecially deep learning were employed by researchers todetect and classify the oral cancer. These deep learning based systems have proved to be very efficient and has achieved accuracy almost equal to the specialistpathologist that does its work manually and has ample of experience in his field [2].
The term dysplasia was introduced by Reagon in 1958 in a study where he described the features of dysplasia [3]. In medical terms dysplasia means an abnormaldevelopment in cells while histomorphologically any cellular or structural change in epithelium is dysplasia. Major pathological microscopic changes at cellular levelas well as tissue level in dysplastic tissue are given in Table 1: [4].
Table 1: Pathological microscopic changes at the cellular level and tissue level in dysplasia
Dysplasia is a part of the pathway to malignancy. Dysplastic cells cannot be called malignant until and unless it invades the connective tissue or metastasize. The dysplastic cells can be classified into mild, moderate and severe dysplasia.
The progressive change in dysplastic cells from mild tomoderate and then from moderate to severe can be seen in the epithelium. The invasive activity of dysplastic cells starts from the lower layer, then progress to the middlelayer and finally full thickness of epithelium. Next, it willinvade the basement membrane [5].
Fig. 1. Showing the progression of dysplasia from normalepithelium to in-citu carcinoma.
As the dysplasia arises in the basal layer of the epitheliumand extends, with progression, to the upper epitheliallayers, the scheme classifies mild dysplasia asinvolvement of the lower third of the epithelium only, moderate dysplasia as extension to the middle third, and severe dysplasia as extension to the superficial third of the epithelium [6].
Early detection plays a key role in cancer diagnosis and can improve long-term survival rates. Medical imaging is a very important technique for early cancer detection and diagnosis. As is well known, medical imaging has been widely employed for early cancer detection, monitoring, and follow-up after the treatments [7].
This paper aims to classify oral dysplasia by using populardeep learning technique namely Convolutional Neural Network. Since our data is in image format that’s why CNN (Convolutional Neural Network) is preferable overother deep learning techniques. Recent studies show that CNN achieve promise performance in cancer detection and diagnosis. [8, 9].
II. LITERATURE REVIEW
This section is intended to provide a brief review of recent studies on applying deep learning for early cancerdetection, cancer diagnosis and prognosis.
In recent years, a bunch of papers has been published about the application of deep learning in cancer detection and diagnosis. In [10], Gustav Forslid et al. have used CNN on oral cancer dataset. Two different CNN models were used namely ResNet and VGG. Results were very good as compared to other image diagnosis techniques. In [11], Geert Litjens et al. have presented Prostate canceridentification in biopsy specimens and breast cancermetastasis detection in sentinel lymph nodes using deeplearning and concluded that deep learning improves the accuracy of prostate cancer diagnosis and breast cancerstaging. Authors employed CNN. In [12], Albayrak et al. developed a deep learning based feature extractionalgorithm to detect mitosis in breast histopathologicalimages. In the proposed algorithm, the CNN model was used to extract features which were used to train a support vector machine (SVM) for the detection of mitosis. In [13], Krizhevsky et al. used AlexNet to construct a CNN model to classify benign or malignant tumors from the breasthistopathological images. In [14], Akshay Iyer et al. have used the pre-trained model, VGG19 to extract information from the pathological images specific for lung cancer. Amodel consisting of deep convolution network-based image classification has been proposed for predictions on mutations in genes signature information of lung canceramong Indian populations. In [15], Bassma El-Sherbiny et al. propose - Brain/Lung/Breast (BLB) automated detection system. It precisely predicts the occurrence of cancer and segments the expected region of tumor/cancerin MRI/CT scan/Mammography images. This systemproposes different classification techniques including Support Vector Machine (SVM), ExtraTrees and convolutional neural network (CNN). CNN performed exceptionally well in the detection of lung cancer. In [16], Chen et al. proposed a deep cascade network for mitosis detection in breast histology slides. They first trained a fully connected network model to extract mitosiscandidates from the whole histology slides and then fine-tuned a CaffeNet model for the classification of mitosis. Three networks with different configurations of fully connected layers were trained and the scores wereaveraged to generate the final output. In [17], I da Arvidsson et al. compared two different techniques; by training the networks using color augmentation and by using digital stain separation using an auto encoder. The author achieved accuracies of 95% for classification of benign versus malignant tissue and 81% for Gleasongrading for data from the same site as the training data. In [18], Albarqouni et al. explored deep CNN in a biomedical context, a multiscale CNN architecture was developed with an aggregation layer was introduced after the softmax layer to aggregate the prediction results with the annotation results from multiple participation. In [19], Sawon Pratiheret al. have deployed deep learning framework to classify elastic scattering spectra of biological tissues into normal and cancerous ones. Authorsexperimented to show the superiority of the convolutional neural network extracted deep features over classical hand crafted biomarkers. The proposed method employselastic scattering spectra of the tissues as input to CNN. In [20], Wichakam et al. proposed a combined system consisting of CNN and SVM for mass detection on digital mammograms. CNN was used on mammographic patchesto get the high-level feature representation of the image. This high-level feature set was used as input to SVM forclassification of mammograms. In [21], Xu et al. proposed a stacked sparse auto-encoder based algorithm to classify nuclei in breast cancer histopathology. In [22], Tian Xia et al. proposed a system of tumor classification by pre-training a CNN from samples of different tissue types inhistopathological images, then fine-tuning the obtained pre-trained on a particular tissue type. CNN showed improvement over training from scratch with limited data. In [23], Mina Khoshdeli et al. applied convolutional neural networks for grading of the tumor and decomposing tumor architecture from H&E stained histology sections of the kidney. In [24], Xu Y et al. proposed deep convolutional neural network activationfeatures to perform classification, segmentation and visualization in large-scale tissue histopathology images. Authors used a pre-trained ImageNet network onfeatures extracted by CNN.
III. MATERIAL AND METHODOLOGY
In this study we have taken data of 52 patients suffering from oral dysplasia. 2 H&E stained histopathological samples are taken from each patient. The data is taken from Indira Gandhi Govt. Dental College and Hospital, Jammu, India. Based on CIN (Cervical intraepithelial neoplasia) classification, these data samples were classified into 4 different classes namely normal, milddysplasia, moderate dysplasia and severe dysplasia.
There are a total of 2688 images in our dataset. 663 images belonging to a normal class, 681 belonging to milddysplasia, 675 belonging to moderate dysplasia and 669 belonging to severe dysplasia.
CNN was created and trained from scratch. Pythonlanguage along with the Tensor flow and Keras deeplearning libraries were used to build the CNN model.
Fig. 2. Showing Basic structure of a CNN.
Dataset was divided into training set and testing set in a 70:30 ratio respectively. The training set contained 1882 images and testing set contained 806 images. The research methodology diagram is given in figure 4:
Fig. 3. CNN Model Layer wise.
Fig. 4. Research methodology diagram.
IV. RESULTS AND DISCUSSION
The classification of the dysplastic tissue was carried out by CNN into 4 different classes. When CNN was trained on training data it gave anaccuracy of 91.65% and in testing, it gave an accuracy of 89.3%. Our CNN was trained for a total of 75 epochs. The Accuracy and loss of model is given in figure 5(a) and figure 5(b) respectively.
Fig. 5. Training results.
Our proposed CNN produced the following output of classified tissue images along with labels:
Fig. 6. Predicted Tissue images along with labels.
The confusion matrix obtained from the experiment is given in figure 3:
Fig. 7. Confusion matrix for testing of 806 oral epithelial tissue images.
Precision, recall, f1-score for each class and average of all classes are given in Table 2.
Table 1. Precision, Recall and f1-score of testing.
V. CONCLUSION
Our proposed deep learning system although not in realtime but produced promising results. Our proposed deeplearning system has achieved higher accuracies almost as the experienced specialist concerned oral pathologist.
VI. FUTURE SCOPE
In the future, we will increase the dataset size to input more and more training data so that our proposed deeplearning model will learn more efficiently. Also, we will fine-tune our deep learning model so that it can achievemore accuracy in predicting the true label.
VII. ACKNOWLEDGMENTS
I would like to thank the faculty of the Department of Oral Pathology and Department of Oral Medicine and Radiology from the Institute of Indira Gandhi Govt. Dental College and Hospital for their timely support inundertaking this research work.
참고문헌
- Yoshiko Ariji, Motoki Fukuda, Yoshitaka Kise, Michihito Nozawa, Yudai Yanashita, Hiroshi Fujita, Akitoshi Katsumata, Eiichiro Ariji, "Contrast-enhanced computed tomography image assessment of cervical lymph node metastasis in patients with oral cancer by using a deep learning system of artificial intelligence", Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology , 2018
- Gnananandar G, Masthan K, Babu A. N, Jacobina J.," Histomorphometric Analysis of oral Epithelial Dysplasia and oral Squamous Cell Carcinoma". Biomed Pharmacology Journal, 2017
- Dena J. Fischer, Joel B. Epstein, Thomas H. Morton, Stephen M. Schwartz," Interobserver reliability in the histopathologic diagnosis of oral pre-malignant and malignant lesions", Journal of Oral Pathology Medicine, Vol. 33, No. 2, pp. 65-70, 2004. https://doi.org/10.1111/j.1600-0714.2004.0037n.x
- Barnes L, Eveson JW, Reichart P, Sidransky D, editors. "WHO classification of tumours: pathology and genetics of head and neck tumours". Lyon: ARC Press; 2005.
- S. Warnakulasuriya, "Histological grading of oral epithelial dysplasia: revisited". Journal of Pathology. pp. 294-297, 2001.
- Hong D. Xiao, Carlo B. Bifulco, "Cellular and Molecular Pathology, Oral, Head and Neck Oncology and Reconstructive Surgery", Elsevier, pp. 57-78, 2018.
- Zilong Hu , Jinshan Tang , Ziming Wang , Kai Zhang , Lin Zhang , Qingling Sun , "Deep Learning for Image-based Cancer Detection and Diagnosis A Survey", Pattern Recognition 2018.
- Y. LeCun, Y. Bengio, G. Hinton, "Deep learning", Nature, Vol. 521, pp. 436-444, 2015. https://doi.org/10.1038/nature14539
- G. Forslid; Håkan Wieslander; Ewert Bengtsson; Carolina Wählby; Jan-Michael Hirsch; Christina Runow Stark; Sajith Kecheril Sadanandan, "Deep Convolutional Neural Networks For Detecting Cellular Changes Due To Malignancy". IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, pp. 82-89, 2017.
- G. Litjens, Clara I. Sanchez, Nadya Timofeeva, Meyke Hermsen, Iris Nagtegaal, Iringo Kovacs, Christina Hulsbergen - van de Kaa, Peter Bult, Bram van Ginneken & Jeroen van der Laak, "Deep learning as a tool for increased accuracy and efficiency of Histopathological diagnosis", Scientific Reports, Vol. 6, pp. 262-86, 2016.
- A. Albayrak, G. Bilgin, "Mitosis detection using convolutional neural network based features", IEEE 17th International Symposium on Computational Intelligence and Informatics (CINTI), pp. 335-340, 2017.
- A. Krizhevsky, I. Sutskever, G.E. Hinton, "Imagenet classification with deep convolutional neural networks", Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
- A. Iyer, A. Vyshnavi, M. Hima, P. K. Namboori, Krishnan, "Deep Convolution Network-Based Prediction Model For Medical Diagnosis Of Lung Cancer - A Deep Pharmacogenomic Approach: deep diagnosis for lung cancer," Second International Conference on Advances in Electronics, Computers and Communications, pp.1-4, 2018.
- Bassma El-Sherbiny, Nardeen Nabil, Youssef Emad, Nada Ayman, Taraggy Mohiy, Ashraf Abdel Raouf, "BLB (Brain/Lung cancer detection and segmentation and Breast Dense calculation)," First International Workshop on Deep and Representation Learning (IWDRL), Cairo, pp. 41-47, 2018.
- H. Chen, Q. Dou, X. Wang, J. Qin, P.-A. Heng, "Mitosis Detection in Breast Cancer Histology Images via Deep Cascaded Networks", AAAI, pp. 1160-1166, 2016.
- Ida Arvidsson, Niels Christian Overgaard, Felicia-Elena Marginean, Agnieszka Krzyzanowska, Anders Bjartell, Kalle Åstrom, Anders Heyden, "Generalization of prostate cancer classification for multiple sites using deep learning", IEEE 15th International Symposium on Biomedical Imaging, Washington, DC, pp. 191-194, 2018.
- S. Albarqouni, C. Baur, F. Achilles, V. Belagiannis, S. Demirci, N. Navab, "Aggnet: deep learning from crowds for mitosis detection in breast cancer histology images", IEEE transactions on Medical Imaging, Vol. 35, pp. 1313-1321, 2016. https://doi.org/10.1109/TMI.2016.2528120
- Sawon Pratiher, Shubhobrata Bhattacharya, Sabyasachi Mukhopadhyay, Nirmalya Ghosh, Gautham Pasupuleti, Prasanta K. Panigrahi, "Deep features using a convolutional neural network for early-stage cancer detection" Proceedings Volume 10679, Optics, Photonics, and Digital Technologies for Imaging Applications Vol. 1067902, 2018.
- I. Wichakam, P. Vateekul, "Combining deep convolutional networks and SVMs for mass detection on digital mammograms", 8th International Conference on Knowledge and Smart Technology (KST), IEEE, pp.239-244, (2016).
- J. Xu, L. Xiang, R. Hang, J. Wu, "Stacked Sparse Autoencoder (SSAE) based framework for nuclei patch classification on breast cancer histopathology", 2014 IEEE 11th International Symposium on Biomedical Imaging, IEEE, pp. 999-1002, 2014.
- T. Xia, A. Kumar, D. Feng and J. Kim, "Patchlevel Tumor Classification in Digital Histopathology Images with Domain Adapted Deep Learning," 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, pp. 644-647, 2018.
- M. Khoshdeli, A. Borowsky and B. Parvin, "Deep Learning Models Differentiate Tumor Grades from H&E Stained Histology Sections," 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Honolulu, HI, pp. 620-623, 2018.
- Y. Xu, Z. Jia, L. B. Wang, Y. Ai, F. Zhang, M. Lai, E. I. Chang, "Large scale tissue histopathology image classification, segmentation, and visualization via deep convolutional activation features," BMC Bioinformatics, Vol. 18, No. 1, p. 281, 2016.