Browse > Article
http://dx.doi.org/10.9717/kmms.2022.25.5.661

Segmentation of the Glottis and Quantitative Measurement of the Vocal Cord Mucosal Morphology in the Laryngoscopic Image  

Lee, Seon Min (Dept. of Nursing, College of Nursing, Gachon University)
Oh, Seok (Dept. of Biomedical Engineering, Gil Medical Center, College of Medicine, Gachon University)
Kim, Young Jae (Dept. of Biomedical Engineering, Gil Medical Center, College of Medicine, Gachon University)
Woo, Joo Hyun (Dept. of Otorhinolaryngology-Head and Neck Surgery, Gil Medical Center, College of Medicine, Gachon University)
Kim, Kwang Gi (Dept. of Health Sciences & Technology, Gachon Advanced Institute for Health Sciences & Tecnology (GAIHST), Gachon University)
Publication Information
Abstract
The purpose of this study is to compare and analyze Deep Learning (DL) and Digital Image Processing (DIP) techniques using the results of the glottis segmentation of the two methods followed by the quantification of the asymmetric degree of the vocal cord mucosa. The data consists of 40 normal and abnormal images. The DL model is based on Deeplab V3 architecture, and the Canny edge detector algorithm and morphological operations are used for the DIP technique. According to the segmentation results, the average accuracy of the DL model and the DIP was 97.5% and 94.7% respectively. The quantification results showed high correlation coefficients for both the DL experiment (r=0.8512, p<0.0001) and the DIP experiment (r=0.7784, p<0.0001). In the conclusion, the DL model showed relatively higher segmentation accuracy than the DIP. In this paper, we propose the clinical applicability of this technique applying the segmentation and asymmetric quantification algorithm to the glottal area in the laryngoscopic images.
Keywords
Laryngoscopy; Vocal Cord; Segmentation; Quantitative measurement; Deep Learning; Digital Image Processing;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 S.H. Lee, "Diagnostic Role of Stroboscopy," Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics, Vol. 21, No. 1, pp. 13-16, 2010.
2 T.K. Kwon and M.S. Kim, "Current Standpoints on the Pathophysiology of Benign Vocal Fold Lesions," Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics, Vol. 26, No. 2, pp. 91-93, 2015.   DOI
3 S.G. Wang and H.J. Park, "Laryngeal Stroboscopy," Journal of Clinical Otolaryngology Head and Neck Surgery, Vol. 26, No. 1, pp. 3-9, 2015.   DOI
4 W.J. Cha, S.G. Wang, J.Y. Jang, G.H. Kim, and Y.W. Lee, "Post-Processing of High-Speed Video-Laryngoscopic Images to Two-Dimensional Scanning Digital Kymographic Images," Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics, Vol. 28, No. 2, pp. 89-95, 2017.   DOI
5 J.C. Lee and I.H. Bae, "Recent Advances in Examination of Vocal Fold Vibration," Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics, Vol. 32, No. 1, pp. 1-8, 2021.   DOI
6 S.H. Choi and C.H. Choi, "Qualitative Analysis of Vocal Fold Vibration in Normal Adults with a High-Speed Video System," Communication Sciences and Disorders, Vol. 17, No. 4, pp. 591-602, 2012.
7 B.S. Kim and I.H. Lee, "Retinal Blood Vessel Segmentation using Deep Learning," The Journal of Korean Institute of Information Technology, Vol. 17, No. 5, pp. 77-82, 2019.   DOI
8 G.Y. Kwon, J.H. Kim, Y.J. Kim, S.M. Lee, and K.G. Kim, "Detecting Boundary of Erythema Using Deep Learning," Journal of Korea Multimedia Society, Vol. 24, No. 11, pp. 1492-1499, 2021.   DOI
9 Y. Wang, S. Sun, J. Yu, and D. Yu, "Skin Lesion Segmentation using Atrous Convolution via DeepLab V3," arXiv Preprint, arXiv: 1807.08891, 2018.
10 K. Zuiderveld, "Contrast Limited Adaptive Histogram Equalization," Graphics Gems, pp. 474-485, 1994.
11 L.-C. Chen, G. Papandreou, F. Schroff, and H. Adam, "Rethinking Atrous Convolution for Semantic Image Segmentation," arXiv Preprint, arXiv:1706.05587, 2017.
12 K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
13 M. Kunduk, M. Doellinger, A.J. McWhorter, and J. Lohscheller, "Assessment of the Variability of Vocal Fold Dynamics within and between Recordings with High-Speed Imaging and by Phonovibrogram," The Laryngoscope, Vol. 120, No. 5, pp. 981-987, 2010.   DOI
14 D.P. Kingma and J. Ba, "Adam: A Method for Stochastic Optimization," arXiv Preprint, arXiv:1412.6980, 2014.
15 R.M. Haralick, S.R. Sternberg, and X. Zhuang, "Image Analysis using Mathematical Morphology," IEEE Transactions on Pattern Analysis and Machine Intelligence, No. 4, pp. 532-550, 1987.
16 R. Kohavi, "A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection," International Joint Conferences on Artificial Intelligence Organization, Vol. 14, No. 2, pp. 1137-1145, 1995.
17 S. Subramaniam, K. Jayanthi, C. Rajasekaran, and R. Kuchelar, "Deep Learning Architectures for Medical Image Segmentation," IEEE 33rd International Symposium on Computer- Based Medical Systems (CBMS), pp. 579-584, 2020.
18 Y.J. Kim and K.G. Kim, "Development of an Optimized Deep Learning Model for Medical Imaging," Journal of the Korean Society of Radiology, Vol. 81, No. 6, pp. 1274-1289, 2020.   DOI
19 J. Canny, "A Computational Approach to Edge Detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, No. 6, pp. 679-698, 1986.
20 Y.H. Cho, "An Efficient Lesion Segmentation by Using ROI Extraction and Combination of Features in Digital Skin Images," Journal of Korean Institute of Intelligent Systems, Vol. 30, No. 6, pp. 472-481, 2020.   DOI
21 J.P. Kim and Y.B. Cho, "Image Processing of Lesion in Iris Image Using OpenCV," Journal of Digital Contents Society, Vol. 19, No. 11, pp. 2035-2040, 2018.   DOI