Browse > Article
http://dx.doi.org/10.9708/jksci.2022.27.10.059

Gender Classification of Speakers Using SVM  

Han, Sun-Hee (Dept. of Computer Science, Inha Technical College)
Cho, Kyu-Cheol (Dept. of Computer Science, Inha Technical College)
Abstract
This research conducted a study classifying gender of speakers by analyzing feature vectors extracted from the voice data. The study provides convenience in automatically recognizing gender of customers without manual classification process when they request any service via voice such as phone call. Furthermore, it is significant that this study can analyze frequently requested services for each gender after gender classification using a learning model and offer customized recommendation services according to the analysis. Based on the voice data of males and females excluding blank spaces, the study extracts feature vectors from each data using MFCC(Mel Frequency Cepstral Coefficient) and utilizes SVM(Support Vector Machine) models to conduct machine learning. As a result of gender classification of voice data using a learning model, the gender recognition rate was 94%.
Keywords
Feature vectors; Voice; Classification; Mel frequency cepstral coefficient; Support vector machine;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Eunsuk Kim, Kyungwook Shin, "A design of FFT processor for EEG signal analysis", Journal of the Korea Institute of Information and Communication Engineering, Vol 14, No. 11, pp. 2548-2554, October 2010. DOI: 10.6109/jkiice.2010.11.30   DOI
2 Hyunchul Ahn, Kyoungjae Kim, Ingoo Han, "Purchase Prediction Model using the Support Vector Machine", Journal of Intelligence and Information Systems Society, Vol. 11, No. 3, pp. 69-81, December 2005.
3 Jiwon Park, Chanuk Yeom, Keunchang Kwak, "Comparison of building heating and cooling load prediction performance using Gaussian kernel regression model and SVM regression model", The Korean Institute of Electrical Engineers, pp. 162-164, July 2021.
4 Byeong-Goo Jeong, Jae-Seung Choi, "Comparison of Characteristic Vector of Speech for Gender Recognition of Male and Female", The Korea Institute of Information and Communication Engineering, Vol. 16, No. 7, pp. 1370-1376, July 2012.   DOI
5 Yoontae Oh, "Comparative experiment of Logistic Regression and Machine Learning Performance of Support Vector Machines", Korea's Master's degree thesis: Kookmin University's Graduate School, August 2019.
6 Mijin Kim, Chaewon Yoo, Huijin Park, Soobin Ou, Jongwoo Lee, " Implementation of Voice Recognizing KIOSK Application for the Visually Impaired", KIISE Transactions on Computing Practices, Vol. 26, No. 7, pp. 332-337, July 2020. DOI: 10.5626/KTCP.2020.26.7.332   DOI
7 Hyunsoo Bae, Hojin Lee, Sukgyu Lee, "Voice Recognition-Based on Adaptive MFCC and Deep Learning for Embedded Systems", Journal of Institute of Control, Robotics and Systems, Vol. 22, No. 10, pp. 797-802, October 2016. DOI: 10.5302/J.ICROS.2016.16.0136   DOI
8 Junryul Park, Bonwoo Koo, Jaehyung Jung, Taein Heo, Miyoung Lee, Sungwook Baek, "Food Delivery Service Customer'sSpeech Gender Identification Using SVM", Communications of the Korean Institute of Information Scientists and Engineers, pp. 1979-1981, June, 2016.
9 Kwanshik Shim, Haekon Nam, "A Fast Parameter Estimation of Time Series Data Using Discrete Fourier Transform", THE TRANSACTION OF THE KOREAN INSTITUTE OF ELECTRICAL ENGINEERS A, Vol. 55A, No. 7, pp. 265-272, July 2006.
10 Seungdo Jeong, "Speaker Identification Using Dynamic Time Warping Algorithm", The Korea Academia-Industrial Cooperation Society, Vol. 12, No. 5, pp. 2402-2409, May 2011. DOI: 10.5762/kais.2011.12.5.2402   DOI
11 Hyunggeun Lee, Yongmin Hong, Sungwoo Kang, "Identifying Process Capability Index for Electricity Distribution System through Thermal Image Analysis", Journal of Korean Society for Quality Management, Vol. 49, No. 3, pp. 327-340, September 2021. DOI: 10.7496/JKSQM.2021.49.3.327   DOI
12 Hyunjin Hwang, Kyungjin Min, Jongmin Moon, Sangyeob Lee, Dongjun Kim, Kyeongsup Kim, Jeongwhan Lee, "A Study on Classification of Vocal Sound Based on Mel Frequency Cepstral", Theory.INFORMATION AND CONTROL SYMPOSIUM, pp. 346-347, October 2020.
13 Soyeon Min, "The Flattening Algorithm of Speech Spectrum by Quadrature Mirror Filter", Korea Academy Industrial Cooperation Society, Vol. 7, No. 5, pp. 907-912, October 2006.
14 Byungoh Yoo, Joonhyung Park, Yongbae Park, Suyoung Jung, Kwangsoo Lee, "Assessment of the Distributional Probability for Evergreen Broad-Leaved Forests(EBLFs) Using a Logistic Regression Model", Journal of the Korean Association of Geographic Information Studies, Vol. 19, No. 1, pp. 94-105, 2016. DOI: 10.11108/kagis.2016.19.1.094   DOI
15 Taegyun Im, Keunsung Bae, Chansik Hwang, Hyungwook Lee, "Classification of Underwater Transient Signals Using MFCC Feature Vector", The Korean Institute of Commucations and Information Sciences, Vol. 32, No. 8, pp. 675-680, 2007.
16 Jiwoo Choi, Sangil Choi, Taewon Kang, " Identification of Gait Patterns using Convolutional Neural Networks for Personal Authentication", The Journal of Korean Institute of Information Technology, Vol. 20, No. 4, pp. 13-23, April 2022. DOI: 10.14801/jkiit.2022.20.4.13   DOI