Search | Korea Science

Text-independent Speaker Identification by Bagging VQ Classifier

Kyung, Youn-Jeong;Park, Bong-Dae;Lee, Hwang-Soo
- The Journal of the Acoustical Society of Korea / v.20 no.2E / pp.17-24 / 2001
In this paper, we propose a bootstrap aggregating (bagging) vector quantization (VQ) classifier to improve the performance of text-independent speaker recognition systems. The method generates multiple training data sets by resampling the original training data set, constructs a VQ classifier for each, and then integrates the multiple VQ classifiers into a single classifier by voting. Bagging has been shown to greatly improve the performance of unstable classifiers. Through two different experiments, this paper shows that the VQ classifier is unstable. In the first experiment, the bias and variance of a VQ classifier are computed on a waveform database, and the variance is compared with that of the classification and regression tree (CART) classifier[1]. The variance of the VQ classifier is shown to be as large as that of the CART classifier. The second experiment involves speaker recognition: the recognition rates vary significantly with minor changes in the training data set. Closed-set, text-independent speaker identification experiments are then performed on the TIMIT database to compare the bagging VQ classifier with the conventional VQ classifier. The bagging VQ classifier yields improved performance over the conventional VQ classifier, and it also outperforms the conventional classifier when the training data set is small.
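The resample, train, and vote procedure described in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names (`train_codebook`, `train_bagged`, `identify`), the plain k-means codebook training, the squared Euclidean distortion, and the synthetic two-speaker data are all assumptions made for the example.

```python
import random
from collections import Counter

def dist2(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train_codebook(vectors, k, rng, iters=10):
    """Plain k-means (Lloyd) codebook; an assumed stand-in for VQ training."""
    codebook = rng.sample(vectors, k)
    for _ in range(iters):
        cells = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda i: dist2(v, codebook[i]))
            cells[nearest].append(v)
        for i, cell in enumerate(cells):
            if cell:  # keep the old codeword if its cell is empty
                codebook[i] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

def distortion(frames, codebook):
    """Average distance from each frame to its nearest codeword."""
    return sum(min(dist2(f, c) for c in codebook) for f in frames) / len(frames)

def train_bagged(train, k, n_bags, seed=0):
    """train: {speaker: [vectors]}. Returns n_bags classifiers, each {speaker: codebook}."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_bags):
        model = {}
        for spk, vecs in train.items():
            boot = [rng.choice(vecs) for _ in vecs]  # bootstrap: resample with replacement
            model[spk] = train_codebook(boot, k, rng)
        models.append(model)
    return models

def identify(frames, models):
    """Each classifier votes for its minimum-distortion speaker; the majority wins."""
    votes = Counter(min(m, key=lambda s: distortion(frames, m[s])) for m in models)
    return votes.most_common(1)[0][0]

# Synthetic 2-D "features" for two hypothetical speakers, A and B.
rng = random.Random(1)
train = {
    "A": [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(60)],
    "B": [[rng.gauss(5, 1), rng.gauss(5, 1)] for _ in range(60)],
}
models = train_bagged(train, k=4, n_bags=5)
test_frames = [[rng.gauss(5, 1), rng.gauss(5, 1)] for _ in range(20)]
print(identify(test_frames, models))
```

On this well-separated toy data every bootstrap classifier agrees, so voting changes nothing; the abstract's claim is that voting helps when individual VQ classifiers are unstable, which would only show up with realistic speech features.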

The Avata Construction System for Image Lossless Scaling (이미지 손실없는 확대/축소가 가능한 아바타 생성 시스템)

김원중;장미화
- Journal of the Korea Institute of Information and Communication Engineering / v.6 no.2 / pp.181-189 / 2002
In this paper, we design and implement an avatar construction system using XML (Extensible Markup Language) and SVG (Scalable Vector Graphics). Web characters (avatars) created with the system are displayed identically, without image degradation, regardless of terminal type, and users can easily modify the image into the form they want. Compared with existing Web character systems, the reusability of Web character part elements is greatly increased. Because SVG is described as text, graphics are easy to search and applications can readily use SVG documents. SVG can also generate Web graphics documents dynamically from a database, because all graphic primitives such as lines, polygons, text, and images are easily accessible. Beyond Web characters, the results of this study may be developed into technology usable for other content on the World Wide Web.

A Multi-Class Classifier of Modified Convolution Neural Network by Dynamic Hyperplane of Support Vector Machine

Nur Suhailayani Suhaimi;Zalinda Othman;Mohd Ridzwan Yaakub
- International Journal of Computer Science & Network Security / v.23 no.11 / pp.21-31 / 2023
In this paper, we focus on the problem of evaluating multi-class classification accuracy and simulating multiple classifier performance metrics. Multi-class classification for sentiment analysis involves many challenges, and previous research has largely been limited to binary classification models, since they provide higher accuracy on text data. We therefore take inspiration from the non-linear Support Vector Machine and modify the algorithm by embedding dynamic hyperplanes that represent multiple class labels. We then analyze the performance of multi-class classifiers using macro-accuracy, micro-accuracy, and several other metrics to justify the significance of our algorithm enhancement. Furthermore, we hybridize an Enhanced Convolution Neural Network (ECNN) with a Dynamic Support Vector Machine (DSVM) to demonstrate the effectiveness and efficiency of the classifier on multi-class text data. We performed experiments on three hybrid classifiers: ECNN with Binary SVM (ECNN-BSVM), ECNN with linear Multi-Class SVM (ECNN-MCSVM), and our proposed algorithm (ECNN-DSVM). Comparative experiments on the hybrid algorithms yielded 85.12 % for single-metric accuracy and 86.95 % on average for multiple metrics. With our modified ECNN-DSVM classifier, we reached 98.29 % micro-accuracy with an F-score of up to 98 %. As a future direction of this research, we aim at hyperplane optimization analysis.
https://doi.org/10.22937/IJCSNS.2023.23.11.3

A Novel Video Image Text Detection Method

Zhou, Lin;Ping, Xijian;Gao, Haolin;Xu, Sen
- KSII Transactions on Internet and Information Systems (TIIS) / v.6 no.3 / pp.941-953 / 2012
A novel and universal method of video image text detection is proposed, implemented as a coarse-to-fine procedure. First, the spectral clustering (SC) method is adopted to coarsely detect text regions based on the stationary wavelet transform (SWT). To make full use of the available information, the SC method employs a multi-parameter kernel function that combines feature similarity information and spatial adjacency information. Second, 28-dimensional classification features are proposed, and a support vector machine (SVM) is used to separate text regions from non-text regions. Experimental results on video images show the encouraging performance of the proposed algorithm and classification features.
https://doi.org/10.3837/tiis.2012.03.010

A Novel Video Image Text Detection Method

Zhou, Lin;Ping, Xijian;Gao, Haolin;Xu, Sen
- KSII Transactions on Internet and Information Systems (TIIS) / v.6 no.4 / pp.1140-1152 / 2012
A novel and universal method of video image text detection is proposed, implemented as a coarse-to-fine procedure. First, the spectral clustering (SC) method is adopted to coarsely detect text regions based on the stationary wavelet transform (SWT). To make full use of the available information, the SC method employs a multi-parameter kernel function that combines feature similarity information and spatial adjacency information. Second, 28-dimensional classification features are proposed, and a support vector machine (SVM) is used to separate text regions from non-text regions. Experimental results on video images show the encouraging performance of the proposed algorithm and classification features.
https://doi.org/10.3837/tiis.2012.04.011

Text Extraction in HIS Color Space by Weighting Scheme

Le, Thi Khue Van;Lee, Gueesang
- Smart Media Journal / v.2 no.1 / pp.31-36 / 2013
Robust and efficient text extraction is very important for the accuracy of Optical Character Recognition (OCR) systems. Natural scene images with degradations such as uneven illumination, perspective distortion, complex backgrounds, and multicolored text pose many challenges for computer vision tasks, especially text extraction. In this paper, we propose a method for extracting text from signboard images that combines the mean shift algorithm with a weighting scheme for hue and saturation in the HSI color space for clustering. The number of clusters is determined automatically by mean shift-based density estimation, in which local clusters are estimated by repeatedly searching for higher-density points in the feature vector space. The weighting scheme for hue and saturation is used to formulate a new distance measure in cylindrical coordinates for text extraction. Experimental results on various natural scene images are presented to demonstrate the effectiveness of our approach.

Construction of an expression vector for a filamentous fungus Neurospora crassa

Lee, Bheong-Uk
- Proceedings of the Zoological Society Korea Conference / 1999.10b / pp.263.1-263 / 1999
No Abstract, See Full Text

Machine Printed and Handwritten Text Discrimination in Korean Document Images

Trieu, Son Tung;Lee, Guee Sang
- Smart Media Journal / v.5 no.3 / pp.30-34 / 2016
Nowadays there are many Korean documents whose text needs to be identified as either machine-printed or handwritten. Early identification methods use structural features, which are simple and easy to apply to text of a specific font, but their performance depends on the font type and characteristics of the text. Recently, the bag-of-words model has been used for this identification, as it can be invariant to changes in font size, distortions, or modifications to the text. A method based on the bag-of-words model includes three steps: word segmentation using connected component grouping, feature extraction, and finally classification using an SVM (Support Vector Machine). In this paper, a bag-of-words-based method using SURF (Speeded-Up Robust Features) is proposed for discriminating machine-printed from handwritten text in Korean documents. The experiments show that the proposed method outperforms methods based on structural features.

An Optimal Weighting Method in Supervised Learning of Linguistic Model for Text Classification

Mikawa, Kenta;Ishida, Takashi;Goto, Masayuki
- Industrial Engineering and Management Systems / v.11 no.1 / pp.87-93 / 2012
This paper discusses a new weighting method for text analysis from the viewpoint of supervised learning. The term frequency-inverse document frequency (tf-idf) measure is a well-known weighting method for information retrieval, and it can also be used for text analysis. However, it is an empirical weighting method for information retrieval whose effectiveness has not been established theoretically. Therefore, a more effective weighting measure may exist for document classification problems. In this study, we propose an optimal weighting method for document classification problems from the viewpoint of supervised learning. Because it makes use of the training data, the proposed measure is more suitable for text classification than the tf-idf measure. The effectiveness of our proposal is demonstrated by simulation experiments on text classification of newspaper articles and of customer reviews posted on a Web site.
https://doi.org/10.7232/iems.2012.11.1.087
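For reference, the conventional tf-idf baseline that the abstract above compares against can be sketched as below. The sketch uses the common textbook form, tf(t, d) * log(N / df(t)); the function name `tf_idf` and the toy documents are assumptions for illustration, and the paper's proposed supervised weighting is not reproduced here.

```python
import math
from collections import Counter

def tf_idf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({
            t: (count / len(doc)) * math.log(n / df[t])  # relative tf times idf
            for t, count in tf.items()
        })
    return weights

# Toy corpus (hypothetical): one news-like sentence and two review-like ones.
docs = [
    "the match ended in a draw".split(),
    "the review praised the camera".split(),
    "the camera review was short".split(),
]
w = tf_idf(docs)
for term in ("the", "match"):
    print(term, round(w[0][term], 4))
```

A term that occurs in every document, such as "the" here, receives weight zero, while a term confined to one document is weighted most heavily. Note that the weights depend only on term counts and ignore class labels.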

Inverted Index based Modified Version of K-Means Algorithm for Text Clustering

Jo, Tae-Ho
- Journal of Information Processing Systems / v.4 no.2 / pp.67-76 / 2008
This research proposes a new strategy for text clustering in which documents are encoded into string vectors and the k-means algorithm is modified to operate on string vectors. Traditionally, when the k-means algorithm is used for pattern classification, raw data must be encoded into numerical vectors. This encoding may be difficult, depending on the application area. For example, in text clustering, encoding full texts given as raw data into numerical vectors leads to two main problems: huge dimensionality and sparse distribution. In this research, we encode full texts into string vectors and modify the k-means algorithm to be adaptable to string vectors for text clustering.
https://doi.org/10.3745/JIPS.2008.4.2.067
