Search | Korea Science

Korean continuous digit speech recognition by multilayer perceptron using KL transformation (KL 변환을 이용한 multilayer perceptron에 의한 한국어 연속 숫자음 인식)

박정선;권장우;권정상;이응혁;홍승홍
- Journal of the Korean Institute of Telematics and Electronics B / v.33B no.8 / pp.105-113 / 1996
This paper proposes a new Korean digit speech recognition technique using a multilayer perceptron (MLP). Despite its weakness in recognizing dynamic signals, the MLP was adopted for this model because Korean syllables can provide static features, and because it is simple in structure and fast to compute. The MLP's input vectors are transformed using the Karhunen-Loeve transformation (KLT), which compresses the signal successfully without losing its separability, although its physical properties are changed. Because the suggested technique extracts static features without being affected by changes in syllable length, it is effective for a Korean digit recognition system. Using the KLT saves computation time and memory without decreasing classification rates. The proposed feature extraction technique extracts features of the same size from the same two parts, the front and the end of a syllable. The frames from which features are extracted are formed using windows of a single size. The technique can be applied to continuous speech recognition, which has not been easy for conventional neural network recognition systems.

Face Recognition using the Feature Space and the Image Vector (세그멘테이션에 의한 특징공간과 영상벡터를 이용한 얼굴인식)

김선종
- Journal of Institute of Control, Robotics and Systems / v.5 no.7 / pp.821-826 / 1999
This paper proposes a face recognition method using feature spaces and image vectors in the image plane. We obtain the 2-D feature space using a self-organizing map that takes two inputs from the axes of the given image. The image vector consists of its weights and the average gray levels in the feature space. We can also reconstruct a normalized face from the image vector, independent of the size of the given face image. In the proposed method, each face is recognized by the best match of the feature spaces and by the maximum match of the normalized retrieved face images, respectively. To enhance recognition rates, our method combines the two recognition methods, one based on the feature spaces and one on the retrieved images. Simulations are conducted on the ORL (Olivetti Research Laboratory) images of 40 persons, each with 10 facial images, and the results show a 100% recognition rate and a 14.5% rejection rate for a 20$\times$20 feature size and a 24$\times$28 retrieval image size.

CNN Based 2D and 2.5D Face Recognition For Home Security System (홈보안 시스템을 위한 CNN 기반 2D와 2.5D 얼굴 인식)

MaYing, MaYing;Kim, Kang-Chul
- The Journal of the Korea institute of electronic communication sciences / v.14 no.6 / pp.1207-1214 / 2019
Technologies of the 4th industrial revolution have been seeping into our lives without our noticing. Many IoT-based home security systems use the convolutional neural network (CNN) as a biometric method to recognize faces and protect home and family from intruders, since the CNN has demonstrated excellent ability in image recognition. In this paper, three CNN layouts for 2D and 2.5D images from a small dataset are explored with various input image sizes and filter sizes. The simulation results show that, for a small dataset of 2.5D images, the CNN layout with a 50*50 2.5D input image, two convolution and max-pooling layers, and a 3*3 filter size is optimal for a home security system, with a recognition accuracy of 0.966. In addition, the longest CPU time consumed for one input image is 0.057 s. The proposed CNN layout for face recognition is suitable for controlling the actuators in a home security system, because such a system requires good face recognition and a short recognition time.
https://doi.org/10.13067/JKIECS.2019.14.6.1207
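The spatial sizes in a layout like the one described can be traced with the standard output-size formula, out = (in - k + 2p) / s + 1. The sketch below is only an illustration: the abstract gives the 50*50 input, the two convolution and max-pooling stages, and the 3*3 filters, while the absence of padding, the stride-1 convolutions, the 2*2 pooling, and the function names are all assumptions.

```python
def out_size(size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution or pooling layer (floor division)."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_layout(input_size=50, conv_kernel=3, stages=2, pool=2):
    """Return the feature-map side length after each layer.

    Assumed, not stated in the abstract: no padding, stride-1 convolutions,
    and pool x pool max pooling with stride equal to the pool size.
    """
    sizes = [input_size]
    for _ in range(stages):
        sizes.append(out_size(sizes[-1], conv_kernel))        # convolution
        sizes.append(out_size(sizes[-1], pool, stride=pool))  # max pooling
    return sizes


print(trace_layout())  # [50, 48, 24, 22, 11]
```

Under these assumptions the final feature map is 11*11 per channel; a padded convolution would give different sizes.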

Investigation for KSK 9403: 2004 Recognition and Mother's Preference of Female Children's Apparel (여자 아동복 구입시 어머니의 선호도 및 KSK 9403: 2004 호칭 치수 인지도 조사)

Koo, Hee-Kyung
- Journal of the Korea Fashion and Costume Design Association / v.9 no.3 / pp.87-97 / 2007
This study investigates mothers' recognition of KS sizes and their preferences regarding female children's apparel. The survey covered 150 mothers living in Seoul, selected at random with respect to age, number of daughters, education, and income level. For statistical analysis and evaluation of the survey data, frequencies, percentages, and contingency tables were used. The findings are as follows: 1. Mothers' preferences when purchasing girls' garments differ significantly according to subject characteristics such as age, number of daughters, education, and income level. 2. Mothers' recognition of the KSK 9403: 2004 sizing system for girls' garments does not differ significantly according to subject characteristics. Most mothers know only part of the KS size specifications because the KS sizing systems are complex. The KS sizing systems should therefore be simplified and respecified so that mothers can understand them easily when purchasing their girls' garments. In summary, this paper investigates mothers' preferences and their recognition of the KS sizing system for girls' garments.

Feature Extraction Method of 2D-DCT for Facial Expression Recognition (얼굴 표정인식을 위한 2D-DCT 특징추출 방법)

Kim, Dong-Ju;Lee, Sang-Heon;Sohn, Myoung-Kyu
- KIPS Transactions on Software and Data Engineering / v.3 no.3 / pp.135-138 / 2014
This paper devises a facial expression recognition method robust to overfitting using the 2D-DCT and EHMM algorithms. In particular, it achieves enhanced recognition performance by setting a large window size for 2D-DCT feature extraction and extracting the observation vectors of the EHMM. Experimental results on the CK and JAFFE facial expression databases showed that recognition accuracy improved as the window size increased. The proposed method also achieved a recognition accuracy of 87.79%, with recognition performance enhanced by 46.01% to 50.05% in comparison to previous approaches based on histogram features, when the CK database is used for training and the JAFFE database is used for testing.
https://doi.org/10.3745/KTSDE.2014.3.3.135
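As a rough illustration of window-based 2D-DCT feature extraction, the sketch below computes the orthonormal 2D DCT-II of one square window and keeps its low-frequency coefficients. Keeping the top-left coefficients as the observation vector is a common convention assumed here, not something the abstract states, and the function names and window size are hypothetical.

```python
import math


def dct2(block):
    """Orthonormal 2D DCT-II of a square block given as a list of rows."""
    n = len(block)

    def alpha(k):
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    coeffs = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            total = 0.0
            for x in range(n):
                for y in range(n):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            coeffs[u][v] = alpha(u) * alpha(v) * total
    return coeffs


def low_frequency_features(block, keep=2):
    """Hypothetical observation vector: top-left keep x keep DCT coefficients."""
    coeffs = dct2(block)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]


# A constant 4x4 window has all its energy in the DC coefficient.
flat = [[1.0] * 4 for _ in range(4)]
print(low_frequency_features(flat))
```

For the constant window the first coefficient is 4 and the rest are zero up to floating-point rounding. This direct quadruple loop is meant only for small windows.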

Face recognition rate comparison using Principal Component Analysis in Wavelet compression image (Wavelet 압축 영상에서 PCA를 이용한 얼굴 인식률 비교)

박장한;남궁재찬
- Journal of the Institute of Electronics Engineers of Korea CI / v.41 no.5 / pp.33-40 / 2004
In this paper, we construct a face database using wavelet compression and compare face recognition rates using the principal component analysis (PCA) algorithm. General face recognition methods construct a database and perform recognition using images of normalized size. The proposed method applies 1-step, 2-step, and 3-step wavelet compression to images of normalized size (92${\times}$112) and constructs the database from them. Input images are compressed by the wavelet transform, and face recognition experiments are performed with the PCA algorithm. The experiments show that the proposed method reduces the information in the existing face images and improves processing speed. The recognition rate of the proposed method was about 99.05% for the original image, 99.05% for 1 step, 98.93% for 2 steps, and 98.54% for 3 steps, showing that face recognition is possible with a large face database constructed in this way.
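To make the effect of multi-step compression on image size concrete, the sketch below repeatedly keeps a Haar-style approximation band by averaging 2x2 blocks. The abstract does not name the wavelet or the boundary handling, so the Haar choice, the edge replication for odd sizes, and the function names are assumptions.

```python
def haar_approximation(image):
    """One decomposition step: average each 2x2 block (scaled Haar LL band).

    Odd dimensions are handled by repeating the last row or column, which is
    an assumption; wavelet libraries offer several boundary modes.
    """
    rows, cols = len(image), len(image[0])
    out = []
    for r in range(0, rows, 2):
        r2 = min(r + 1, rows - 1)
        row = []
        for c in range(0, cols, 2):
            c2 = min(c + 1, cols - 1)
            row.append((image[r][c] + image[r][c2]
                        + image[r2][c] + image[r2][c2]) / 4.0)
        out.append(row)
    return out


def compress(image, steps):
    """Apply the approximation step the given number of times."""
    for _ in range(steps):
        image = haar_approximation(image)
    return image


face = [[128.0] * 92 for _ in range(112)]  # placeholder 92x112 image
for steps in range(4):
    small = compress(face, steps)
    print(steps, "step(s):", len(small[0]), "x", len(small))
```

Under these assumptions the sizes are 92x112, 46x56, 23x28, and 12x14, so three steps leave roughly 1/64 of the original pixels for PCA to process.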

The Effect of the Number of Clusters on Speech Recognition with Clustering by ART2/LBG

Lee, Chang-Young
- Phonetics and Speech Sciences / v.1 no.2 / pp.3-8 / 2009
In an effort to improve speech recognition, we investigated the effect of the number of clusters. In usual LBG clustering, the number of codebook clusters is doubled on each bifurcation and hence cannot be chosen arbitrarily in a natural way. To bring the number of clusters under our control, we combined adaptive resonance theory (ART2) with LBG and performed the clustering in two stages. The codebook thus formed was used in subsequent processing by fuzzy vector quantization (FVQ) and HMM for speech recognition tests. Compared to conventional LBG, our method was shown to reduce the best recognition error rate by 0${\sim}$0.9% depending on the vocabulary size. The results also showed that the optimal number of clusters would lie between 400 and 800, in the limits of small and large vocabulary isolated-word speech recognition, respectively.
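The doubling constraint can be checked directly. The sketch below lists the codebook sizes reachable by binary splitting, assuming the usual start from a single centroid; the function name is hypothetical and the code does not reproduce the paper's ART2/LBG procedure.

```python
def lbg_codebook_sizes(max_size):
    """Codebook sizes reachable by binary splitting from a single centroid."""
    sizes = []
    size = 1
    while size <= max_size:
        sizes.append(size)
        size *= 2  # each bifurcation splits every centroid in two
    return sizes


sizes = lbg_codebook_sizes(1024)
print(sizes)  # [1, 2, 4, 8, 16, 32, 64, 128, 256, 512, 1024]
print([n for n in (400, 800) if n not in sizes])  # [400, 800]
```

Neither 400 nor 800 is a power of two, so plain splitting cannot produce codebooks of those sizes.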

High Speed Character Recognition by Multiprocessor System (멀티 프로세서 시스템에 의한 고속 문자인식)

최동혁;류성원;최성남;김학수;이용균;박규태
- Journal of the Korean Institute of Telematics and Electronics B / v.30B no.2 / pp.8-18 / 1993
A multi-font, multi-size, high-speed character recognition system is designed. The design principles are simplicity of algorithm, adaptability, learnability, hierarchical data processing, and attention by feedback. For multi-size character recognition, the extracted character images are normalized. Features are extracted by applying the directional receptive field after directional edge filtering. A hierarchical classifier, consisting of two pre-classifiers and one decision-making classifier, classifies the feature vectors. The two pre-classifiers provide predictions to the final decision-making classifier, which reduces the time the final classifier needs to compute distances. The recognition rate is 95% for three documents printed in three kinds of fonts, 1,700 characters in total. For high-speed implementation, a multiprocessor system with a ring structure of four transputers is implemented, and a recognition speed of 30 characters per second is achieved.

Comparison of invariant pattern recognition algorithms (불변 패턴인식 알고리즘의 비교연구)

강대성
- Journal of the Korean Institute of Telematics and Electronics B / v.33B no.8 / pp.30-41 / 1996
This paper presents a comparative study of four pattern recognition algorithms that are invariant to translations, rotations, and scale changes of the input object: object shape features (OSF), the geometrical Fourier-Mellin transform (GFMT), moment invariants (MI), and the centered polar exponential transform (CPET). Pattern description, which serves to describe the object shape independently of translation, rotation, or size, is one of the most important aspects of pattern recognition. We first discuss problems that arise in the conventional invariant pattern recognition algorithms, then analyze their performance using the same criterion. Computer simulations with several distorted images show that the CPET algorithm yields better performance than the others.

A Comparative Study on OCR using Super-Resolution for Small Fonts

Cho, Wooyeong;Kwon, Juwon;Kwon, Soonchu;Yoo, Jisang
- International journal of advanced smart convergence / v.8 no.3 / pp.95-101 / 2019
Recently, there have been many issues related to text recognition using Tesseract. One of these is that text recognition accuracy is significantly lower for smaller fonts. Tesseract extracts text by creating a directed outline in the image. It then searches the Tesseract database and uses template matching against characters with similar feature points to select the character with the lowest error. Poor text extraction therefore lowers the recognition accuracy. In this paper, we compare text recognition accuracy after applying various super-resolution methods to smaller text images and examine how the recognition accuracy varies across image sizes. To recognize small Korean text images, we use super-resolution algorithms based on deep learning models such as SRCNN, ESRCNN, DSRCNN, and DCSCN. The dataset for training and testing consists of scanned Korean-language images. The images, set in a 12pt font, were resized by factors of 0.5 to 0.8. The experiment was performed on x0.5 resized images, and the results showed that DCSCN super-resolution is the most effective method, reducing the precision error rate by 7.8% and the recall error rate by 8.4%. These results demonstrate that the accuracy of text recognition for smaller Korean fonts can be improved by adding super-resolution methods to the OCR preprocessing module.
https://doi.org/10.7236/IJASC.2019.8.3.95

