• Title/Summary/Keyword: Recognition Improvement

Search Result 1,496, Processing Time 0.036 seconds

A Study on the Improvement of Utilization through Recognition of Virtual Training Content Operating Institutions (가상훈련 콘텐츠 운영기관 인식을 통한 활용도 제고방안 연구)

  • Miseok Yang;Chang Heon Oh
    • Journal of Practical Engineering Education
    • /
    • v.14 no.3
    • /
    • pp.479-489
    • /
    • 2022
  • In order to understand how to increase the use of virtual training content at K University's online lifelong education institute, this study examined the use experience, content recognition, field practice replacement, and requirements, focusing on the examples of operating institutions. To this end, 12 institutions that operated virtual training contents distributed by the K University Online Lifelong Education Center in 2020 were selected for in-depth interviews and qualitative analysis was conducted on the interviews of 11 institutions. As a result of the analysis, first, the experience of using the contents of the virtual training operating institution was aimed at changing the educational environment, supplementing theoretical learning, and improving the sense of practice. Second, according to a survey on the recognition of virtual training content, if the importance and utilization of the content are high, it can be replaced by on-site practice in non-face-to-face classes, such as experiences of facilities and equipment, attracting interest and attention. Third, in many cases, the perception of replacement for field practice is not unreasonable to use as a pre-training material for field practice, but it is difficult to replace field practice. Fourth, content quality improvements can be summarized as content quality improvement, content access and manipulation improvement, dedicated device development, training for instructors, and curriculum systematization. Fifth, institutional requirements include improving the quality of virtual training content itself, equipment support, curriculum systemization and characterization, systematic curriculum and detailed content sharing, detailed guidance on using virtual training content, introducing how to use content, and recruiting instructors. This study is meaningful in that it sought ways to improve the utilization of virtual training content based on the perception of virtual training content operating institutions.

Roadway recognition performance improvement for an autonomous vehicle using magnetic sensor (자기 센서 방식 자율 주행 차량의 경로 인식 성능 개선)

  • Kim, Myoung-Jun;Kim, Eui-Sun;Ryoo, Young-Jae;Lim, Young-Cheol
    • Journal of Sensor Science and Technology
    • /
    • v.12 no.5
    • /
    • pp.211-217
    • /
    • 2003
  • This paper is proposed that roadway recognition performance improvement for autonomous vehicle using magnetic markers that are embedded along the road center and the sensors mounted on a vehicle, and which changing of magnetic field that is measured along with vehicle driving. For Retrenchment of equipment cost, interval of markers is more expensive than existing method. In order to this, This paper is proposed that interval of markers is founded using magnetic field analysis, and which arrangement method of six magnetic sensors and control method of neural network. This paper is carried out magnetic field analysis, the acquiring of the training patterns, the training of the neural network and composition of steering control, and is verified that roadway recognition performance can improve using computer simulation with proposed methods.

Face Recognition System for Multimedia Application (멀티미디어 응용을 위한 얼굴 인식시스템)

  • Park, Sang-Gyou;Seong, Hyeon-Kyeong;Han, Young-Hwan
    • Journal of IKEEE
    • /
    • v.6 no.2 s.11
    • /
    • pp.152-160
    • /
    • 2002
  • This paper is the realization of the face recognition system for multimedia application. This system is focused on the design concerning the improvement of recognition rate and the reduction of processing time for face recognition. The non-modificated application of typical RGB color system enables the reduction of time required for color system transform. The neural network and the application of algorithm using face characteristic improves the recognition rate. After mosaicking an image, a face-color block has been selected through the color analysis of mosaic block. The characteristic of the face removes the mis-checked face-color candidate block. Finally, from the face color block, four special values are obtained. These values are processed to the neural network using the back propagation algorithm. The output values are the touchstone to decide the genuineness of face field. The realized system showed 90% of face recognition rate with less than 0.1 second of processing time. This result can be understood as sufficient processing time and recognition rate to find out the face block for multimedia application in dynamic image.

  • PDF

Gabor Descriptors Extraction in the SURF Feature Point for Improvement Accuracy in Face Recognition (얼굴 인식의 정확도 향상을 위한 SURF 특징점에서의 Gabor 기술어 추출)

  • Lee, Jae-Yong;Kim, Ji-Eun;Oh, Seoung-Jun
    • Journal of Broadcast Engineering
    • /
    • v.17 no.5
    • /
    • pp.808-816
    • /
    • 2012
  • Face recognition has been actively studied and developed in various fields. In recent years, interest point extraction algorithms mainly used for object recognition were being applied to face recognition. The SURF(Speeded Up Robust Features) algorithm was used in this paper which was one of typical interest point extraction algorithms. Generally, the interest points extracted from human faces are less distinctive than the interest points extracted from objects due to the similar shapes of human faces. Thus, the accuracy of the face recognition using SURF tends to be low. In order to improve it, we propose a face recognition algorithm which performs interest point extraction by SURF and the Gabor wavelet transform to extract descriptors from the interest points. In the result, the proposed method shows around 23% better recognition accuracy than SURF-based conventional methods.

Performance Improvement of Continuous Digits Speech Recognition Using the Transformed Successive State Splitting and Demi-syllable Pair (반음절쌍과 변형된 연쇄 상태 분할을 이용한 연속 숫자 음 인식의 성능 향상)

  • Seo Eun-Kyoung;Choi Gab-Keun;Kim Soon-Hyob;Lee Soo-Jeong
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.1
    • /
    • pp.23-32
    • /
    • 2006
  • This paper describes the optimization of a language model and an acoustic model to improve speech recognition using Korean unit digits. Since the model is composed of a finite state network (FSN) with a disyllable, recognition errors of the language model were reduced by analyzing the grammatical features of Korean unit digits. Acoustic models utilize a demisyllable pair to decrease recognition errors caused by inaccurate division of a phone or monosyllable due to short pronunciation time and articulation. We have used the K-means clustering algorithm with the transformed successive state splitting in the feature level for the efficient modelling of feature of the recognition unit. As a result of experiments, 10.5% recognition rate is raised in the case of the proposed language model. The demi-syllable fair with an acoustic model increased 12.5% recognition rate and 1.5% recognition rate is improved in transformed successive state splitting.

  • PDF

A Study on the Realization of Wireless Home Network System Using High-performance Speech Recognition in Variable Position (가변위치 고음성인식 기술을 이용한 무선 홈 네트워크 시스템 구현에 관한 연구)

  • Yoon, Jun-Chul;Choi, Sang-Bang;Park, Chan-Sub;Kim, Se-Yong;Kim, Ki-Man;Kang, Suk-Youb
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.4
    • /
    • pp.991-998
    • /
    • 2010
  • In realization of wireless home network system using speech recognition in indoor voice recognition environment, background noise and reverberation are two main causes of digression in voice recognition system. In this study, the home network system resistant to reverberation and background noise using voice section detection method based on spectral entropy in indoor recognition environment is to be realized. Spectral subtraction can reduce the effect of reverberation and remove noise independent from voice signal by eliminating signal distorted by reverberation in spectrum. For effective spectral subtraction, the correct separation of voice section and silent section should be accompanied and for this, improvement of performance needs to be done, applying to voice section detection method based on entropy. In this study, experimental and indoor environment testing is carried out to figure out command recognition rate in indoor recognition environment. The test result shows that command recognition rate improved in static environment and reverberant room condition, using voice section detection method based on spectral entropy.

Performance Improvement of Continuous Digits Speech Recognition using the Transformed Successive State Splitting and Demi-syllable pair (반음절쌍과 변형된 연쇄 상태 분할을 이용한 연속 숫자음 인식의 성능 향상)

  • Kim Dong-Ok;Park No-Jin
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.9 no.8
    • /
    • pp.1625-1631
    • /
    • 2005
  • This paper describes an optimization of a language model and an acoustic model that improve the ability of speech recognition with Korean nit digit. Recognition errors of the language model are decreasing by analysis of the grammatical feature of korean unit digits, and then is made up of fsn-node with a disyllable. Acoustic model make use of demi-syllable pair to decrease recognition errors by inaccuracy division of a phone, a syllable because of a monosyllable, a short pronunciation and an articulation. we have used the k-means clustering algorithm with the transformed successive state splining in feature level for the efficient modelling of the feature of recognition unit . As a result of experimentations, $10.5\%$ recognition rate is raised in the case of the proposed language model. The demi-syllable pair with an acoustic model increased $12.5\%$ recognition rate and $1.5\%$ recognition rate is improved in transformed successive state splitting.

Representative Batch Normalization for Scene Text Recognition

  • Sun, Yajie;Cao, Xiaoling;Sun, Yingying
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.7
    • /
    • pp.2390-2406
    • /
    • 2022
  • Scene text recognition has important application value and attracted the interest of plenty of researchers. At present, many methods have achieved good results, but most of the existing approaches attempt to improve the performance of scene text recognition from the image level. They have a good effect on reading regular scene texts. However, there are still many obstacles to recognizing text on low-quality images such as curved, occlusion, and blur. This exacerbates the difficulty of feature extraction because the image quality is uneven. In addition, the results of model testing are highly dependent on training data, so there is still room for improvement in scene text recognition methods. In this work, we present a natural scene text recognizer to improve the recognition performance from the feature level, which contains feature representation and feature enhancement. In terms of feature representation, we propose an efficient feature extractor combined with Representative Batch Normalization and ResNet. It reduces the dependence of the model on training data and improves the feature representation ability of different instances. In terms of feature enhancement, we use a feature enhancement network to expand the receptive field of feature maps, so that feature maps contain rich feature information. Enhanced feature representation capability helps to improve the recognition performance of the model. We conducted experiments on 7 benchmarks, which shows that this method is highly competitive in recognizing both regular and irregular texts. The method achieved top1 recognition accuracy on four benchmarks of IC03, IC13, IC15, and SVTP.

A study on recognition improvement of velopharyngeal insufficiency patient's speech using various types of deep neural network (심층신경망 구조에 따른 구개인두부전증 환자 음성 인식 향상 연구)

  • Kim, Min-seok;Jung, Jae-hee;Jung, Bo-kyung;Yoon, Ki-mu;Bae, Ara;Kim, Wooil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.6
    • /
    • pp.703-709
    • /
    • 2019
  • This paper proposes speech recognition systems employing Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) structures combined with Hidden Markov Moldel (HMM) to effectively recognize the speech of VeloPharyngeal Insufficiency (VPI) patients, and compares the recognition performance of the systems to the Gaussian Mixture Model (GMM-HMM) and fully-connected Deep Neural Network (DNNHMM) based speech recognition systems. In this paper, the initial model is trained using normal speakers' speech and simulated VPI speech is used for generating a prior model for speaker adaptation. For VPI speaker adaptation, selected layers are trained in the CNN-HMM based model, and dropout regulatory technique is applied in the LSTM-HMM based model, showing 3.68 % improvement in recognition accuracy. The experimental results demonstrate that the proposed LSTM-HMM-based speech recognition system is effective for VPI speech with small-sized speech data, compared to conventional GMM-HMM and fully-connected DNN-HMM system.

GMM based Speaker Identification using Pitch Information (피치 정보를 이용한 GMM 기반의 화자 식별)

  • Park Taesun;Hahn Minsoo
    • MALSORI
    • /
    • no.47
    • /
    • pp.121-129
    • /
    • 2003
  • This paper describes the use of pitch information for speaker identification. The recognition system is a GMM based one with 4 connected Korean digits speech database. The mean of the pitch period in voiced sections of speech are shown to be ,useful at discriminating between speakers. Utilizing this feature with Gaussian mixture model in the speaker identification system gave a marked improvement, maximum 6% improvement comparing to the baseline Gaussian mixture model.

  • PDF