Search | Korea Science

Recognition Performance Improvement for Noisy-speech by Parallel Model Compensation Adaptation Using Frequency-variant added with ML (최대우도를 부가한 주파수 변이 PMC 방법의 잡음 음성 인식 성능개선)

Choi, Sook-Nam;Chung, Hyun-Yeol
- Journal of Korea Multimedia Society, v.16 no.8, pp.905-913, 2013
Frequency-variant parallel model compensation (FV-PMC) is a noise-robust speech recognition method that classifies the noise expected to be mixed with the input speech into several groups, using the average frequency variant as a threshold, and performs recognition according to the classified group. It performs very well on noisy speech that is classified successfully under the standard threshold. However, the average recognition rate drops for unclassified noisy speech, because such speech is recognized in combination with the noise-free model, as in conventional PMC. To solve this problem, this paper proposes an enhanced recognition method that uses maximum likelihood to refine the classification scale, so that the noise groups containing the input noisy speech are classified more finely and fewer inputs are left unclassified, which improves the recognition rate. Recognition experiments on the Aurora 2.0 database showed improved results compared with the previous FV-PMC method.
https://doi.org/10.9717/kmms.2013.16.8.905

Low Power Turbo Decoder Design Techniques Using Two Stopping Criteria (이중 정지 기준을 사용한 저 전력 터보 디코더 설계 기술)

임호영;강원경;신현철;김경호
- Journal of the Institute of Electronics Engineers of Korea SD, v.41 no.9, pp.39-48, 2004
Turbo codes, whose bit error rate performance is close to the Shannon limit, have been adopted as part of the standard for third-generation high-speed wireless data services. Iterative turbo decoding causes decoding delay and high power consumption. Because wireless communication systems have only a limited power supply, low-power design techniques are essential for mobile device implementation. This paper proposes new, effective criteria for stopping the iteration process in turbo decoding to reduce power consumption. By setting two stopping criteria, a decodable threshold and an undecodable threshold, we can effectively reduce the number of decoding iterations with only negligible degradation in error-correcting performance. Simulation results show that, compared with an existing typical method, the number of unsuccessful error corrections can be reduced by 89% and the number of decoding iterations by 29% on average over 12,500 simulations.
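The two-threshold stopping idea in this abstract can be illustrated with a minimal sketch. The abstract does not say which quantity is compared against the thresholds, so the per-iteration `reliabilities` values, the `patience` parameter, and the function name below are all hypothetical; this is not the authors' decoder.

```python
from typing import Sequence, Tuple


def stop_iteration(
    reliabilities: Sequence[float],
    decodable_threshold: float,
    undecodable_threshold: float,
    patience: int = 2,
) -> Tuple[int, str]:
    """Return (iterations used, reason for stopping).

    `reliabilities` is an assumed per-iteration confidence measure
    (for example a mean absolute soft output); it is a stand-in,
    not the metric used in the paper.
    """
    for iteration, value in enumerate(reliabilities, start=1):
        # Confident enough: the frame is treated as decoded, stop early.
        if value >= decodable_threshold:
            return iteration, "decodable"
        # After `patience` iterations, a very low value is treated as
        # hopeless, so no more power is spent on this frame.
        if iteration >= patience and value <= undecodable_threshold:
            return iteration, "undecodable"
    # Neither threshold fired: all available iterations were used.
    return len(reliabilities), "max_iterations"
```

Both early exits save iterations: the first on frames that converge quickly, the second on frames that are unlikely to converge at all.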

Algorithm of Converged Corner Detection-based Segmentation in the Data Matrix Barcode (코너 검출 기반의 융합형 Data Matrix 바코드 분할 알고리즘)

Han, Hee-June;Lee, Jong-Yun
- Journal of the Korea Convergence Society, v.6 no.1, pp.7-16, 2015
A segmentation process extracts the region of interest containing the barcode from an image and has a crucial impact on the performance of a barcode verifier. Previous segmentation methods have the following issues. First, it is very hard to determine a length threshold for the Hough line transform because the transform is sensitive to this setting. Second, the morphology transform delays processing because of the dilation and erosion operations performed during image extraction. Therefore, we propose a novel converged Harris corner detection-based segmentation method to detect the region of interest of a Data Matrix barcode. To evaluate the performance of the proposed method, we conduct experiments on a dataset of barcodes that vary in size and location within the image. As a result, our method solves the problems of delay, surrounding environments, and threshold setting, and extracts the barcode area from 100% of the test images.
https://doi.org/10.15207/JKCS.2015.6.1.007

Development of GIS System for Large Unit Life Information Guide in the Flash Environment (플래시 환경에서의 대단위 생활 정보 안내를 위한 지리정보시스템 개발)

Hwang, Kyung-Hee;Cho, Dong-Ju;Kim, Sung-Ho
- Proceedings of the Korea Contents Association Conference, 2007.11a, pp.89-92, 2007
This paper describes a GIS that enables users to look up a final destination on a map automatically by setting an arbitrary radius threshold around the user's current position. Information systems based on existing GIS have the disadvantage that searching is difficult when the destination location is not clear. They are also inefficient, because processing is delayed by complex configurations and large amounts of information aimed at supporting specialized business tasks. This paper therefore addresses these problems and proposes a customized geographic information system for general users. The customized system presents the detailed items of the search results, which are the destinations that satisfy a condition within the chosen radius threshold from the user's current position, so the user can easily look up a suitable final destination on the map. The results of this paper, which are based on a sample of large-unit locations, are expected to support more detailed, extensive, and varied information guidance.

A Simple Connection Pruning Algorithm and its Application to Simulated Random Signal Classification (연결자 제거를 위한 간단한 알고리즘과 모의 랜덤 신호 분류에의 응용)

Won, Yong-Gwan;Min, Byeong-Ui
- The Transactions of the Korea Information Processing Society, v.3 no.2, pp.381-389, 1996
A simple modification of the standard back-propagation algorithm to eliminate redundant connections (weights and biases) is described. It was motivated by speculations from the distribution of the magnitudes of the weights and biases, analysis of the classification boundary, and the nonlinearity of the sigmoid function. After initial training, the algorithm eliminates all connections whose magnitude is below a threshold by setting them to zero. The algorithm then retrains the network, adjusting all weights and biases so that important ones can recover. In studies with Boolean functions, the algorithm reconstructed the theoretical minimum architecture and eliminated the connections that are not necessary to solve the functions. For simulated random signal classification problems, the algorithm produced results consistent with the idea that easier problems require simpler networks and yield lower misclassification rates. Furthermore, our algorithm generalized better than the standard algorithm by reducing overfitting and pattern memorization.
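The elimination step described in this abstract (zeroing every connection whose magnitude falls below a threshold) can be sketched as follows. The function name, the nested-list weight layout, and the threshold value are assumptions for illustration; the initial training and the retraining pass are outside the sketch.

```python
from typing import List


def prune_by_magnitude(weights: List[List[float]], threshold: float) -> List[List[float]]:
    """Set every connection with |w| < threshold to zero.

    `weights` is a hypothetical layer matrix stored as nested lists.
    Connections at or above the threshold are left unchanged.
    """
    return [
        [0.0 if abs(w) < threshold else w for w in row]
        for row in weights
    ]


# Example: two small connections are eliminated, two large ones survive.
layer = [[0.8, -0.05], [0.02, -1.2]]
pruned = prune_by_magnitude(layer, threshold=0.1)
```

According to the abstract, retraining then adjusts all weights and biases, so a zeroed connection that turns out to be important can recover.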

A Study on Improving English Pronunciation and Intonation utilizing Fluency Improvement system (음성인식 학습 시스템활용 영어 발음 및 억양 개선방안에 관한 연구)

Yi, Jae-Il;Kim, Young-Kwon;Kim, Gui-Jung
- Journal of the Korea Convergence Society, v.8 no.11, pp.1-6, 2017
This paper focuses on the development of a system that improves the convenience of foreign language learning and enhances learners' ability in the target language through the use of IT devices. In addition to basic grammar, pronunciation and intonation have a crucial effect on everyday communication. English pronunciation and intonation vary with the basic characteristics of the speaker's native language, and these differences often cause communication problems. The proposed system judges whether an utterance is acceptable for English communication and requests correction in real time. It minimizes system intervention by collecting various voice signals from foreign language learners and setting threshold points at which an utterance can be considered acceptable. As a result, learners can increase their learning efficiency with minimal interruption of their utterances by unnecessary system intervention.
https://doi.org/10.15207/JKCS.2017.8.11.001

Binary Mask Estimation using Training-based SNR Estimation for Improving Speech Intelligibility (음성 명료도 향상을 위한 학습 기반의 신호 대 잡음 비 추정을 이용한 이산 마스크 추정 방법)

Kim, Gibak
- Journal of Broadcast Engineering, v.17 no.6, pp.1061-1068, 2012
This paper deals with a noise reduction algorithm that uses the binary masking approach in the time-frequency domain to improve speech intelligibility. In the binary masking approach, the noise-corrupted speech is decomposed into time-frequency units. Noise-dominant time-frequency units are removed by setting the corresponding binary mask values to 0, and target-dominant units are retained untouched by assigning a mask value of 1. We propose a binary mask estimation method that compares the local signal-to-noise ratio (SNR) to a threshold. The local SNR is estimated by a training-based approach. An optimal threshold is proposed, obtained by observing the distribution of the training database. The proposed method is evaluated with normal-hearing subjects, and the intelligibility scores are computed by counting the number of words correctly recognized.
https://doi.org/10.5909/JBE.2012.17.6.1061
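The masking rule in this abstract (compare the local SNR of each time-frequency unit to a threshold, then keep or zero the unit) can be sketched as below. The local SNR values are taken as given here, whereas the paper estimates them with a training-based approach; the function names, the dB scale, and the threshold value are assumptions.

```python
from typing import List

Grid = List[List[float]]  # rows: time frames, columns: frequency bands


def binary_mask(local_snr_db: Grid, threshold_db: float) -> List[List[int]]:
    """Mask is 1 where the local SNR exceeds the threshold, else 0."""
    return [[1 if snr > threshold_db else 0 for snr in frame] for frame in local_snr_db]


def apply_mask(noisy_units: Grid, mask: List[List[int]]) -> Grid:
    """Keep target-dominant units untouched and zero noise-dominant ones."""
    return [
        [unit if keep else 0.0 for unit, keep in zip(frame, mask_frame)]
        for frame, mask_frame in zip(noisy_units, mask)
    ]


# Hypothetical 2x2 example with a threshold of -5 dB.
mask = binary_mask([[5.0, -10.0], [-3.0, 0.0]], threshold_db=-5.0)
enhanced = apply_mask([[2.0, 4.0], [6.0, 8.0]], mask)
```

Only the unit whose local SNR falls below the threshold is removed; the others pass through unchanged.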

Development of Radius Search System based on Raster Map in the Flash Environment (플래시 환경에서 래스터 지도를 기반으로 한 반경 검색 시스템 개발)

Kim, Sung-Ho
- The Journal of the Korea Contents Association, v.8 no.4, pp.39-47, 2008
This paper describes a life GIS (Geographic Information System) that enables users to look up a final destination on a map automatically by setting an arbitrary radius threshold around the user's current position. Information systems based on existing GIS have the disadvantage that searching is difficult when the destination location is not clear. They are also inefficient, because processing is delayed by complex configurations and large amounts of information aimed at supporting specialized business tasks. This paper therefore addresses these problems and proposes a customized life GIS for general users in the Flash (ActionScript) environment. The customized life GIS presents the detailed items of the search results, which are the destinations that satisfy a condition within the chosen radius threshold from the user's current position, so the user can easily look up a suitable final destination on the map. The results of this paper, which are based on a sample of large-unit locations, are expected to support more detailed, extensive, and varied information guidance.
https://doi.org/10.5392/JKCA.2008.8.4.039
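A radius search of the kind this abstract describes can be sketched as a distance filter around the user's position. The paper's system runs in Flash (ActionScript); the Python below is only an illustration, and the `Place` record, the pixel-style raster coordinates, and the nearest-first ordering are assumptions.

```python
import math
from typing import List, NamedTuple


class Place(NamedTuple):
    name: str
    x: float  # assumed raster-map coordinates (for example pixels)
    y: float


def search_within_radius(places: List[Place], user_x: float, user_y: float,
                         radius: float) -> List[Place]:
    """Return places whose distance from the user is <= radius, nearest first."""
    hits = []
    for place in places:
        distance = math.hypot(place.x - user_x, place.y - user_y)
        if distance <= radius:
            hits.append((distance, place))
    hits.sort(key=lambda pair: pair[0])
    return [place for _, place in hits]


# Hypothetical data: only "shop" and "cafe" lie within 5 units of the origin.
sample = [Place("cafe", 3, 4), Place("bank", 10, 0), Place("shop", 1, 1)]
nearby = search_within_radius(sample, 0, 0, radius=5)
```

Euclidean distance suits a flat raster map; a system working in latitude and longitude would need a geodesic distance instead.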

An Occupant Sensing System Using Single Video Camera and Ultrasonic Sensor for Advanced Airbag (단일 비디오 카메라와 초음파센서를 이용한 스마트 에어백용 승객 감지 시스템)

Bae, Tae-Wuk;Lee, Jong-Won;Ha, Su-Young;Kim, Young-Choon;Ahn, Sang-Ho;Sohng, Kyu-Ik
- Journal of Korea Multimedia Society, v.13 no.1, pp.66-75, 2010
We propose an occupant sensing system using a single video camera and an ultrasonic sensor for advanced airbags. To detect the occupant's form and face position in real time, we use skin color and motion information. We build a candidate face block image using a threshold on the color difference signal corresponding to skin color, together with the difference between the current and previous images of the luminance signal to obtain motion information. The face is then detected by morphology and labeling. At night, when color and luminance information is unavailable, the face is detected using a threshold on the luminance signal obtained with an infra-red LED instead of the color difference signal. To evaluate the performance of the proposed occupant detection system, we performed various experiments with the IEEE camera, ultrasonic sensor, and infra-red LED installed in a vehicle jig.

Apply Locally Weight Parameter Elimination for CNN Model Compression (지역적 가중치 파라미터 제거를 적용한 CNN 모델 압축)

Lim, Su-chang;Kim, Do-yeon
- Journal of the Korea Institute of Information and Communication Engineering, v.22 no.9, pp.1165-1171, 2018
A CNN requires a large amount of computation and memory to extract the features of an object. In addition, it is trained on the network that the user has configured, and because the structure of the network is fixed, it cannot be modified during training and is difficult to use on a mobile device with low computing power. To solve these problems, we apply a pruning method to the pre-trained weight file to reduce computation and memory requirements. The method consists of three steps. First, all the weights of the pre-trained network file are retrieved for each layer. Second, the absolute values of the weights in each layer are taken and averaged; the average is set as the threshold, and weights below the threshold are removed. Finally, the network file to which the pruning method has been applied is retrained. We experimented with LeNet-5 and AlexNet and achieved 31x on LeNet-5 and 12x on AlexNet.
https://doi.org/10.6109/jkiice.2018.22.9.1165
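The per-layer thresholding step in this abstract can be sketched as follows: the mean of the absolute weights of a layer becomes that layer's threshold, and weights below it are zeroed. The flat-list layer layout, the dictionary of layers, and the function names are assumptions; loading the pre-trained file and retraining are not shown.

```python
from typing import Dict, List


def prune_layer_by_mean_abs(weights: List[float]) -> List[float]:
    """Zero every weight whose magnitude is below the layer's mean |w|."""
    if not weights:
        return []
    threshold = sum(abs(w) for w in weights) / len(weights)
    return [w if abs(w) >= threshold else 0.0 for w in weights]


def prune_network(layers: Dict[str, List[float]]) -> Dict[str, List[float]]:
    """Apply the per-layer rule independently to each (hypothetical) layer."""
    return {name: prune_layer_by_mean_abs(w) for name, w in layers.items()}


# Hypothetical layer: mean |w| is 0.4, so the two 0.1-magnitude weights go.
pruned_net = prune_network({"conv1": [0.1, -0.5, 0.9, -0.1]})
```

Because each layer gets its own threshold, a layer with uniformly small weights is not wiped out by a global cutoff tuned to a layer with large ones.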
