Search | Korea Science

Utilization of Syllabic Nuclei Location in Korean Speech Segmentation into Phonemic Units (음절핵의 위치정보를 이용한 우리말의 음소경계 추출)

신옥근
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.5
- /
- pp.13-19
- /
- 2000
The blind segmentation method, which segments input speech data into recognition unit without any prior knowledge, plays an important role in continuous speech recognition system and corpus generation. As no prior knowledge is required, this method is rather simple to implement, but in general, it suffers from bad performance when compared to the knowledge-based segmentation method. In this paper, we introduce a method to improve the performance of a blind segmentation of Korean continuous speech by postprocessing the segment boundaries obtained from the blind segmentation. In the preprocessing stage, the candidate boundaries are extracted by a clustering technique based on the GLR(generalized likelihood ratio) distance measure. In the postprocessing stage, the final phoneme boundaries are selected from the candidates by utilizing a simple a priori knowledge on the syllabic structure of Korean, i.e., the maximum number of phonemes between any consecutive nuclei is limited. The experimental result was rather promising : the proposed method yields 25% reduction of insertion error rate compared that of the blind segmentation alone.
PDF

Effective Syllable Modeling for Korean Speech Recognition Using Continuous HMM (연속 은닉 마코프 모델을 이용한 한국어 음성 인식을 위한 효율적 음절 모델링)

김봉완;이용주
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.1
- /
- pp.23-27
- /
- 2003
Recently attempts to we the syllable as the recognition unit to enhance performance in continuous speech recognition hate been reported. However, syllables are worse in their trainability than phones and the former have a disadvantage in that contort-dependent modeling is difficult across the syllable boundary since the number of models is much larger for syllables than for phones. In this paper, we propose a method to enhance the trainability for the syllables in Korean and phoneme-context dependent syllable modeling across the syllable boundary. An experiment in which the proposed method is applied to word recognition shows average 46.23% error reduction in comparison with the common syllable modeling. The right phone dependent syllable model showed 16.7% error reduction compared with a triphone model.
PDF KSCI

A study of using quality for Radial Basis Function based score-level fusion in multimodal biometrics (RBF 기반 유사도 단계 융합 다중 생체 인식에서의 품질 활용 방안 연구)

Choi, Hyun-Soek;Shin, Mi-Young
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.45 no.5
- /
- pp.192-200
- /
- 2008
Multimodal biometrics is a method for personal authentication and verification using more than two types of biometrics data. RBF based score-level fusion uses pattern recognition algorithm for multimodal biometrics, seeking the optimal decision boundary to classify score feature vectors each of which consists of matching scores obtained from several unimodal biometrics system for each sample. In this case, all matching scores are assumed to have the same reliability. However, in recent research it is reported that the quality of input sample affects the result of biometrics. Currently the matching scores having low reliability caused by low quality of samples are not currently considered for pattern recognition modelling in multimodal biometrics. To solve this problem, in this paper, we proposed the RBF based score-level fusion approach which employs quality information of input biometrics data to adjust decision boundary. As a result the proposed method with Qualify information showed better recognition performance than both the unimodal biometrics and the usual RBF based score-level fusion without using quality information.
PDF KSCI

Filtering investigation for abstracting the road and seashore boundary using satellite images (위성영상으로 도로 및 해안 경계추출을 위한 필터링 기법 검토)

Choi Hyun;Kang In-Joon;Lee Byung-Gul
- Proceedings of the Korean Association of Geographic Inforamtion Studies Conference
- /
- 2006.05a
- /
- pp.73-77
- /
- 2006
제 3차 국가 GIS 기본 계획의 목적은 국가지리정보체계의 원활한 구축 및 지리정보 활용촉진으로 기존의 구가 GIS 구축의 양적 확산에서 질적 심화를 도모하고 있다. 고해상도 위성영상이 다양한 분야에서 활발하게 이용하게 됨에 따라 지형자료의 더 정확한 경계검출에 대한 필요성이 대두되고 있다. 위성영상을 이용한 도로 경계 검출은 교통정보시스템을 포함한 도로계획, 도시계획 등의 GIS 응용을 위한 필수 연구로 인식하고 있다. 본 연구는 IKONOS 영상에서 도로 경계 검출을 위한 고주파 와 저주파 필터링 비교분석에 관한 연구이다. 분석결과 저주파 필터링과 고주파 필터링은 입력영상의 경계부분에서 영상을 선택적으로 강조할 수 있었다. 저주파 필터링과 같은 영상강화 기법에서는 추출 가능한 경계부의 위치를 변화시키거나 영상의 화소값이 전체영상을 대상으로 변화시켜 비교적 도로 폭이 넓은 경우 효과적이었다. 고주파 필터링은 세부적인 영상정보를 선택적으로 강조할 수 있었다.
PDF

Edge Detection based on Non Local Means (비지역적 평균 기법을 이용한 경계 검출)

Kim, Han-Su;Choi, Myung-Ruyl
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.298-301
- /
- 2011
Edge detection is an base research task in the field of image processing. Edge detection can be regarded as a technique for locating pixels of abrupt gray-level change. So with Gradient method, it can be computed easily. But it can't satisfy human naked eye. so in this paper, new algorithm based on the NLM(Non Local Means) is proposed for good performance for human naked eye.
https://doi.org/10.3745/PKIPS.y2011m11a.298 인용 PDF

Korean Sentence Boundary Detection Using Memory-based Machine Learning (메모리 기반의 기계 학습을 이용한 한국어 문장 경계 인식)

Han Kun-Heui;Lim Heui-Seok
- The Journal of the Korea Contents Association
- /
- v.4 no.4
- /
- pp.133-139
- /
- 2004
This paper proposes a Korean sentence boundary detection system which employs k-nearest neighbor algorithm. We proposed three scoring functions to classify sentence boundary and performed comparative analysis. We uses domain independent linguistic features in order to make a general and robust system. The proposed system was trained and evaluated on the two kinds of corpus; ETRI corpus and KAIST corpus. As experimental results, the proposed system shows about $98.82\%$ precision and $99.09\%$ recall rate even though it was trained on relatively small corpus.
PDF

Face Recognition Using LDA and Weighted Vector (LDA와 가중치 벡터를 이용한 얼굴인식)

Jang, Kuyng-Shik
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- v.9 no.1
- /
- pp.1161-1164
- /
- 2005
본 논문에서는 얼굴 영상에서 눈동자와 입술을 효과적으로 인식하는 방법을 제안하였다. 색 정보를 기반으로 LDA를 이용하여 입술 영역을 찾았다. 눈동자와 흰자위로 구성되는 눈의 형태적인 특징과 눈동자와 눈썹 사이의 관계를 반영하는 평가함수를 정의하여 눈동자를 인식하였다. 입술에서의 밝기차이를 기반으로 가중치 벡터를 정의하여 위 입술과 아래 입술 사이의 경계선을 찾고 입술과 인접한 피부와의 밝기 차이를 이용하여 입술의 양 끝점 및 위와 아래의 끝점을 찾았다. 여러 영상에 대한 실험 결과 좋은 결과를 얻었다.
PDF

Learning Data Configuration by Edge Detection (경계선 검출에 의한 학습 데이터 구성)

Jae-Hyun Cho
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2024.01a
- /
- pp.413-414
- /
- 2024
영상 인식을 위한 학습 데이터 구성 단계에서 에지는 물체의 크기, 방향 등의 정보를 포함하고 있어 영상의 특징으로 사용한다. 본 논문에서는 얼굴 인식을 위하여 소벨 마스크를 사용하여 원영상과 압축영상 그리고 에지영상간의 학습에 따른 인식 정도를 파악하고자 한다. 실험결과, 원영상 그대로 인식하는 것보다 에지 영상에 의한 학습 속도에 차이가 있음을 알 수 있었다.
PDF

Building Identification for 3D Modeling of Urban Area (도심지 3D 모델링을 위한 동일건물 인식)

Sohn, Hong-Gyoo;Park, Jung-Hwan;Kim, Ho-Sung
- 한국공간정보시스템학회:학술대회논문집
- /
- 2005.05a
- /
- pp.453-457
- /
- 2005
3차원 지형공간정보체계에 대한 관심의 증가와 함께 도심지의 3차원 모델링에 관한 다양한 연구가 활발히 진행되고 있다. 단색영상을 용하여 영역기반정합이나 형상기반정합을 실시하던 기존의 3차원 모델링 기법은 오정합이 많이 발생할 수 있으며, 모델링에 소요되는 시간이 많이 걸리는 단점이 있다. 따라서 본 논문에서는 새로운 3D 모델링에 대한 접근법의 하나의 단계로서 컬러영상으로부터 경계정보와 색상정보를 활용하여 동일건물을 인식하는 방법에 대하여 연구를 수행하였다. 경계정보에 대해서는 보완된 Hausdorff 거리 개념을 사용하였으며, 색상정보에 대해서는 수정된 컬러 인덱싱 기법을 사용하였다 IKONOS영상을 사용하여 실험을 실시한 결과 두 가지 정보를 각각 단독으로 사용하는 경우 보다는 두 가지 정보를 조합하여 사용하는 경우 인식이 보다 효과적으로 이루어지는 것을 확인할 수 있었다.
PDF

Efficient Real-time Lane Detection Algorithm Using V-ROI (V-ROI를 이용한 고효율 실시간 차선 인식 알고리즘)

Dajun, Ding;Lee, Chanho
- Journal of IKEEE
- /
- v.16 no.4
- /
- pp.349-355
- /
- 2012
Information technology improves convenience, safety, and performance of automobiles. Recently, a lot of algorithms are studied to provide safety and environment information for driving, and lane detection algorithm is one of them. In this paper, we propose a lane detection algorithm that reduces the amount of calculation by reducing region of interest (ROI) after preprocessing. The proposed algorithm reduces the area of ROI a lot by determining the candidate regions near lane boundaries as V-ROI so that the amount of calculation is reduced. In addition, the amount of calculation can be maintained almost the same regardless of the resolutions of the input images by compressing the images since the lane detection algorithm does not require high resolution. The proposed algorithm is implemented using C++ and OpenCV library and is verified to work at 30 fps for realtime operation.
https://doi.org/10.7471/ikeee.2012.16.4.349 인용 PDF KSCI

Search Result 661, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)