통합 검색 | Korea Science

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- 음성과학
- /
- 제11권1호
- /
- pp.7-20
- /
- 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.
PDF

3D Nano Object Recognition based on Phase Measurement Technique

Kim, Dae-Suk;Baek, Byung-Joon;Kim, Young-Dong;Javidi, Bahram
- Journal of the Optical Society of Korea
- /
- 제11권3호
- /
- pp.108-112
- /
- 2007
Spectroscopic ellipsometry (SE) has become an important tool in scatterometry based nano-structure 3D profiling. In this paper, we propose a novel 3D nano object recognition method by use of phase sensitive scatterometry. We claims that only phase sensitive scatterometry can provide a reasonable 3D nano-object recognition capability since phase data gives much higher sensitive 3D information than amplitude data. To show the validity of this approach, first we generate various $0^{th}$ order SE spectrum data ($\psi$ and ${\Delta}$) which can be calculated through rigorous coupled-wave analysis (RCWA) algorithm and then we calculate correlation values between a reference spectrum and an object spectrum which is varied for several different object 3D shape.
https://doi.org/10.3807/JOSK.2007.11.3.108 인용 PDF KSCI

ASM기반 (2D)² 하이브리드 전처리 알고리즘을 이용한 얼굴인식 시스템 설계 (Design of ASM-based Face Recognition System Using (2D)² Hybird Preprocessing Algorithm)

김현기;진용탁;오성권
- 한국지능시스템학회논문지
- /
- 제24권2호
- /
- pp.173-178
- /
- 2014
본 연구에서는 ASM기반 $(2D)^2$ 하이브리드 전처리 알고리즘을 이용한 얼굴인식 분류기와 그것의 설계방법론을 소개한다. 얼굴인식을 위한 이미지는 외부 환경에 쉽게 영향을 받기 때문에, 전처리 단계로 이러한 문제를 해결하기 위해서 ASM을 사용하였다. 특히 사람 얼굴의 특징 추출을 목적으로 널리 이용되고 있다. ASM을 이용해 얼굴영역을 추출 한 뒤 PCA와 LDA를 이용한 $(2D)^2$ 하이브리드 전처리 알고리즘을 이용하여 차원을 축소한다. 전처리 알고리즘을 통한 얼굴데이터는 제안된 다항식 기반 방사형 기저함수 신경회로망의 입력으로 사용된다. 기존의 신경회로망과는 달리 제안된 지능형 패턴 분류기는 강인한 네트워크 특성을 가지며, 예측능력이 우수할 뿐만 아니라 다차원 입출력에 대한 문제도 해결했다. 분류기의 중요한 필수 설계 파라미터(행의 고유벡터의 수, 열의 고유벡터의 수, 클러스터의 수, 퍼지화 계수)는 ABC알고리즘에 의해 최적화 되어진다. 얼굴인식에 많이 사용되는 Yale과 AT&T를 사용하여 인식률을 평가하였다.
https://doi.org/10.5391/JKIIS.2014.24.2.173 인용 PDF KSCI

QP-DTW: Upgrading Dynamic Time Warping to Handle Quasi Periodic Time Series Alignment

Boulnemour, Imen;Boucheham, Bachir
- Journal of Information Processing Systems
- /
- 제14권4호
- /
- pp.851-876
- /
- 2018
Dynamic time warping (DTW) is the main algorithms for time series alignment. However, it is unsuitable for quasi-periodic time series. In the current situation, except the recently published the shape exchange algorithm (SEA) method and its derivatives, no other technique is able to handle alignment of this type of very complex time series. In this work, we propose a novel algorithm that combines the advantages of the SEA and the DTW methods. Our main contribution consists in the elevation of the DTW power of alignment from the lowest level (Class A, non-periodic time series) to the highest level (Class C, multiple-periods time series containing different number of periods each), according to the recent classification of time series alignment methods proposed by Boucheham (Int J Mach Learn Cybern, vol. 4, no. 5, pp. 537-550, 2013). The new method (quasi-periodic dynamic time warping [QP-DTW]) was compared to both SEA and DTW methods on electrocardiogram (ECG) time series, selected from the Massachusetts Institute of Technology - Beth Israel Hospital (MIT-BIH) public database and from the PTB Diagnostic ECG Database. Results show that the proposed algorithm is more effective than DTW and SEA in terms of alignment accuracy on both qualitative and quantitative levels. Therefore, QP-DTW would potentially be more suitable for many applications related to time series (e.g., data mining, pattern recognition, search/retrieval, motif discovery, classification, etc.).
https://doi.org/10.3745/JIPS.02.0090 인용 PDF KSCI

라즈베리파이와 OpenCV를 활용한 선형 검출 알고리즘 구현 (Implementation of Linear Detection Algorithm using Raspberry Pi and OpenCV)

이성진;최준형;최병윤
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2021년도 추계학술대회
- /
- pp.637-639
- /
- 2021
자율주행 연구가 활발히 진행되면서 ADAS(Advanced Driver Assistance System)에서 차량의 위치를 파악하고 경로를 유지하는데 차선 검출은 필수적인 기술이다. 차선 검출은 허프 변환과 RANSAC(Random Sample Consensus)과 같은 영상처리 알고리즘을 이용하여 검출한다. 본 논문은 라즈베리파이3 B+에 OpenCV를 이용하여 선형 도형 검출 알고리즘을 구현하고 있다. OpenCV 가우시안 블러 구조와 캐니 에지 검출을 통해 문턱값을 설정하였고, 선형 검출 알고리즘을 통한 차선 인식에 성공하였다.
PDF

교통표지판 인식을 위한 비젼시스템 (An Vision System for Traffic Sign Recognition)

남기환;배철수;박호식;박동희;한준희;나상동
- 한국정보통신학회:학술대회논문집
- /
- 한국해양정보통신학회 2003년도 추계종합학술대회
- /
- pp.645-648
- /
- 2003
본 논문에서는 영상처리를 이용하여 온라인으로 교통표지판을 인식하는 비젼 시스템을 제안한다. 제안된 시스템은 넓은 두 개의 카메라, 즉 광각렌즈(wide-angle lends)와 망원렌즈(telephoto lends)를 장착하였고, 이미지처리 보드가 있는 PC로 구성되었다. 이 시스템은 색상, 밝기, 형태 둥과 같은 정보를 이용하여 광각이미지의 교통표지판을 추출한 다음, 보다 큰 이미지에서 정확한 표지판 후보영역을 추출하기 위해 망원렌즈에서 포착된 이미지를 이용하여 처리하였다. 또한 간단한 실행과정으로 빠른 인식률을 얻을 수 있었으며, 도로상에서의 실험으로 시스템의 효용성을 입증하였다.
PDF

교통표지판 인식을 위한 비젼시스템 (An Vision System for Traffic sign Recognition)

남기환;배철수
- 한국정보통신학회논문지
- /
- 제8권2호
- /
- pp.471-476
- /
- 2004
본 논문에서는 영상처리를 이용하여 온라인으로 교통표지판을 인식하는 비젼 시스템을 제안한다. 제안된 시스템은 넓은 두 개의 카메라, 즉 광각렌즈(wide-angle lends)와 망원렌즈(telephoto lends)를 장착하였고, 이미지처리 보드가 있는 PC로 구성되었다. 이 시스템은 색상, 자기, 형태 등과 같은 정보를 이용하여 광각이미지의 교통표지판을 추출한 다음, 보다 큰 이미지에서 정확한 표지판 후보영역을 추출하기 위해 망원렌즈에서 포착된 이미지를 이용하여 처리하였다. 실험결과로써 수동으로 촬영한 비디오 연속장면에서 포착한 영상을 시용하여 실험한 결과 속도표지판은 추출율 96.5%, 인식률 34.4% 그리고 안내표지판은 추출율 100%, 인식률 40%의 결과를 나타내었다. 또한 간단한 실행과정으로 빠른 인식률을 얻을 수 있었으며, 도로상에서의 실험으로 시스템의 효용성을 입증하였다.
PDF KSCI

Dual-Encoded Features from Both Spatial and Curvelet Domains for Image Smoke Recognition

Yuan, Feiniu;Tang, Tiantian;Xia, Xue;Shi, Jinting;Li, Shuying
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제13권4호
- /
- pp.2078-2093
- /
- 2019
Visual smoke recognition is a challenging task due to large variations in shape, texture and color of smoke. To improve performance, we propose a novel smoke recognition method by combining dual-encoded features that are extracted from both spatial and Curvelet domains. A Curvelet transform is used to filter an image to generate fifty sub-images of Curvelet coefficients. Then we extract Local Binary Pattern (LBP) maps from these coefficient maps and aggregate histograms of these LBP maps to produce a histogram map. Afterwards, we encode the histogram map again to generate Dual-encoded Local Binary Patterns (Dual-LBP). Histograms of Dual-LBPs from Curvelet domain and Completed Local Binary Patterns (CLBP) from spatial domain are concatenated to form the feature for smoke recognition. Finally, we adopt Gaussian Kernel Optimization (GKO) algorithm to search the optimal kernel parameters of Support Vector Machine (SVM) for further improvement of classification accuracy. Experimental results demonstrate that our method can extract effective and reasonable features of smoke images, and achieve good classification accuracy.
https://doi.org/10.3837/tiis.2019.04.019 인용 PDF KSCI HTML

주문형 케이크 제작 자동화를 위한 영상 기반 식품 모양 인식 및 측위 (Vision-based Food Shape Recognition and Its Positioning for Automated Production of Custom Cakes)

오장섭;이재성
- 한국정보통신학회논문지
- /
- 제24권10호
- /
- pp.1280-1287
- /
- 2020
본 논문에서는 주문형 케이크의 자동화 제작을 위해 필요한 일련의 영상 기반 식품 모양 인식 및 정밀 측위 방법을 제안한다. 초소형 카메라 모듈을 푸드 아트 프린터 내부의 비스듬한 위치에 장착 후 원근 변환을 적용하여 탑 뷰(top veiw) 이미지로 전환 후 에지 검출, 직선 및 원형 허프 변환 등을 수행하도록 하여 식품의 모양을 인식하고 무게 중심 좌표를 검출하도록 하였다. 본 알고리즘을 케이크 및 떡 모형에 적용하여 테스트를 수행하였으며 그 결과 180~250 lux 범위의 조명 환경에서 98.75% 의 높은 객체 인식률을 얻을 수 있었으며 50~120 lux 범위에서 0.87% 이내의 중심점 측위 오차율을 얻을 수 있었다. 이는 수요처의 요구 사항을 충분히 만족하는 수치이며 실시간 처리 성능도 프레임당 0.5초 이내로 나타나 상용화 가능성을 충분히 갖춘 것으로 판단된다.
https://doi.org/10.6109/jkiice.2020.24.10.1280 인용 PDF KSCI

피부색과 무게중심 프로필을 이용한 손동작 인식 알고리즘 (Hand Motion Recognition Algorithm Using Skin Color and Center of Gravity Profile)

박영민
- 문화기술의 융합
- /
- 제7권2호
- /
- pp.411-417
- /
- 2021
인간과 컴퓨터의 상호작용을 연구하는 분야를 HCI(Human-computer interaction)라고 한다. 이 분야는 인간과 컴퓨터 간에 서로 소통하면서 정보를 인식하는 방법에 대해 연구하는 학문 분야이다. 본 연구는 사람과의 상호작용을 위한 손동작 인식에 대한 연구로써 기존 인식방법의 문제점을 살펴보고 인식률을 개선하기 위한 알고리즘을 제시한다. 사람의 손 모양이 포함된 영상을 대상으로 피부색 정보를 바탕으로 손 영역을 추출하고, 주성분 분석을 이용하여 무게중심 프로필을 계산한다. 이렇게 얻은 정보를 미리 정의된 형상들과 비교하여 손동작을 인식률을 높이는 방법을 제안하였다. 기존의 무게중심 프로필은 회전으로 인한 손의 변형에 대해 잘못된 손동작 인식을 결과를 보여주었으나, 본 연구에서는 무게중심 프로필을 이용하고 모든 윤곽선의 점들과 무게중심 사이의 거리가 가장 긴 점을 시작점으로 하여 무게중심 프로필을 다시 개선함으로써 강건한 알고리즘을 제시하였다. 손동작 인식을 위하여 센서가 부착된 장갑이나 특별한 마커를 사용하지 않으며, 별도의 청색 스크린을 설치하지도 않는다. 이 결과에 대해 가장 가까운 거리의 특징벡터를 찾아 잘못된 인식을 해결하고, 적당한 경계치를 구하여 성공과 실패를 구분한다.
https://doi.org/10.17703/JCCT.2021.7.2.411 인용 PDF KSCI

검색결과 233건 처리시간 0.026초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)