Search | Korea Science

Harmonics-based Spectral Subtraction and Feature Vector Normalization for Robust Speech Recognition

Beh, Joung-Hoon;Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
- Speech Sciences
- /
- v.11 no.1
- /
- pp.7-20
- /
- 2004
In this paper, we propose a two-step noise compensation algorithm in feature extraction for achieving robust speech recognition. The proposed method frees us from requiring a priori information on noisy environments and is simple to implement. First, in frequency domain, the Harmonics-based Spectral Subtraction (HSS) is applied so that it reduces the additive background noise and makes the shape of harmonics in speech spectrum more pronounced. We then apply a judiciously weighted variance Feature Vector Normalization (FVN) to compensate for both the channel distortion and additive noise. The weighted variance FVN compensates for the variance mismatch in both the speech and the non-speech regions respectively. Representative performance evaluation using Aurora 2 database shows that the proposed method yields 27.18% relative improvement in accuracy under a multi-noise training task and 57.94% relative improvement under a clean training task.
PDF

3D Nano Object Recognition based on Phase Measurement Technique

Kim, Dae-Suk;Baek, Byung-Joon;Kim, Young-Dong;Javidi, Bahram
- Journal of the Optical Society of Korea
- /
- v.11 no.3
- /
- pp.108-112
- /
- 2007
Spectroscopic ellipsometry (SE) has become an important tool in scatterometry based nano-structure 3D profiling. In this paper, we propose a novel 3D nano object recognition method by use of phase sensitive scatterometry. We claims that only phase sensitive scatterometry can provide a reasonable 3D nano-object recognition capability since phase data gives much higher sensitive 3D information than amplitude data. To show the validity of this approach, first we generate various $0^{th}$ order SE spectrum data ($\psi$ and ${\Delta}$) which can be calculated through rigorous coupled-wave analysis (RCWA) algorithm and then we calculate correlation values between a reference spectrum and an object spectrum which is varied for several different object 3D shape.
https://doi.org/10.3807/JOSK.2007.11.3.108 인용 PDF KSCI

Design of ASM-based Face Recognition System Using (2D)² Hybird Preprocessing Algorithm (ASM기반 (2D)² 하이브리드 전처리 알고리즘을 이용한 얼굴인식 시스템 설계)

Kim, Hyun-Ki;Jin, Yong-Tak;Oh, Sung-Kwun
- Journal of the Korean Institute of Intelligent Systems
- /
- v.24 no.2
- /
- pp.173-178
- /
- 2014
In this study, we introduce ASM-based face recognition classifier and its design methodology with the aid of 2-dimensional 2-directional hybird preprocessing algorithm. Since the image of face recognition is easily affected by external environments, ASM(active shape model) as image preprocessing algorithm is used to resolve such problem. In particular, ASM is used widely for the purpose of feature extraction for human face. After extracting face image area by using ASM, the dimensionality of the extracted face image data is reduced by using $(2D)^2$hybrid preprocessing algorithm based on LDA and PCA. Face image data through preprocessing algorithm is used as input data for the design of the proposed polynomials based radial basis function neural network. Unlike as the case in existing neural networks, the proposed pattern classifier has the characteristics of a robust neural network and it is also superior from the view point of predictive ability as well as ability to resolve the problem of multi-dimensionality. The essential design parameters (the number of row eigenvectors, column eigenvectors, and clusters, and fuzzification coefficient) of the classifier are optimized by means of ABC(artificial bee colony) algorithm. The performance of the proposed classifier is quantified through yale and AT&T dataset widely used in the face recognition.
https://doi.org/10.5391/JKIIS.2014.24.2.173 인용 PDF KSCI

QP-DTW: Upgrading Dynamic Time Warping to Handle Quasi Periodic Time Series Alignment

Boulnemour, Imen;Boucheham, Bachir
- Journal of Information Processing Systems
- /
- v.14 no.4
- /
- pp.851-876
- /
- 2018
Dynamic time warping (DTW) is the main algorithms for time series alignment. However, it is unsuitable for quasi-periodic time series. In the current situation, except the recently published the shape exchange algorithm (SEA) method and its derivatives, no other technique is able to handle alignment of this type of very complex time series. In this work, we propose a novel algorithm that combines the advantages of the SEA and the DTW methods. Our main contribution consists in the elevation of the DTW power of alignment from the lowest level (Class A, non-periodic time series) to the highest level (Class C, multiple-periods time series containing different number of periods each), according to the recent classification of time series alignment methods proposed by Boucheham (Int J Mach Learn Cybern, vol. 4, no. 5, pp. 537-550, 2013). The new method (quasi-periodic dynamic time warping [QP-DTW]) was compared to both SEA and DTW methods on electrocardiogram (ECG) time series, selected from the Massachusetts Institute of Technology - Beth Israel Hospital (MIT-BIH) public database and from the PTB Diagnostic ECG Database. Results show that the proposed algorithm is more effective than DTW and SEA in terms of alignment accuracy on both qualitative and quantitative levels. Therefore, QP-DTW would potentially be more suitable for many applications related to time series (e.g., data mining, pattern recognition, search/retrieval, motif discovery, classification, etc.).
https://doi.org/10.3745/JIPS.02.0090 인용 PDF KSCI

Implementation of Linear Detection Algorithm using Raspberry Pi and OpenCV (라즈베리파이와 OpenCV를 활용한 선형 검출 알고리즘 구현)

Lee, Sung-jin;Choi, Jun-hyeong;Choi, Byeong-yoon
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2021.10a
- /
- pp.637-639
- /
- 2021
As autonomous driving research is actively progressing, lane detection is an essential technology in ADAS (Advanced Driver Assistance System) to locate a vehicle and maintain a route. Lane detection is detected using an image processing algorithm such as Hough transform and RANSAC (Random Sample Consensus). This paper implements a linear shape detection algorithm using OpenCV on Raspberry Pi 3 B+. Thresholds were set through OpenCV Gaussian blur structure and Canny edge detection, and lane recognition was successful through linear detection algorithm.
PDF

An Vision System for Traffic Sign Recognition (교통표지판 인식을 위한 비젼시스템)

남기환;배철수;박호식;박동희;한준희;나상동
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2003.10a
- /
- pp.645-648
- /
- 2003
This paper presents an active vision system for on-line traffic sign recognition. The system is composed of two cameras, one is equipped with a wide-angle lens and the other with a telephoto lends, and a PC with an image processing board. The system first detects candidates for traffic signs in the wide-angle image using color, intensity, and shape information. For each candidate, the telephoto-camera is directed to its predicted position to capture the candidate in a large size in the image. The recognition algorithm is designed by intensively using built in functions of an off-the-shelf mage processing board to realize both easy implementation and fast recognition. The results of on-road experiments show the feasibility of the system.
PDF

An Vision System for Traffic sign Recognition (교통표지판 인식을 위한 비젼시스템)

남기환;배철수
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.8 no.2
- /
- pp.471-476
- /
- 2004
This paper presents an active vision system for on-line traffic sign recognition. The system is composed of two cameras, one is equipped with a wide-angle lens and the other with a telephoto lends, and a PC with an image processing board. The system first detects candidates for traffic signs in the wide-angle image using color, intensity, and shape information. For each candidate, the telephoto-camera is directed to its predicted position to capture the candidate in a large size in the image. The recognition algorithm is designed by intensively using built in functions of an off-the-shelf image processing board to realize both easy implementation and fast recognition. The results of on-road experiments show the feasibility of the system.
PDF KSCI

Dual-Encoded Features from Both Spatial and Curvelet Domains for Image Smoke Recognition

Yuan, Feiniu;Tang, Tiantian;Xia, Xue;Shi, Jinting;Li, Shuying
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.4
- /
- pp.2078-2093
- /
- 2019
Visual smoke recognition is a challenging task due to large variations in shape, texture and color of smoke. To improve performance, we propose a novel smoke recognition method by combining dual-encoded features that are extracted from both spatial and Curvelet domains. A Curvelet transform is used to filter an image to generate fifty sub-images of Curvelet coefficients. Then we extract Local Binary Pattern (LBP) maps from these coefficient maps and aggregate histograms of these LBP maps to produce a histogram map. Afterwards, we encode the histogram map again to generate Dual-encoded Local Binary Patterns (Dual-LBP). Histograms of Dual-LBPs from Curvelet domain and Completed Local Binary Patterns (CLBP) from spatial domain are concatenated to form the feature for smoke recognition. Finally, we adopt Gaussian Kernel Optimization (GKO) algorithm to search the optimal kernel parameters of Support Vector Machine (SVM) for further improvement of classification accuracy. Experimental results demonstrate that our method can extract effective and reasonable features of smoke images, and achieve good classification accuracy.
https://doi.org/10.3837/tiis.2019.04.019 인용 PDF KSCI HTML

Vision-based Food Shape Recognition and Its Positioning for Automated Production of Custom Cakes (주문형 케이크 제작 자동화를 위한 영상 기반 식품 모양 인식 및 측위)

Oh, Jang-Sub;Lee, Jaesung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.24 no.10
- /
- pp.1280-1287
- /
- 2020
This paper proposes a vision-based food recognition method for automated production of custom cakes. A small camera module mounted on a food art printer recognizes objects' shape and estimates their center points through image processing. Through the perspective transformation, the top-view image is obtained from the original image taken at an oblique position. The line and circular hough transformations are applied to recognize square and circular shapes respectively. In addition, the center of gravity of each figure are accurately detected in units of pixels. The test results show that the shape recognition rate is more than 98.75% under 180 ~ 250 lux of light and the positioning error rate is less than 0.87% under 50 ~ 120 lux. These values sufficiently meet the needs of the corresponding market. In addition, the processing delay is also less than 0.5 seconds per frame, so the proposed algorithm is suitable for commercial purpose.
https://doi.org/10.6109/jkiice.2020.24.10.1280 인용 PDF KSCI

Hand Motion Recognition Algorithm Using Skin Color and Center of Gravity Profile (피부색과 무게중심 프로필을 이용한 손동작 인식 알고리즘)

Park, Youngmin
- The Journal of the Convergence on Culture Technology
- /
- v.7 no.2
- /
- pp.411-417
- /
- 2021
The field that studies human-computer interaction is called HCI (Human-computer interaction). This field is an academic field that studies how humans and computers communicate with each other and recognize information. This study is a study on hand gesture recognition for human interaction. This study examines the problems of existing recognition methods and proposes an algorithm to improve the recognition rate. The hand region is extracted based on skin color information for the image containing the shape of the human hand, and the center of gravity profile is calculated using principal component analysis. I proposed a method to increase the recognition rate of hand gestures by comparing the obtained information with predefined shapes. We proposed a method to increase the recognition rate of hand gestures by comparing the obtained information with predefined shapes. The existing center of gravity profile has shown the result of incorrect hand gesture recognition for the deformation of the hand due to rotation, but in this study, the center of gravity profile is used and the point where the distance between the points of all contours and the center of gravity is the longest is the starting point. Thus, a robust algorithm was proposed by re-improving the center of gravity profile. No gloves or special markers attached to the sensor are used for hand gesture recognition, and a separate blue screen is not installed. For this result, find the feature vector at the nearest distance to solve the misrecognition, and obtain an appropriate threshold to distinguish between success and failure.
https://doi.org/10.17703/JCCT.2021.7.2.411 인용 PDF KSCI

Search Result 233, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)