Search | Korea Science

Speech Feature Extraction Using Auditory Model (청각모델을 이용한 음성신호의 특징 추출 방법에 관한 연구)

Park, Kyu-Hong;Kim, Young-Ho;Jung, Sang-Kuk;Rho, Seung-Yong
- Proceedings of the KIEE Conference
- /
- 1998.07g
- /
- pp.2259-2261
- /
- 1998
Auditory Models that are capable of achieving human performance would provide a basis for realizing effective speech processing systems. Perceptual invariance to adverse signal conditions (noise, microphone and channel distortions, room reverberations) may provide a basis for robust speech recognition and speech coder with high efficiency. Auditory model that simulates the part of auditory periphery up through the auditory nerve level and new distance measure that is defined as angle between vectors are described.
PDF

Speech Recognition in Noisy Environments using Wiener Filtering (Wiener Filtering을 이용한 잡음환경에서의 음성인식)

Kim, Jin-Young;Eom, Ki-Wan;Choi, Hong-Sub
- Speech Sciences
- /
- v.1
- /
- pp.277-283
- /
- 1997
In this paper, we present a robust recognition algorithm based on the Wiener filtering method as a research tool to develop the Korean Speech recognition system. We especially used Wiener filtering method in cepstrum-domain, because the method in frequency-domain is computationally expensive and complex. Evaluation of the effectiveness of this method has been conducted in speaker-independent isolated Korean digit recognition tasks using discrete HMM speech recognition systems. In these tasks, we used 12th order weighted cepstral as a feature vector and added computer simulated white gaussian noise of different levels to clean speech signals for recognition experiments under noisy conditions. Experimental results show that the presented algorithm can provide an improvement in recognition of as much as from $5\%\;to\;\20\%$ in comparison to spectral subtraction method.
PDF

Structure-Preserving Mesh Simplification

Chen, Zhuo;Zheng, Xiaobin;Guan, Tao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.11
- /
- pp.4463-4482
- /
- 2020
Mesh model generated from 3D reconstruction usually comes with lots of noise, which challenges the performance and robustness of mesh simplification approaches. To overcome this problem, we present a novel method for mesh simplification which could preserve structure and improve the accuracy. Our algorithm considers both the planar structures and linear features. In the preprocessing step, it automatically detects a set of planar structures through an iterative diffusion approach based on Region Seed Growing algorithm; then robust linear features of the mesh model are extracted by exploiting image information and planar structures jointly; finally we simplify the mesh model with plane constraint QEM and linear feature preserving strategies. The proposed method can overcome the known problem that current simplification methods usually degrade the structural characteristics, especially when the decimation is extreme. Our experimental results demonstrate that the proposed method, compared to other simplification algorithms, can effectively improve the quality of mesh and yield an increased robustness on noisy input mesh.
https://doi.org/10.3837/tiis.2020.11.012 인용 PDF KSCI HTML

Impact localization method for composite structures subjected to temperature fluctuations

Gorgin, Rahim;Wang, Ziping
- Smart Structures and Systems
- /
- v.30 no.4
- /
- pp.371-383
- /
- 2022
A novel impact localization method is presented based on impact induced elastic waves in sensorized composite structure subjected to temperature fluctuations. In real practices, environmental and operational conditions influence the acquired signals and consequently make the feature (particularly Time of Arrival (TOA)) extraction process, complicated and troublesome. To overcome this complication, a robust TOA estimation method is proposed based on the times in which the absolute amplitude of the signal reaches to a specific amplitude value. The presented method requires prior knowledge about the normalized wave velocity in different directions of propagation. To this aim, a finite element model of the plate was built in ABAQUS/CAE. The impact location is then highlighted by calculating an error value at different points of the structure. The efficiency of the developed impact localization technique is experimentally evaluated by dropping steel balls with different energies on a carbon fiber composite plate with different temperatures. It is demonstrated that the developed technique is able to localize impacts with different energies even in the presence of noise and temperature fluctuations.
https://doi.org/10.12989/sss.2022.30.4.371 인용 KSCI

Co-registration of PET-CT Brain Images using a Gaussian Weighted Distance Map (가우시안 가중치 거리지도를 이용한 PET-CT 뇌 영상정합)

Lee, Ho;Hong, Helen;Shin, Yeong-Gil
- Journal of KIISE:Software and Applications
- /
- v.32 no.7
- /
- pp.612-624
- /
- 2005
In this paper, we propose a surface-based registration using a gaussian weighted distance map for PET-CT brain image fusion. Our method is composed of three main steps: the extraction of feature points, the generation of gaussian weighted distance map, and the measure of similarities based on weight. First, we segment head using the inverse region growing and remove noise segmented with head using region growing-based labeling in PET and CT images, respectively. And then, we extract the feature points of the head using sharpening filter. Second, a gaussian weighted distance map is generated from the feature points in CT images. Thus it leads feature points to robustly converge on the optimal location in a large geometrical displacement. Third, weight-based cross-correlation searches for the optimal location using a gaussian weighted distance map of CT images corresponding to the feature points extracted from PET images. In our experiment, we generate software phantom dataset for evaluating accuracy and robustness of our method, and use clinical dataset for computation time and visual inspection. The accuracy test is performed by evaluating root-mean-square-error using arbitrary transformed software phantom dataset. The robustness test is evaluated whether weight-based cross-correlation achieves maximum at optimal location in software phantom dataset with a large geometrical displacement and noise. Experimental results showed that our method gives more accuracy and robust convergence than the conventional surface-based registration.
PDF KSCI

Speaker Identification Using Higher-Order Statistics In Noisy Environment (고차 통계를 이용한 잡음 환경에서의 화자식별)

Shin, Tae-Young;Kim, Gi-Sung;Kwon, Young-Uk;Kim, Hyung-Soon
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.6
- /
- pp.25-35
- /
- 1997
Most of speech analysis methods developed up to date are based on second order statistics, and one of the biggest drawback of these methods is that they show dramatical performance degradation in noisy environments. On the contrary, the methods using higher order statistics(HOS), which has the property of suppressing Gaussian noise, enable robust feature extraction in noisy environments. In this paper we propose a text-independent speaker identification system using higher order statistics and compare its performance with that using the conventional second-order-statistics-based method in both white and colored noise environments. The proposed speaker identification system is based on the vector quantization approach, and employs HOS-based voiced/unvoiced detector in order to extract feature parameters for voiced speech only, which has non-Gaussian distribution and is known to contain most of speaker-specific characteristics. Experimental results using 50 speaker's database show that higher-order-statistics-based method gives a better identificaiton performance than the conventional second-order-statistics-based method in noisy environments.
PDF

2D Industrial Image Registration Method for the Detection of Defects (결함 검출을 위한 2차원 산업 영상 정합 기법)

Lee, Youngjoo;Lee, Jeongjin
- Journal of Korea Multimedia Society
- /
- v.15 no.11
- /
- pp.1369-1376
- /
- 2012
In this paper, we propose 2D industrial image registration method for the detection of defects. Proposed method performs preprocessing to smooth the original image with the preservation of the edge for the robust registration against general noise. Then, x-direction gradient magnitude image and corresponding binary image are generated. Density analysis around neighborhood regions per pixel are performed to generate feature image for preventing mis-registration due to moire-like patterns, which frequently happen in industrial images. Finally, 2D image registration based on phase correlation between feature images is performed to calculate translational parameters to align two images rapidly and optimally. Experimental results showed that the registration accuracy of proposed method for the real industrial images was 100% and our method was about twenty times faster than the previous method. Our fast and accurate method could be used for the real industrial applications.
https://doi.org/10.9717/kmms.2012.15.11.1369 인용 PDF KSCI

Simultaneous Speaker and Environment Adaptation by Environment Clustering in Various Noise Environments (다양한 잡음 환경하에서 환경 군집화를 통한 화자 및 환경 동시 적응)

Kim, Young-Kuk;Song, Hwa-Jeon;Kim, Hyung-Soon
- The Journal of the Acoustical Society of Korea
- /
- v.28 no.6
- /
- pp.566-571
- /
- 2009
This paper proposes noise-robust fast speaker adaptation method based on the eigenvoice framework in various noisy environments. The proposed method is focused on de-noising and environment clustering. Since the de-noised adaptation DB still has residual noise in itself, environment clustering divides the noisy adaptation data into similar environments by a clustering method using the cepstral mean of non-speech segments as a feature vector. Then each adaptation data in the same cluster is used to build an environment-clustered speaker adapted (SA) model. After selecting multiple environmentally clustered SA models which are similar to test environment, the speaker adaptation based on an appropriate linear combination of clustered SA models is conducted. According to our experiments, we observe that the proposed method provides error rate reduction of $40{\sim}59%$ over baseline with speaker independent model.
https://doi.org/10.7776/ASK.2009.28.6.566 인용 PDF KSCI

A Variant of Improved Robust Fuzzy PCA (잡음 민감성이 개선된 변형 퍼지 주성분 분석 기법)

Kim, Seong-Hoon;Heo, Gyeong-Yong;Woo, Young-Woon
- Journal of the Korea Society of Computer and Information
- /
- v.16 no.2
- /
- pp.25-31
- /
- 2011
Principal component analysis (PCA) is a well-known method for dimensionality reduction and feature extraction. Although PCA has been applied in many areas successfully, it is sensitive to outliers due to the use of sum-square-error. Several variants of PCA have been proposed to resolve the noise sensitivity and, among the variants, improved robust fuzzy PCA (RF-PCA2) demonstrated promising results. RF-PCA2, however, still can fall into a local optimum due to equal initial membership values for all data points. Another reason comes from the fact that RF-PCA2 is based on sum-square-error although fuzzy memberships are incorporated. In this paper, a variant of RF-PCA2 called RF-PCA3 is proposed. The proposed algorithm is based on the objective function of RF-PCA2. RF-PCA3 augments RF-PCA2 with the objective function of PCA and initial membership calculation using data distribution, which make RF-PCA3 to have more chance to converge on a better solution than that of RF-PCA2. RF-PCA3 outperforms RF-PCA2, which is demonstrated by experimental results.
https://doi.org/10.9708/jksci.2011.16.2.025 인용 PDF KSCI

Active Frequency with a Positive Feedback Anti-Islanding Method Based on a Robust PLL Algorithm for Grid-Connected PV PCS

Lee, Jong-Pil;Min, Byung-Duk;Kim, Tae-Jin;Yoo, Dong-Wook;Yoo, Ji-Yoon
- Journal of Power Electronics
- /
- v.11 no.3
- /
- pp.360-368
- /
- 2011
This paper proposes an active frequency with a positive feedback in the d-q frame anti-islanding method suitable for a robust phase-locked loop (PLL) algorithm using the FFT concept. In general, PLL algorithms for grid-connected PV PCS use d-q transformation and controllers to make zero an imaginary part of the transformed voltage vector. In a real grid system, the grid voltage is not ideal. It may be unbalanced, noisy and have many harmonics. For these reasons, the d-q transformed components do not have a pure DC component. The controller tuning of a PLL algorithm is difficult. The proposed PLL algorithm using the FFT concept can use the strong noise cancelation characteristics of a FFT algorithm without a PI controller. Therefore, the proposed PLL algorithm has no gain-tuning of a PI controller, and it is hardly influenced by voltage drops, phase step changes and harmonics. Islanding prediction is a necessary feature of inverter-based photovoltaic (PV) systems in order to meet the stringent standard requirements for interconnection with an electrical grid. Both passive and active anti-islanding methods exist. Typically, active methods modify a given parameter, which also affects the shape and quality of the grid injected current. In this paper, the active anti-islanding algorithm for a grid-connected PV PCS uses positive feedback control in the d-q frame. The proposed PLL and anti-islanding algorithm are implemented for a 250kW PV PCS. This system has four DC/DC converters each with a 25kW power rating. This is only one-third of the total system power. The experimental results show that the proposed PLL, anti-islanding method and topology demonstrate good performance in a 250kW PV PCS.
https://doi.org/10.6113/JPE.2011.11.3.360 인용 PDF KSCI

Search Result 155, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)