• 제목/요약/키워드: Spectral weighted

검색결과 98건 처리시간 0.021초

주파수가중 스펙트럼성형필터와 위너필터를 결합한 음성 스펙트럼 강조 (Speech Spectrum Enhancement Combined with Frequency-weighted Spectrum Shaping Filter and Wiener Filter)

  • 최재승
    • 한국정보통신학회논문지
    • /
    • 제20권10호
    • /
    • pp.1867-1872
    • /
    • 2016
  • 디지털신호처리 분야에서 다양한 환경에서 존재하는 배경잡음을 제거하여 음성신호의 품질을 개선시키는 것은 반드시 필요한 문제이다. 음향학적으로 배경잡음을 제거할 때 고려해야 할 중요한 점은 인간의 청각기전이 주로 음성의 진폭 스펙트럼의 정보에 의존하여 문제 해결을 하고 있다는 사실이다. 본 논문에서는 음성의 진폭 스펙트럼의 추출을 주요 목적으로 하는 주파수가중 스펙트럼성형필터의 특성을 도입한다. 따라서 본 논문에서는 배경잡음으로 중첩된 음성신호 성분 중에 이 진폭 스펙트럼 정보를 추출하여, 위너 필터법과 음향학적인 모델에 의한 주파수가중 스펙트럼성형필터를 사용한 알고리즘을 제안한다. 본 실험에서는 스펙트럼 왜곡률(SD)에 의하여 제안한 알고리즘의 출력 SD가 기존의 다른 방법과 비교하여 약 5.28 dB 이상 개선되었다.

연속음에서의 각 음소의 대표구간 추출에 관한 연구 (A study on extraction of the frames representing each phoneme in continuous speech)

  • 박찬응;이쾌희
    • 전자공학회논문지B
    • /
    • 제33B권4호
    • /
    • pp.174-182
    • /
    • 1996
  • In continuous speech recognition system, it is possible to implement the system which can handle unlimited number of words by using limited number of phonetic units such as phonemes. Dividing continuous speech into the string of tems of phonemes prior to recognition process can lower the complexity of the system. But because of the coarticulations between neiboring phonemes, it is very difficult ot extract exactly their boundaries. In this paper, we propose the algorithm ot extract short terms which can represent each phonemes instead of extracting their boundaries. The short terms of lower spectral change and higher spectral chang eare detcted. Then phoneme changes are detected using distance measure with this lower spectral change terms, and hgher spectral change terms are regarded as transition terms or short phoneme terms. Finally lower spectral change terms and the mid-term of higher spectral change terms are regarded s the represent each phonemes. The cepstral coefficients and weighted cepstral distance are used for speech feature and measuring the distance because of less computational complexity, and the speech data used in this experimetn was recoreded at silent and ordinary in-dorr environment. Through the experimental results, the proposed algorithm showed higher performance with less computational complexity comparing with the conventional segmetnation algorithms and it can be applied usefully in phoneme-based continuous speech recognition.

  • PDF

Weighted Least-Squares Design and Parallel Implementation of Variable FIR Filters

  • Deng, Tian-Bo
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 ITC-CSCC -1
    • /
    • pp.686-689
    • /
    • 2002
  • This paper proposes a weighted least-squares(WLS) method for designing variable one-dimensional (1-D) FIR digital filters with simultaneously variable magnitude and variable non-integer phase-delay responses. First, the coefficients of a variable FIR filter are represented as the two-dimensional (2-D) polynomials of a pair of spectral parameters: one is for tuning the magnitude response, and the other is for varying its non-integer phase-delay response. Then the optimal coefficients of the 2-D polynomials are found by minimizing the total weighted squared error of the variable frequency response. Finally, we show that the resulting variable FIR filter can be implemented in a parallel form, which is suitable for high-speed signal processing.

  • PDF

Toward Practical Augmentation of Raman Spectra for Deep Learning Classification of Contamination in HDD

  • Seksan Laitrakun;Somrudee Deepaisarn;Sarun Gulyanon;Chayud Srisumarnk;Nattapol Chiewnawintawat;Angkoon Angkoonsawaengsuk;Pakorn Opaprakasit;Jirawan Jindakaew;Narisara Jaikaew
    • Journal of information and communication convergence engineering
    • /
    • 제21권3호
    • /
    • pp.208-215
    • /
    • 2023
  • Deep learning techniques provide powerful solutions to several pattern-recognition problems, including Raman spectral classification. However, these networks require large amounts of labeled data to perform well. Labeled data, which are typically obtained in a laboratory, can potentially be alleviated by data augmentation. This study investigated various data augmentation techniques and applied multiple deep learning methods to Raman spectral classification. Raman spectra yield fingerprint-like information about chemical compositions, but are prone to noise when the particles of the material are small. Five augmentation models were investigated to build robust deep learning classifiers: weighted sums of spectral signals, imitated chemical backgrounds, extended multiplicative signal augmentation, and generated Gaussian and Poisson-distributed noise. We compared the performance of nine state-of-the-art convolutional neural networks with all the augmentation techniques. The LeNet5 models with background noise augmentation yielded the highest accuracy when tested on real-world Raman spectral classification at 88.33% accuracy. A class activation map of the model was generated to provide a qualitative observation of the results.

회체가스중합법에 의한 수증기의 파장별 복사물성치 모델에 관한 연구 (A study on the wsggm-based spectral modeling of radiation properties of water vapor)

  • 김욱중;송태호
    • 대한기계학회논문집B
    • /
    • 제20권10호
    • /
    • pp.3371-3380
    • /
    • 1996
  • 각종 노(furnace)를 포함하여 보일러, 가스터빈, 우주선 추진기구, 원자로 및 연료전지 등 고온이 열전달 문제를 다루는데 있어서 복사 열전달 연구는 매우 중요하다. 이러한 복사열전달에 대한 연구동향은 최근 복사 물성치(특히 가스에서의)를 예측하기 위한 이론 모델의 개발 및 측정분야와 복사전달 방정식의 해를 비교적 간단하게 구하기 위한 근사해법에 대한 연구로 크게 대별되고 있다. 이러한 두가지 연구방햐은 완전히 분리, 독립되어 있는 것은 아니며 서로 많은 연관성을 가지고 있다. 특히, 비회체가스(nongray gas)의 복사 성질에 대한 모델링은 근사 해법의 계산결과에 큰 영향을 미치게 되므로 가스의 복사 성질의 예측은 매우 중요한 연구 과제가 된고 있다. a Low resolution spectral modeling of water vapor is carried out by applying the weighted-sum-of-gray-gases model (WSGGM) to a narrow band. For a given narrow band, focus is placed on proper modeling of gray gas absorption coefficients vs. temeprature relation used for any solution methods for the Radiative Transfer Equation(RTE). Comparison between the modeled emissivity and the "true" emissivity obtained from a high temperatue statistical narrow band parameters is made ofr the total spectrum as well as for a few typical narrow bands. Application of the model to nonuniform gas layers is also made. Low resolution spectral intensities at the boundary are obtained for uniform, parabolic and boundary layer type temeprature profiles using the obtained for uniform, parabolic and boundary layer type temperature profiles using the obtained WSGGM's with 9 gray gases. The results are compared with the narrow band spectral intensities as obtained by a narrow band model-based code with the Curtis-Godson approximation. Good agreement is found between them. Local heat source strength and total wall heat flux are also compared for the cases of Kim et al, which again gives promising agreement.

SPECTRAL-COLLOCATION METHOD FOR FRACTIONAL FREDHOLM INTEGRO-DIFFERENTIAL EQUATIONS

  • Yang, Yin;Chen, Yanping;Huang, Yunqing
    • 대한수학회지
    • /
    • 제51권1호
    • /
    • pp.203-224
    • /
    • 2014
  • We propose and analyze a spectral Jacobi-collocation approximation for fractional order integro-differential equations of Fredholm-Volterra type. The fractional derivative is described in the Caputo sense. We provide a rigorous error analysis for the collection method, which shows that the errors of the approximate solution decay exponentially in $L^{\infty}$ norm and weighted $L^2$-norm. The numerical examples are given to illustrate the theoretical results.

ON IMPROVING THE PERFORMANCE OF CODED SPECTRAL PARAMETERS FOR SPEECH RECOGNITION

  • Choi, Seung-Ho;Kim, Hong-Kook;Lee, Hwang-Soo
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
    • /
    • pp.250-253
    • /
    • 1998
  • In digital communicatioin networks, speech recognition systems conventionally reconstruct speech followed by extracting feature [parameters. In this paper, we consider a useful approach by incorporating speech coding parameters into the speech recognizer. Most speech coders employed in the networks represent line spectral pairs as spectral parameters. In order to improve the recognition performance of the LSP-based speech recognizer, we introduce two different ways: one is to devise weighed distance measures of LSPs and the other is to transform LSPs into a new feature set, named a pseudo-cepstrum. Experiments on speaker-independent connected-digit recognition showed that the weighted distance measures significantly improved the recognition accuracy than the unweighted one of LSPs. Especially we could obtain more improved performance by using PCEP. Compared to the conventional methods employing mel-frequency cepstral coefficients, the proposed methods achieved higher performance in recognition accuracies.

  • PDF

MR Brain Image Segmentation Using Clustering Technique

  • Yoon, Ock-Kyung;Kim, Dong-Whee;Kim, Hyun-Soon;Park, Kil-Houm
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 ITC-CSCC -1
    • /
    • pp.450-453
    • /
    • 2000
  • In this paper, an automated segmentation algorithm is proposed for MR brain images using T1-weighted, T2-weighted, and PD images complementarily. The proposed segmentation algorithm is composed of 3 steps. In the first step, cerebrum images are extracted by putting a cerebrum mask upon the three input images. In the second step, outstanding clusters that represent inner tissues of the cerebrum are chosen among 3-dimensional (3D) clusters. 3D clusters are determined by intersecting densely distributed parts of 2D histogram in the 3D space formed with three optimal scale images. Optimal scale image best describes the shape of densely distributed parts of pixels in 2D histogram. In the final step, cerebrum images are segmented using FCM algorithm with it’s initial centroid value as the outstanding cluster’s centroid value. The proposed segmentation algorithm complements the defect of FCM algorithm, being influenced upon initial centroid, by calculating cluster’s centroid accurately And also can get better segmentation results from the proposed segmentation algorithm with multi spectral analysis than the results of single spectral analysis.

  • PDF

Improvement of COMS Land Surface Temperature Retrieval Algorithm

  • Hong, Ki-Ok;Suh, Myoung-Seok;Kang, Jeon-Ho
    • 대한원격탐사학회지
    • /
    • 제25권6호
    • /
    • pp.507-515
    • /
    • 2009
  • Land surface temperature (LST) is a key environmental variable in a wide range of applications, such as weather, climate, hydrology, and ecology. However, LST is one of the most difficult surface variables to observe regularly due to the strong spatio-temporal variations. So, we have developed the LST retrieval algorithm from COMS (Communication, Ocean and Meteorological Satellite) data through the radiative transfer simulations under various atmospheric profiles (TIGR data), satellite zenith angle (SZA), spectral emissivity, and surface lapse rate conditions using MODTRAN 4. However, the LST retrieval algorithm has a tendency to overestimate and underestimate the LST for surface inversion and superadiabatic conditions, respectively. To minimize the overestimation and underestimation of LST, we also developed day/night LST algorithms separately based on the surface lapse rate (local time) and recalculated the final LST by using the weighted sum of day/night LST. The analysis results showed that the quality of weighted LST of day/night algorithms is greatly improved compared to that of LST estimated by original algorithm regardless of the surface lapse rate, spectral emissivity difference (${\Delta}{\varepsilon}$) SZA, and atmospheric conditions. In general, the improvements are greatest when the surface lapse rate and ${\Delta}{\varepsilon}$ are negatively large (strong inversion conditions and less vegetated surface).

Radionuclide identification based on energy-weighted algorithm and machine learning applied to a multi-array plastic scintillator

  • Hyun Cheol Lee ;Bon Tack Koo ;Ju Young Jeon ;Bo-Wi Cheon ;Do Hyeon Yoo ;Heejun Chung;Chul Hee Min
    • Nuclear Engineering and Technology
    • /
    • 제55권10호
    • /
    • pp.3907-3912
    • /
    • 2023
  • Radiation portal monitors (RPMs) installed at airports and harbors to prevent illicit trafficking of radioactive materials generally use large plastic scintillators. However, their energy resolution is poor and radionuclide identification is nearly unfeasible. In this study, to improve isotope identification, a RPM system based on a multi-array plastic scintillator and convolutional neural network (CNN) was evaluated by measuring the spectra of radioactive sources. A multi-array plastic scintillator comprising an assembly of 14 hexagonal scintillators was fabricated within an area of 50 × 100 cm2. The energy spectra of 137Cs, 60Co, 226Ra, and 4K (KCl) were measured at speeds of 10-30 km/h, respectively, and an energy-weighted algorithm was applied. For the CNN, 700 and 300 spectral images were used as training and testing images, respectively. Compared to the conventional plastic scintillator, the multi-arrayed detector showed a high collection probability of the optical photons generated inside. A Compton maximum peak was observed for four moving radiation sources, and the CNN-based classification results showed that at least 70% was discriminated. Under the speed condition, the spectral fluctuations were higher than those under dwelling condition. However, the machine learning results demonstrated that a considerably high level of nuclide discrimination was possible under source movement conditions.