• 제목/요약/키워드: Spectral parameter

검색결과 308건 처리시간 0.028초

비트율-왜곡 기반 음성 신호 시간축 분할 (A Temporal Decomposition Method Based on a Rate-distortion Criterion)

  • 이기승
    • 한국음향학회지
    • /
    • 제21권3호
    • /
    • pp.315-322
    • /
    • 2002
  • 본 논문에서는 음성 신호 시간축 분할의 새로운 기법으로, 비트율과 왜곡을 함께 고려한 기법이 제안되었다. 시간축 분할에 필요한 보간 함수는 학습 음성 데이터로부터 얻어진다. 보간 함수는 두 타겟간의 길이에 따라 유일하게 결정되므로 보간 함수는 추가 정보없이 표현된다. 타겟 샘플은 비트율을 최소화시키면서 동시에 최대 스펙트럼 오차가 문턱 치보다 작게 되도록 선택하였다. 제안된 기법은 음성 부호화기의 스펙트럼 변수로 널리 사용되는 LSP계수의 부호화에 적용되었으며, 모의실험 결과 평균적으로 8 bits/Frame의 비트율에서 1.4 dB의 스펙트럼 왜곡이 얻어짐을 알 수 있었다.

DSP를 이용한 자동차 소음에 강인한 음성인식기 구현 (Implementation of a Robust Speech Recognizer in Noisy Car Environment Using a DSP)

  • 정익주
    • 음성과학
    • /
    • 제15권2호
    • /
    • pp.67-77
    • /
    • 2008
  • In this paper, we implemented a robust speech recognizer using the TMS320VC33 DSP. For this implementation, we had built speech and noise database suitable for the recognizer using spectral subtraction method for noise removal. The recognizer has an explicit structure in aspect that a speech signal is enhanced through spectral subtraction before endpoints detection and feature extraction. This helps make the operation of the recognizer clear and build HMM models which give minimum model-mismatch. Since the recognizer was developed for the purpose of controlling car facilities and voice dialing, it has two recognition engines, speaker independent one for controlling car facilities and speaker dependent one for voice dialing. We adopted a conventional DTW algorithm for the latter and a continuous HMM for the former. Though various off-line recognition test, we made a selection of optimal conditions of several recognition parameters for a resource-limited embedded recognizer, which led to HMM models of the three mixtures per state. The car noise added speech database is enhanced using spectral subtraction before HMM parameter estimation for reducing model-mismatch caused by nonlinear distortion from spectral subtraction. The hardware module developed includes a microcontroller for host interface which processes the protocol between the DSP and a host.

  • PDF

Inference of Chromospheric Plasma Parameters on the Sun from Strong Absorption Lines

  • Chae, Jongchul;Madjarska, Maria S.;Kwak, Hannah;Cho, Kyuhyoun
    • 천문학회보
    • /
    • 제45권1호
    • /
    • pp.44.4-45
    • /
    • 2020
  • The solar chromosphere can be observed well through strong absorption lines. We infer the physical parameters of chromospheric plasmas from these lines using a multilayer spectral inversion. This is a new technique of spectral inversion. We assume that the atmosphere consists of a finite number of layers. In each layer the absorption profile is constant and the source function is allowed to vary with optical depth. Specifically, we consider a three-layer model of radiative transfer where the lowest layer is identified with the photosphere and the two upper layers are identified with the chromosphere. This three-layer model is fully specified by 13 parameters. Four parameters can be fixed to prescribed values, and one parameter can be determined from the analysis of a satellite photospheric line. The remaining eight parameters are determined from a constrained least-squares fitting. We applied the multilayer spectral inversion to the spectral data of the Hα and the Ca II 854.21 nm lines taken in a quiet region by the Fast Imaging Solar Spectrograph (FISS) of the Goode Solar Telescope (GST). We find that our model successfully fits most of the observed profiles and produces regular maps of the model parameters. We conclude that our multilayer inversion is useful to infer chromospheric plasma parameters on the Sun.

  • PDF

적응예측기를 이용하여 잡음섞인 음성신호로부터 autoregressive 계수를 추산하는 방법 (An Autoregressive Parameter Estimation from Noisy Speech Using the Adaptive Predictor)

  • 구본응
    • 한국음향학회지
    • /
    • 제14권3호
    • /
    • pp.90-96
    • /
    • 1995
  • 잡음섞인 관측데이타로부터 AR 모수를 추정하는 방법을 제안하였다. AP 방법이라고 이름붙인 이 방법은 단순하고도 신뢰성있는 적응예측기를 이용하려는 시도의 산물이다. 잡음섞인 입력수열로부터 계산된 AR 모수의 추정치보다 예측수열로부터 계산된 AR 모수의 추정치가 원래의 모수에 스펙트럼상의 거리가 더 가깝다는 것을 이론적으로 증명하였다. 실제 음성 신호와 칼만필터를 사용한 실험결과도 이론과 일치함을 보였다. 대략적으로, AP방법으로 계산된 추정치를 사용하였을때의 잡음감쇠성능은 잡음섞인 입력수열로부터 계산된 AP 모수의 추정치를 사용하였을때보다는 우수하였고, EM반복법에 의한 추정치를 사용하였을때보다는 약간 못한 것으로 나타났다. 그러나, 제안된 방법은 그 단순성으로 인하여 경우에 따라 더 복잡한 다른 방법의 대안으로 사용될 수 있을 것이다.

  • PDF

디지털 카메라를 활용한 컬러 지상영상의 분광학적 특성 분석 (Analysis of the spectroscopic characteristics of Ground color images using a digital camera)

  • 고인철;서수영
    • 한국GIS학회:학술대회논문집
    • /
    • 한국GIS학회 2010년도 춘계학술대회
    • /
    • pp.137-144
    • /
    • 2010
  • DSLR 카메라를 이용하여 획득한 지상 디지털 영상자료는 지상 사진 측량, 공간모델링에 활용할 수 있다. 지상 디지털 영상에서 각 화소의 명암도는 영상을 결정하는 가장 중요한 매개변수(parameter)이다. 따라서 좀 더 명확한 명암도의 수치 자료를 획득하고 활용하기 위하여 디지털 카메라의 분광학적 특성과 파라미터를 추정해볼 필요가 있다. 본 연구에서는 Sony DSC-F828 DSLR 카메라로 연속촬영(프레임속도 0.38초)을 통하여 얻은 7장의 같은 디지털 컬러 사진으로부터 각각 RGB 밴드의 명암도 값을 추출하여 프레임 간 화소별 명암도 차이를 확인하고, 각 컬러 밴드에 대한 각 화소의 통계학적인 분석을 통하여 분광학적인 특성의 프레임별, 화소별, 밴드별 변화와 그에 따른 상관관계에 대하여 추정해 보는 것을 목적으로 한다.

  • PDF

An Assessment of a Random Forest Classifier for a Crop Classification Using Airborne Hyperspectral Imagery

  • Jeon, Woohyun;Kim, Yongil
    • 대한원격탐사학회지
    • /
    • 제34권1호
    • /
    • pp.141-150
    • /
    • 2018
  • Crop type classification is essential for supporting agricultural decisions and resource monitoring. Remote sensing techniques, especially using hyperspectral imagery, have been effective in agricultural applications. Hyperspectral imagery acquires contiguous and narrow spectral bands in a wide range. However, large dimensionality results in unreliable estimates of classifiers and high computational burdens. Therefore, reducing the dimensionality of hyperspectral imagery is necessary. In this study, the Random Forest (RF) classifier was utilized for dimensionality reduction as well as classification purpose. RF is an ensemble-learning algorithm created based on the Classification and Regression Tree (CART), which has gained attention due to its high classification accuracy and fast processing speed. The RF performance for crop classification with airborne hyperspectral imagery was assessed. The study area was the cultivated area in Chogye-myeon, Habcheon-gun, Gyeongsangnam-do, South Korea, where the main crops are garlic, onion, and wheat. Parameter optimization was conducted to maximize the classification accuracy. Then, the dimensionality reduction was conducted based on RF variable importance. The result shows that using the selected bands presents an excellent classification accuracy without using whole datasets. Moreover, a majority of selected bands are concentrated on visible (VIS) region, especially region related to chlorophyll content. Therefore, it can be inferred that the phenological status after the mature stage influences red-edge spectral reflectance.

잡음 환경 분류 알고리즘을 이용한 IMCRA 기반의 음성 향상 기법 (Speech Enhancement Based on IMCRA Incorporating noise classification algorithm)

  • 송지현;박규석;안홍섭;이상민
    • 전기학회논문지
    • /
    • 제61권12호
    • /
    • pp.1920-1925
    • /
    • 2012
  • In this paper, we propose a novel method to improve the performance of the improved minima controlled recursive averaging (IMCRA) in non-stationary noisy environment. The conventional IMCRA algorithm efficiently estimate the noise power by averaging past spectral power values based on a smoothing parameter that is adjusted by the signal presence probability in frequency subbands. Since the minimum of smoothing parameter is defined as 0.85, it is difficult to obtain the robust estimates of the noise power in non-stationary noisy environments that is rapidly changed the spectral characteristics such as babble noise. For this reason, we proposed the modified IMCRA, which adaptively estimate and updata the noise power according to the noise type classified by the Gaussian mixture model (GMM). The performances of the proposed method are evaluated by perceptual evaluation of speech quality (PESQ) and composite measure under various environments and better results compared with the conventional method are obtained.

비정체성 잡음을 위한 SPD-TE 기반 계수형 음성 활동 탐지 (A Parametric Voice Activity Detection Based on the SPD-TE for Nonstationary Noises)

  • 구본응
    • 한국음향학회지
    • /
    • 제34권4호
    • /
    • pp.310-315
    • /
    • 2015
  • 본 논문에서는 비정체성(nonstationary) 잡음 환경을 위한 단일 채널 VAD(Voice Activity Detection) 알고리듬 제안하였다. VAD 판별을 위한 특징계수의 임계값은 과거 비음성 프레임들의 평균과 표준편차를 추산하여 적응적으로 갱신하였다. 특징계수로는 SPD-TE(Spectral Power Difference-Teager Energy)를 사용했는데, 이것은 WPD(Wavelet Packet Decomposition) 계수에 Teager 에너지를 적용한 것으로서 잡음에 강인한 것으로 보고된 바 있다. TIMIT 음성과 NOISEX-92 잡음을 사용하여 10 dB부터 -10 dB까지의 SNR에 대한 실험 결과, 제안된 알고리듬이 표준을 포함한 기존의 알고리듬과 비슷한 정확도를 보였다.

Distortion Compensation of WDM Signals with initial frequency chirp in the Modified Mid-Span Spectral Inversion Technique

  • Lee, Seong-Real
    • Journal of information and communication convergence engineering
    • /
    • 제5권1호
    • /
    • pp.17-22
    • /
    • 2007
  • In this paper, the optimal value of optical phase conjugator (OPC) position and the optimal values of dispersion coefficients of fiber sections for the best compensation of the distorted WDM signals with frequency chirp of -1 are induced to alternate with the symmetrical distributions of power and local dispersion with respect to OPC, which is difficult to form in real optical link due to fiber attenuation in mid-span spectral inversion (MSSI) technique. It is confirmed that the Q-factors of total channels of -18.5 dBm launching light power exceed 16.9 dB, which value corresponds to 10-12 BER, by applying the induced optimal parameter values into 16 channels ${\times}$ 40 Gbps WDM system, on the other hand the Q-factors of only 9 channels exceed that value in WDM system with the conventional MSSI technique. Thus, it is expected to expand the availability of OPC in WDM system through the using of the optimal parameter values that are induced by the proposed method in this paper, without the symmetrical distributions of power and local dispersion.