• 제목/요약/키워드: Spectral transformation

검색결과 146건 처리시간 0.026초

PRECONDITIONED SPECTRAL COLLOCATION METHOD ON CURVED ELEMENT DOMAINS USING THE GORDON-HALL TRANSFORMATION

  • Kim, Sang Dong;Hessari, Peyman;Shin, Byeong-Chun
    • 대한수학회보
    • /
    • 제51권2호
    • /
    • pp.595-612
    • /
    • 2014
  • The spectral collocation method for a second order elliptic boundary value problem on a domain ${\Omega}$ with curved boundaries is studied using the Gordon and Hall transformation which enables us to have a transformed elliptic problem and a square domain S = [0, h] ${\times}$ [0, h], h > 0. The preconditioned system of the spectral collocation approximation based on Legendre-Gauss-Lobatto points by the matrix based on piecewise bilinear finite element discretizations is shown to have the high order accuracy of convergence and the efficiency of the finite element preconditioner.

Maximum mutual information estimation을 이용한 linear spectral transformation 기반의 adaptation (Maximum mutual information estimation linear spectral transform based adaptation)

  • 유봉수;김동현;육동석
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 춘계 학술대회 발표논문집
    • /
    • pp.53-56
    • /
    • 2005
  • In this paper, we propose a transformation based robust adaptation technique that uses the maximum mutual information(MMI) estimation for the objective function and the linear spectral transformation(LST) for adaptation. LST is an adaptation method that deals with environmental noises in the linear spectral domain, so that a small number of parameters can be used for fast adaptation. The proposed technique is called MMI-LST, and evaluated on TIMIT and FFMTIMIT corpora to show that it is advantageous when only a small amount of adaptation speech is used.

  • PDF

기본주파수와 성도길이의 상관관계를 이용한 HTS 음성합성기에서의 목소리 변환 (Voice transformation for HTS using correlation between fundamental frequency and vocal tract length)

  • 유효근;김영관;서영주;김회린
    • 말소리와 음성과학
    • /
    • 제9권1호
    • /
    • pp.41-47
    • /
    • 2017
  • The main advantage of the statistical parametric speech synthesis is its flexibility in changing voice characteristics. A personalized text-to-speech(TTS) system can be implemented by combining a speech synthesis system and a voice transformation system, and it is widely used in many application areas. It is known that the fundamental frequency and the spectral envelope of speech signal can be independently modified to convert the voice characteristics. Also it is important to maintain naturalness of the transformed speech. In this paper, a speech synthesis system based on Hidden Markov Model(HMM-based speech synthesis, HTS) using the STRAIGHT vocoder is constructed and voice transformation is conducted by modifying the fundamental frequency and spectral envelope. The fundamental frequency is transformed in a scaling method, and the spectral envelope is transformed through frequency warping method to control the speaker's vocal tract length. In particular, this study proposes a voice transformation method using the correlation between fundamental frequency and vocal tract length. Subjective evaluations were conducted to assess preference and mean opinion scores(MOS) for naturalness of synthetic speech. Experimental results showed that the proposed voice transformation method achieved higher preference than baseline systems while maintaining the naturalness of the speech quality.

Spectral Folding방법과 GMM 변환을 이용한 대역폭 확장의 Hybrid 방법 (The Hybrid Bandwidth Extenstion Method Using Spectral Folding and GMM Transformation)

  • 최무열;김형순
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 춘계 학술대회 발표논문집
    • /
    • pp.131-134
    • /
    • 2006
  • The narrowband speech over the telephone network is lacking in the information from low-band (0-300 Hz) and high-band (3400-8000 Hz) that are found in wideband speech (0-8000 Hz). As a result, narrowband speech is characterized by the reduced intelligibility and muffled quality, and degraded speaker identification. Spectral folding is the easiest way to reconstruct the missing high-band; however, the reconstructed speech still brings the sense of band-limited characteristic because of the absence of low-band and mid-band frequency components. To compensate for the lack of the extended speech, we propose to combine the spectral folding method and GMM transformation method, which is a statistical method to reconstruct wideband speech. The reconstructed wideband speech showed that the absent frequency components was filled up with relatively low spectral mismatch. According to the subjective speech quality evaluations, the proposed method was preferred to other methods.

  • PDF

A Closed-Form Solution of Linear Spectral Transformation for Robust Speech Recognition

  • Kim, Dong-Hyun;Yook, Dong-Suk
    • ETRI Journal
    • /
    • 제31권4호
    • /
    • pp.454-456
    • /
    • 2009
  • The maximum likelihood linear spectral transformation (ML-LST) using a numerical iteration method has been previously proposed for robust speech recognition. The numerical iteration method is not appropriate for real-time applications due to its computational complexity. In order to reduce the computational cost, the objective function of the ML-LST is approximated and a closed-form solution is proposed in this paper. It is shown experimentally that the proposed closed-form solution for the ML-LST can provide rapid speaker and environment adaptation for robust speech recognition.

다분광 TM 영상 변환기법과 감독분류 정확도 비교연구 -두만강 하류 지역을 중심으로- (Accuracy of Image Transformation Methods and Supervised Classifications on Multi-Spectral TM: A Comparative Study on Lower Tumen River Area)

  • 이기석;남영
    • 한국측량학회지
    • /
    • 제17권3호
    • /
    • pp.311-320
    • /
    • 1999
  • 본 연구에서는 두만강 하류지역 다분광 TM영상의 변환기법과 그에 대한 감독분류방법을 비교 분석하였다. 총체적 분류 정확도는 최대우도법이 높으며 식생은 MNF와 TC 변환 영상에서 비교적 좋은 분류 결과를 얻을 수 있다. MNF, TC, NDVI 등 영상들로 구성된 7차원 영상은 3차원 영상보다 좋은 결과를 나타내며 그 중에서도 최대우도법의 분류 결과가 제일 좋았다. 다분광 영상은 두만강 지역 경제 개발 계획과 산업 입지 선정에 중요한 기초자료로 활용될 수 있다.

  • PDF

굴절률 분산을 반영한 고속 푸리에 변환 및 막두께 정밀결정 (Application of the modified fast fourier transformation weighted with refractive index dispersion far an accurate determination of film thickness)

  • 김상준;김상열
    • 한국광학회지
    • /
    • 제14권3호
    • /
    • pp.266-271
    • /
    • 2003
  • $\mu\textrm{m}$ 이상의 두께를 가지는 비교적 두꺼운 박막의 경우 박막에 의한 간섭효과로 인하여 나타나는 반사율 스펙트럼에서의 진동주기로부터 막의 두께를 얻는다. 대개 빠른 데이터 처리를 위해서 고속 푸리에 변환(Fast Fourier Transformation, FFI)을 사용하여 진동주기(또는 진동수)를 구한다. 본 연구에서는 반사율 또는 투과율 스펙트럼을 빛의 에너지 축상에서 푸리에 변환하는 종래의 방법을 개선하여 박막의 굴절률 분산을 반영하는 수정된 고속 푸리에 변환 방법을 최초로 도입하였다. 이 새로운 방법은 굴절률 분산에서 유래하는 유효굴절률 결정에서의 오차를 줄여주고 푸리에 변환 피크의 폭 넓어짐을 막아줌으로써 막 두께 결정의 정밀도를 크게 향상시킨다. 수정된 고속 푸리에 변환방법을 80 $\mu\textrm{m}$의 덮게층과 13 $\mu\textrm{m}$의 사이층이 있는 시료의 반사 스펙트럼에 적용하여 고 타당성을 확인하였다.

GMM based Nonlinear Transformation Methods for Voice Conversion

  • Vu, Hoang-Gia;Bae, Jae-Hyun;Oh, Yung-Hwan
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.67-70
    • /
    • 2005
  • Voice conversion (VC) is a technique for modifying the speech signal of a source speaker so that it sounds as if it is spoken by a target speaker. Most previous VC approaches used a linear transformation function based on GMM to convert the source spectral envelope to the target spectral envelope. In this paper, we propose several nonlinear GMM-based transformation functions in an attempt to deal with the over-smoothing effect of linear transformation. In order to obtain high-quality modifications of speech signals our VC system is implemented using the Harmonic plus Noise Model (HNM)analysis/synthesis framework. Experimental results are reported on the English corpus, MOCHA-TlMlT.

  • PDF

An Approach to Fuse IKONOS Images by Wavelet Transformation

  • Zhu, Changqing;Wang, Yuhai
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.776-782
    • /
    • 2003
  • This paper develops an approach to fuse 1-meter resolution spatial panchromatic and 4-meter resolution multi-spectral IKONOS images. The approach is based on the characteristics of four-band wavelet transformation. The experiment shows that the fused images based on four-band wavelet method contain with not only high spatial resolution but also rich spectral characteristic.

  • PDF

Effects of Spectral Transformations on Leaf C:N Ratio Inversion with Hyperspectral Data

  • Run-he, SHI;Da-fang, ZHUANG;Qiao-jing, QIAN;Zheng, NIU
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.322-324
    • /
    • 2003
  • Leaf C:N ratio is a new factor in the field of biochemical inversion with hyperspectral data. Effects of common-used spectral transformations including log(R), log(1/R), 1/R, etc. from 400nm to 2490nm on its inversion are compared. Results show that their effects on statistical modeling are not apparent. Continuum removal is used on original reflectance in the range of 2030nm to 2220nm, in which exists an apparent absorption peak due to cellulose, lignin, protein, etc. The effect is distinctive and tends to improve the precision of C:N ratio inversion. Further, it is a robust and physically based transformation.

  • PDF