• Title/Summary/Keyword: spectral smoothing (스펙트럼 완만화)

Speech Synthesis using Diphone Clustering and Improved Spectral Smoothing (다이폰 군집화와 개선된 스펙트럼 완만화에 의한 음성합성)

  • Jang, Hyo-Jong; Kim, Kwan-Jung; Kim, Gye-Young; Choi, Hyung-Il
    • The KIPS Transactions: Part B / v.10B no.6 / pp.665-672 / 2003
  • This paper describes a speech synthesis technique that concatenates unit phonemes. A major problem with this approach is the discontinuity that arises at the junctions between unit phonemes, especially between unit phonemes recorded by different speakers. To solve this problem, the paper uses clustered diphones and proposes a spectral smoothing technique that uses formant trajectories and the distribution characteristics of the spectrum, and also reflects human auditory characteristics. Specifically, the proposed technique clusters unit phonemes using the distribution characteristics of the spectrum at the junction between unit phonemes, decides the amount and extent of smoothing by considering human auditory characteristics at that junction, and then performs spectral smoothing using weights calculated along the time axis at the border of two diphones. The proposed technique removes the discontinuity and minimizes the distortion that spectral smoothing can cause. For performance evaluation, we test on five hundred diphones extracted from twenty sentences recorded by five speakers and show the experimental results.

Improvement of Synthetic Speech Quality using a New Spectral Smoothing Technique (새로운 스펙트럼 완만화에 의한 합성 음질 개선)

  • Jang, Hyo-Jong; Choi, Hyung-Il
    • Journal of KIISE: Software and Applications / v.30 no.11 / pp.1037-1043 / 2003
  • This paper describes a speech synthesis technique that uses the diphone as the unit phoneme. Speech synthesis is basically accomplished by concatenating unit phonemes, and its major problem is discontinuity at the junctions between unit phonemes. To solve this problem, this paper proposes a new spectral smoothing technique that reflects not only formant trajectories but also the distribution characteristics of the spectrum and human auditory characteristics. Specifically, the proposed technique decides the amount and extent of smoothing by considering human auditory characteristics at the junction between unit phonemes, and then performs spectral smoothing using weights calculated along the time axis at the border of two diphones. The proposed technique reduces the discontinuity and minimizes the distortion caused by spectral smoothing. For performance evaluation, we tested on five hundred diphones extracted from twenty sentences, using ETRI Voice DB samples and individually self-recorded samples.
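
The idea of smoothing with weights calculated along the time axis at a diphone border can be illustrated with a minimal sketch. This is not the authors' algorithm: the linear weights, the midpoint target spectrum, and the names `smooth_boundary` and `width` are all assumptions made for illustration, and the perceptually motivated choice of smoothing amount and extent described in the abstract is not modeled.

```python
import numpy as np

def smooth_boundary(left, right, width):
    """Cross-fade spectral envelopes toward a shared spectrum at the border.

    left, right: arrays of shape (frames, bins) for the two diphones.
    width: number of frames adjusted on each side (hypothetical parameter).
    """
    left = np.asarray(left, dtype=float).copy()
    right = np.asarray(right, dtype=float).copy()
    width = min(width, len(left), len(right))
    if width <= 0:
        return left, right
    # Assumed target: the mean of the two frames that meet at the border.
    target = 0.5 * (left[-1] + right[0])
    # Weights rise linearly to 1 at the border, so frames far from the
    # border are changed least.
    w = np.linspace(0.0, 1.0, width + 1)[1:, None]
    start = len(left) - width
    left[start:] = (1.0 - w) * left[start:] + w * target
    right[:width] = (1.0 - w[::-1]) * right[:width] + w[::-1] * target
    return left, right
```

With this weighting the last frame of the left diphone and the first frame of the right diphone become identical, while frames outside the `width` window are left untouched.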

Power Spectral Density of Antipodal Ultra Wideband Signal (Antipodal 초광대역(UWB) 신호의 전력 스펙트럼 밀도 분석)

  • Kim, Jong Han; Lee, Jung Suk; Kim, Yoo Chang; Kim, Won Hoo; Kim, Jung Sun
    • Journal of Advanced Navigation Technology / v.5 no.1 / pp.54-61 / 2001
  • Conventional ultra-wideband (UWB) systems use pulse position modulation to modulate the data signal. In this paper, however, we derive the power spectral density of a time-hopped antipodal signal using stochastic process theory. The UWB signal employs a Gaussian monopulse and a Rayleigh monopulse with a pulse width of 0.5 ns and an interval of 5 ns. The comb lines that arise unintentionally in the spectrum can be clearly reduced by the time-hopping code, so this code can be used both to channelize for multiple access and to minimize interference with other communication systems.

Voice Personality Transformation Using a Probabilistic Method (확률적 방법을 이용한 음성 개성 변환)

  • Lee, Ki-Seung
    • The Journal of the Acoustical Society of Korea / v.24 no.3 / pp.150-159 / 2005
  • This paper addresses a voice personality transformation algorithm that makes one person's voice sound like another person's voice. In the proposed method, a person's voice is represented by the LPC cepstrum, pitch period, and speaking rate, and an appropriate transformation rule is constructed for each parameter. A Gaussian Mixture Model (GMM) is used to model one speaker's LPC cepstra, and a conditional probability is used to model the relationship between the two speakers' LPC cepstra. To obtain the parameters of each probabilistic model, a Maximum Likelihood (ML) estimation method is employed. The transformed LPC cepstra are obtained using a Minimum Mean Square Error (MMSE) criterion. Pitch period and speaking rate are used as the parameters for prosody transformation, which is implemented using the ratio of the average values. The proposed method outperforms the previous VQ-based method in objective measures, including the average cepstrum distance reduction ratio and the likelihood increasing ratio. In subjective tests, we obtained almost the same correct identification ratio as the previous method, and we also confirmed that high-quality transformed speech is obtained, owing to spectral contours that evolve smoothly over time.
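
The prosody step in this abstract, transformation "using the ratio of the average values", can be sketched as follows. The function names `average_ratio` and `scale` are hypothetical, and applying a single global ratio to every value is an assumption; the abstract does not say how the averages are computed or applied.

```python
from statistics import mean

def average_ratio(source_values, target_values):
    """Ratio of the target speaker's average to the source speaker's average."""
    return mean(target_values) / mean(source_values)

def scale(values, ratio):
    """Apply one global ratio to every value (assumed usage)."""
    return [v * ratio for v in values]

# Example with made-up pitch periods in milliseconds.
ratio = average_ratio([8.0, 10.0, 12.0], [4.0, 5.0, 6.0])
converted = scale([9.0, 11.0], ratio)
```

The same two functions could be reused for speaking rate by passing average durations instead of pitch periods.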

A Study on the Development of Optical Glass (LaF, KzFS1, LBO) (광학유리(LaF, KzFS1, LBO) 개발에 관한 연구)

  • Cha, Jung Won; Park, Moon Chan
    • Journal of Korean Ophthalmic Optics Society / v.5 no.1 / pp.19-30 / 2000
  • LaF, KzFS1, and LBO glasses were manufactured successfully, using a platinum crucible for LaF and a clay crucible for KzFS1 and LBO. The glasses were optically transparent, and their refractive indices were measured by the minimum deviation method using a prism. The measured refractive indices are $n_d$ = 1.770 for LaF, $n_d$ = 1.603 for KzFS1, and $n_d$ = 1.560 for LBO. The measured transmittance in the visible range is 85% for LaF, 83% for KzFS1, and 89% for LBO. These glasses show no absorption in the visible range and are therefore colorless.
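
For reference, the minimum deviation method relates the refractive index to the prism apex angle A and the minimum deviation angle D by n = sin((A + D)/2) / sin(A/2). The sketch below applies that standard relation; the 60° apex angle in the example is an assumption, since the abstract does not give the prism geometry or the measured angles.

```python
import math

def refractive_index(apex_deg, min_dev_deg):
    """Refractive index from apex angle and minimum deviation (degrees)."""
    a = math.radians(apex_deg)
    d = math.radians(min_dev_deg)
    return math.sin((a + d) / 2.0) / math.sin(a / 2.0)

def minimum_deviation(apex_deg, n):
    """Inverse relation: minimum deviation angle (degrees) for index n."""
    a = math.radians(apex_deg)
    return math.degrees(2.0 * math.asin(n * math.sin(a / 2.0)) - a)

# Deviation a hypothetical 60-degree prism would show for the reported LaF index.
d_laf = minimum_deviation(60.0, 1.770)
n_back = refractive_index(60.0, d_laf)
```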

Geomagnetic Field Properties and Magnetic Interpretation in the Southern Part of the Ulleung Basin (鬱陵盆地 남단해역의 地磁場 特性 및 磁氣異常 解析)

  • 박찬홍; 석봉출
    • 한국해양학회지 (The Journal of the Oceanological Society of Korea) / v.26 no.2 / pp.117-132 / 1991
  • Marine total magnetic intensity data over the southern part of the Ulleung Basin and geomagnetic data measured at a land base station are analyzed. Fourteen days of geomagnetic observation at a fixed on-land base station showed how the geomagnetic field behaves around the study area. The geomagnetic data from the base station can also be used to correct for diurnal variation. Magnetic anomalies in the study area do not reflect the sea-bottom topography but mainly the subsurface basement. The southern part of the Ulleung Basin can be divided into two zones according to their anomaly patterns: along the coastal shelves, isolated anomalies with short wavelength and strong amplitude are dominant, and toward the open sea the anomalies become much more subdued. The high-anomaly zone adjoining the land is interpreted as being caused by granitic intrusives or volcanic rocks, and the weak-anomaly zone toward the outer sea as arising from a deep basement. Spectrum analysis is applied to estimate magnetic basement depths from three anomaly profiles with long wavelength and weak amplitude toward the outer sea. The calculated depths are 7.0 km, 5.0 km, and 2.6 km, respectively, starting from the outermost profile. The basement might be correlated with the mixed layer of tuff, basalt, and sediment that has been defined as the L-2 layer in the Yamato Basin and the Japan Basin.
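
The abstract does not say which spectral method was used. One common approach, assumed here purely for illustration, treats the power spectrum of the anomaly as decaying like exp(-2hk) with angular wavenumber k, so the source depth h follows from the slope of the log power spectrum. The function name `depth_from_spectrum` and the synthetic data are hypothetical.

```python
import numpy as np

def depth_from_spectrum(k, power):
    """Estimate source depth from the slope of ln(power) versus k.

    k: angular wavenumber in rad/km; power: power spectrum values (> 0).
    Assumes power ~ C * exp(-2 * h * k), so h = -slope / 2 (in km).
    """
    slope, _intercept = np.polyfit(k, np.log(power), 1)
    return -slope / 2.0

# Synthetic spectrum for a source at 5 km depth (made-up data).
k = np.linspace(0.1, 1.0, 20)
power = 3.0 * np.exp(-2.0 * 5.0 * k)
estimated_depth = depth_from_spectrum(k, power)
```

In practice the fit would be restricted to the wavenumber band where the log spectrum is close to linear.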