• Title/Summary/Keyword: spectral envelope (스펙트럼 포락선)


The Development of Speech Synthesizer In Korean TTS System (한국어 문어변환 시스템 내에서의 음성 합성기 개발)

  • 강찬희;진용옥
    • The Journal of the Acoustical Society of Korea / v.12 no.2 / pp.14-27 / 1993
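  • This paper reports the results of applying a time-domain synthesis method to the speech synthesizer of a Korean text-to-speech system. The method uses as its synthesis unit a single pitch-period waveform of about 6 to 9 ms, extracted from roughly every 40 ms of the speech waveform. In tests, four types of Korean syllables could be synthesized, prosodic elements such as length and stress were easy to control, and the synthesis algorithm was simple enough for real-time processing. Synthesizing sentence-level speech, however, will require further study of the varied pitch patterns within sentences and of how to control them efficiently. The synthesized speech was evaluated by comparing the original and synthesized waveforms in the time domain, by comparing the similarity of their spectral envelopes in the frequency domain, and by a listening test on the synthesized speech.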


Spectral Modeling of Haegeum Using Cepstral Analysis (캡스트럼 분석을 이용한 해금의 스펙트럼 모델링)

  • Hong, Yeon-Woo;Kang, Myeong-Su;Cho, Sang-Jin;Kim, Jong-Myon;Lee, Jung-Chul;Chong, Ui-Pil
    • The Journal of the Acoustical Society of Korea / v.29 no.4 / pp.243-250 / 2010
  • This paper proposes a spectral model of the Haegeum, a traditional Korean instrument, that uses cepstral analysis to describe its time-varying sound naturally. To obtain precise cepstral analysis results, the frame size is set to three periods of the input signal, and more cepstral coefficients are used to extract the formants. Performance is improved by flexibly adjusting the cutoff frequency of the bandpass filter according to the resonances during synthesis of the sinusoidal components, and by deleting the peaks that remain in the residual signal. To detect pitch changes, the input frames are classified into silence, attack, and sustain regions, and the region containing the current frame is determined. The proposed method then readjusts the frame size according to the fundamental frequency when the current frame is in the attack region, and corrects fundamental-frequency extraction errors for frames in the sustain region. With these steps, the synthesized sounds are much closer to the originals. In a listening test, a Haegeum player rated the synthesized sounds as almost identical to the originals (96~100 % similar).
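The cepstral step summarized in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it only shows generic cepstral smoothing of a log-magnitude spectrum on a frame spanning three periods. The function names, the test signal, the Hann window, and the choice of 12 retained coefficients are all assumptions made for illustration.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform (adequate for short frames)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse DFT matching dft() above."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]


def cepstral_envelope(frame, n_coeffs):
    """Return (log-magnitude spectrum, cepstrally smoothed envelope)."""
    n = len(frame)
    # Hann window (an assumption) to reduce leakage at the frame edges.
    windowed = [s * (0.5 - 0.5 * math.cos(2 * math.pi * i / n))
                for i, s in enumerate(frame)]
    log_mag = [math.log(abs(c) + 1e-12) for c in dft(windowed)]
    # Real cepstrum: inverse DFT of the log-magnitude spectrum.
    cepstrum = [c.real for c in idft(log_mag)]
    # Low-quefrency lifter: keep the first n_coeffs coefficients and their mirror.
    liftered = [c if (q < n_coeffs or q > n - n_coeffs) else 0.0
                for q, c in enumerate(cepstrum)]
    # Back to the frequency domain: the smoothed log-magnitude envelope.
    envelope = [c.real for c in dft(liftered)]
    return log_mag, envelope


# Hypothetical test tone: 8 harmonics, period of 32 samples, frame of 3 periods.
period = 32
frame = [sum(math.cos(2 * math.pi * h * t / period) / h for h in range(1, 9))
         for t in range(3 * period)]
log_mag, envelope = cepstral_envelope(frame, n_coeffs=12)
```

Retaining more coefficients lets the envelope follow finer spectral detail, which is the trade-off the abstract refers to when it says more cepstral coefficients are used to extract formants.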

Performance of 8SQAM System in a Nonlinearly Amplified SCPC-FDMA Channel Interference Environment (비선형 증폭 SCPC-FDMA 채널 간섭 환경에서 8SQAM 시스템의 성능)

  • 성봉훈;서종수
    • The Journal of Korean Institute of Communications and Information Sciences / v.28 no.7C / pp.678-687 / 2003
  • 8SQAM (8-state Superposed Quadrature Amplitude Modulation), a new modem technique for power- and bandwidth-limited digital communication systems, generates output signals with smooth, continuous phase transitions and reduced envelope fluctuation by maintaining correlation between the amplitudes and phases of two successive symbols. The 8SQAM signal is also free of inter-symbol interference (ISI) and has a compact power spectrum. Accordingly, compared with conventional 8PSK, 8SQAM is less affected by inter-modulation (IM), ISI, and adjacent channel interference (ACI) in a nonlinearly amplified multi-channel (SCPC-FDMA) environment. In this paper, the performance of the 8SQAM system in a nonlinearly amplified multi-channel interference environment is analyzed via computer simulation. The simulation results show that 8SQAM outperforms 8PSK with a roll-off factor of $\alpha$ = 0.3 by 2.7 dB in the CNR required to maintain BER = 1$\times$10$^{-4}$ when the input back-off (IBO) of the HPA is 1 dB and the channel spacing is 41.7% of the data bit rate (i.e., spectral efficiency = 2.40 b/s/Hz).

Artificial speech bandwidth extension technique based on opus codec using deep belief network (심층 신뢰 신경망을 이용한 오푸스 코덱 기반 인공 음성 대역 확장 기술)

  • Choi, Yoonsang;Li, Yaxing;Kang, Sangwon
    • The Journal of the Acoustical Society of Korea / v.36 no.1 / pp.70-77 / 2017
  • Bandwidth extension is a technique that improves speech quality, intelligibility, and naturalness by extending 300 ~ 3,400 Hz narrowband speech to 50 ~ 7,000 Hz wideband speech. In this paper, an Artificial Bandwidth Extension (ABE) module embedded in the Opus audio decoder is designed using narrowband speech information to reduce the computational complexity of LPC (Linear Prediction Coding) and LSF (Line Spectral Frequencies) analysis and the algorithmic delay of the ABE module. We propose a spectral envelope extension method using a DBN (Deep Belief Network), a deep learning technique; the proposed scheme produces a better extended spectrum than the traditional codebook mapping method.

Source localization technique for metallic impact source by using phase delay between different type sensors (다종 센서간 위상 차이를 이용한 충격 위치추정 기법)

  • Choi, Kyoung-Sik;Choi, Young-Chul;Park, Jin-Ho;Kim, Whan-Woo
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2008.11a / pp.687-692 / 2008
  • In a nuclear power plant, loose part monitoring and its diagnostic technique are major issues for ensuring the structural integrity of the reactor system. Typically, accelerometers are mounted on the surface of a reactor vessel to localize impacts of metallic objects on the reactor system. However, in some cases, there are too few accelerometers to estimate the impact location precisely. In such cases, one alternative is to use other types of sensors that can measure the vibration of the reactor structure, even though their measuring frequency ranges differ. The AE sensors installed on the reactor structure can serve as additional sensors for loose part monitoring. In this paper, we propose a new method that estimates the impact location using the accelerometer and AE signals simultaneously. The feasibility of the proposed method is verified experimentally. The experimental results demonstrate that the method can enhance the reliability and precision of loose part monitoring.


Source Localization Technique for Metallic Impact Source by Using Phase Delay between Different Type Sensors (다종 센서간 위상 차이를 이용한 충격 위치추정 기법)

  • Choi, Kyoung-Sik;Choi, Young-Chul;Park, Jin-Ho;Kim, Whan-Woo
    • Transactions of the Korean Society for Noise and Vibration Engineering / v.18 no.11 / pp.1143-1149 / 2008
  • In a nuclear power plant, loose part monitoring and its diagnostic technique are major issues for ensuring the structural integrity of the reactor system. Typically, accelerometers are mounted on the surface of a reactor vessel to localize impacts of metallic objects on the reactor system. However, in some cases, there are too few accelerometers to estimate the impact location precisely. In such cases, one alternative is to use other types of sensors that can measure the vibration of the reactor structure, even though their measuring frequency ranges differ. The AE sensors installed on the reactor structure can serve as additional sensors for loose part monitoring. In this paper, we propose a new method that estimates the impact location using the accelerometer and AE signals simultaneously. The feasibility of the proposed method is verified experimentally. The experimental results demonstrate that the method can enhance the reliability and precision of loose part monitoring.

Harmonic Signal Linearization of Nonlinear Power Amplifier Using Digital Predistortion for Multiband Wireless Transmitter (다중 대역 송신을 위한 디지털 사전 왜곡 기법을 이용한 비선형 전력 증폭기의 고조파 신호 선형화)

  • Oh, Kyung-Tae;Ku, Hyun-Chul;Kim, Dong-Su;Hahn, Cheol-Koo
    • The Journal of Korean Institute of Electromagnetic Engineering and Science / v.19 no.12 / pp.1339-1349 / 2008
  • In this paper, the nonlinear relationship between the input complex envelope and the output complex envelope of the m-th harmonic zone is analyzed theoretically, and AM/$AM_m$ and AM/$PM_m$ are defined. A scheme for extracting these characteristics from measured in-phase and quadrature-phase data is suggested. The proposed analysis is verified with fundamental-to-fundamental and fundamental-to-third-harmonic measurements of an InGaP power amplifier (PA). Based on the harmonic-band nonlinear analysis and extraction scheme, a new technique for sending a signal in the m-th harmonic band with a harmonic signal linearization Digital Predistortion (DPD) scheme is presented. A numerical analysis and Look-Up Table (LUT) based DPD algorithms that linearize the output signal in the m-th harmonic zone are developed. For 16-QAM and 64-QAM input signals, a DPD for third-harmonic signal linearization is implemented, and the output spectrum and signal constellation are measured. The wholly distorted signals are linearized, and the measured Error Vector Magnitudes (EVM) are 6.4 % and 6.5 %, respectively. The results show that the proposed scheme linearizes nonlinearly distorted harmonic-band signals. The proposed nonlinear analysis and predistortion scheme can be applied to multiband transmitters in next-generation software defined radio (SDR)/cognitive radio (CR) wireless systems.

Time-Scale Modification of Polyphonic Audio Signals Using Sinusoidal Modeling (정현파 모델링을 이용한 폴리포닉 오디오 신호의 시간축 변화)

  • 장호근;박주성
    • The Journal of the Acoustical Society of Korea / v.20 no.2 / pp.77-85 / 2001
  • This paper proposes a method for time-scale modification of polyphonic audio signals based on a sinusoidal model. The signals are modeled as a sinusoidal component plus a noise component. A multiresolution filter bank is designed that splits the input signal into six octave-spaced subbands without aliasing, and sinusoidal modeling is applied to each subband signal. To alleviate the smearing of transients in time-scale modification, a dynamic segmentation method is applied to the subbands; it determines the analysis-synthesis frame size adaptively to fit the time-frequency characteristics of each subband signal. To extract the sinusoidal components and calculate their parameters, a matching pursuit algorithm is applied to each analysis frame of the subband signal. In accordance with the spectrum analysis, a psychoacoustic model implementing the effect of frequency masking is combined with matching pursuit to provide a reasonable stopping condition for the iteration and to reduce the number of sinusoids. The noise component, obtained by subtracting the signal synthesized from the sinusoidal components from the original signal, is modeled by a line-segment model of the short-time spectral envelope. Simulation results for various polyphonic audio signals show that the suggested sinusoidal modeling can synthesize the original signal without loss of perceptual quality and, because it represents transients without perceptual loss, performs more robust, higher-quality time-scale modification for large scale factors.
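The matching pursuit step mentioned in this abstract can be sketched roughly as follows. This is an illustrative reduction, not the paper's algorithm: the psychoacoustic stopping condition is replaced by a simple residual-energy threshold (`stop_ratio`, an assumption), and the dictionary is a hypothetical grid of candidate frequencies given in cycles per sample. The function name and parameters are likewise hypothetical.

```python
import math


def matching_pursuit_sinusoids(frame, freqs, max_iter=10, stop_ratio=1e-3):
    """Greedily extract sinusoids from a frame.

    Returns (picked, residual), where picked holds (frequency, amplitude, phase)
    tuples so that each component is amplitude * cos(2*pi*frequency*t + phase).
    """
    n = len(frame)
    residual = list(frame)
    total_energy = sum(s * s for s in frame)
    picked = []
    for _ in range(max_iter):
        # Stand-in stop rule (assumption): quit once the residual energy is small.
        if sum(r * r for r in residual) <= stop_ratio * total_energy:
            break
        best = None  # (captured energy, frequency, cos weight, sin weight)
        for f in freqs:
            c = [math.cos(2 * math.pi * f * t) for t in range(n)]
            s = [math.sin(2 * math.pi * f * t) for t in range(n)]
            cc = sum(v * v for v in c)
            ss = sum(v * v for v in s)
            cs = sum(u * v for u, v in zip(c, s))
            rc = sum(r * v for r, v in zip(residual, c))
            rs = sum(r * v for r, v in zip(residual, s))
            det = cc * ss - cs * cs
            if det < 1e-9:
                continue  # degenerate atom pair (e.g. f = 0), skip it
            # Least-squares projection of the residual onto span{cos, sin}.
            a = (rc * ss - rs * cs) / det
            b = (rs * cc - rc * cs) / det
            captured = a * rc + b * rs
            if best is None or captured > best[0]:
                best = (captured, f, a, b)
        if best is None:
            break
        _, f, a, b = best
        # Subtract the selected sinusoid from the residual.
        residual = [r - a * math.cos(2 * math.pi * f * t) - b * math.sin(2 * math.pi * f * t)
                    for t, r in enumerate(residual)]
        picked.append((f, math.hypot(a, b), math.atan2(-b, a)))
    return picked, residual


# Hypothetical frame: two sinusoids on the frequency grid.
n = 64
freqs = [k / n for k in range(1, n // 2)]
frame = [math.cos(2 * math.pi * 0.125 * t + 0.3) + 0.5 * math.cos(2 * math.pi * 0.25 * t)
         for t in range(n)]
picked, residual = matching_pursuit_sinusoids(frame, freqs)
```

In this sketch the strongest component is removed first, so the order of `picked` reflects decreasing captured energy; a masking-based rule would instead stop once the remaining candidates fall below an audibility threshold.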


Implementation of a Real-time Multipath Fading Channel Simulator Using a Hybrid DSP-FPGA Architecture (DSP-FPGA 구조를 갖는 다중경로 페이딩 채널 시뮬레이터 구현)

  • 이주현;이찬길
    • The Journal of the Acoustical Society of Korea / v.23 no.1 / pp.17-23 / 2004
  • The mobile radio channel can be simulated as a complex-valued random process with a narrow-band spectrum. This paper describes a real-time implementation of that process using a TMS320C6414 digital signal processor and an XC2VP30 Virtex FPGA. The simulator presented here models not only flat fading but also frequency-selective fading mobile channel conditions. To replicate the statistical characteristics of the multipath fading environment with minimum computational burden, multi-rate techniques are employed to resolve practical problems such as variable sampling rates. Because it is implemented digitally, the simulator produces accurate and consistent results. It is very flexible and, through a graphical user interface, simple to program for various field conditions in mobile communications.
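The idea of a complex-valued, narrow-band random process for flat fading can be illustrated with a short sum-of-sinusoids sketch. This is a generic textbook-style model and not the DSP-FPGA design described in the abstract; the function name, the number of paths, and all parameter values are assumptions chosen for illustration.

```python
import cmath
import math
import random


def flat_fading_gain(n_samples, doppler_hz, sample_rate_hz, n_paths=32, seed=0):
    """Return a list of complex channel gains for a flat-fading channel.

    Each path gets a random arrival angle (which sets its Doppler shift) and a
    random initial phase; the paths are summed and scaled by 1/sqrt(n_paths).
    """
    rng = random.Random(seed)
    paths = [(doppler_hz * math.cos(rng.uniform(0.0, 2.0 * math.pi)),
              rng.uniform(0.0, 2.0 * math.pi))
             for _ in range(n_paths)]
    scale = 1.0 / math.sqrt(n_paths)
    return [scale * sum(cmath.exp(1j * (2.0 * math.pi * shift * k / sample_rate_hz + phase))
                        for shift, phase in paths)
            for k in range(n_samples)]


# Hypothetical settings: 100 Hz maximum Doppler shift, 1 kHz gain update rate.
gains = flat_fading_gain(n_samples=200, doppler_hz=100.0, sample_rate_hz=1000.0)
```

A transmitted baseband sample `x[k]` would then be faded as `gains[k] * x[k]`. A frequency-selective channel could be approximated by combining several such processes, one per delayed tap, although the abstract does not say how the paper structures its taps.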

Optical properties of Nb2O5 thin films prepared by ion beam assisted deposition (이온빔 보조 증착 Nb2O5 박막의 광학적 특성)

  • 우석훈;남성림;정부영;황보창권;문일춘
    • Korean Journal of Optics and Photonics / v.13 no.2 / pp.105-112 / 2002
  • We studied the optical and structural properties of conventional and ion-beam-assisted-deposition (IBAD) Nb$_2$O$_{5}$ films which were evaporated by an electron beam gun. The vacuum-to-air spectral shift and the cross sectional SEM images of the Nb$_2$O$_{5}$ films were investigated. The results show that the IBAD Nb$_2$O$_{5}$ films have a higher packing density than the conventional Nb$_2$O$_{5}$ films. The average refractive index of IBAD Nb$_2$O$_{5}$ films was increased, while the extinction coefficient was decreased compared with the conventional films. As the oxygen flow was increased, the average refractive index and extinction coefficient of the conventional and IBAD films decreased. Both the conventional and IBAD Nb$_2$O$_{5}$ films showed inhomogeneity in refractive index, and the degree of inhomogeneity of the IBAD Nb$_2$O$_{5}$ films became larger as the ion current density was increased. All Nb$_2$O$_{5}$ films were found to be amorphous by x-ray diffraction (XRD) analysis, and hence the crystal structure of Nb$_2$O$_{5}$ films was not changed by IBAD.