• Title/Summary/Keyword: voice parameter

Search Result 179, Processing Time 0.025 seconds

Optimum Parameter and Performance Analysis of Outer Loop Power Control in IMT-2000 (IMT-2000 외부회로 전력제어의 최적변수 및 성능 분석)

  • 이재성;장영민;전기준;임순용
    • Proceedings of the IEEK Conference / 2000.11a / pp.121-124 / 2000
  • In IMT-2000 systems, the outer loop dynamically adjusts the target SIR so that adequate performance, in terms of the frame error rate (FER) and the true quality measure, is achieved. This paper uses an analytic model for outer loop power control (OLPC), which adjusts the target SIR in IMT-2000. The analytic model is based on a discrete-time Markov chain for the SIR of voice traffic. The model can be used to find the optimum step size for voice traffic in fast fading environments. The optimum step size influences the performance of OLPC: as the step size decreases, the average target SIR increases and the average FER decreases.
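
A minimal sketch of one target-SIR update rule, assuming the widely used "jump" (sawtooth) scheme rather than the paper's own Markov-chain model: the target is raised by a step after a frame error and lowered by a smaller amount after a good frame, so the expected drift is zero at the target FER. The function name, `step_db`, and `fer_target` are illustrative and do not come from the abstract.

```python
def update_target_sir(target_db, frame_error, step_db=0.5, fer_target=0.01):
    """Return the next target SIR (dB) after one frame.

    Assumed jump rule (not taken from the paper): raise the target by
    step_db on a frame error, otherwise lower it by
    step_db * fer_target / (1 - fer_target).
    """
    if frame_error:
        return target_db + step_db
    return target_db - step_db * fer_target / (1.0 - fer_target)


# One error followed by 99 good frames brings the target back to its
# starting value, i.e. the rule is balanced at 1 error per 100 frames.
target = 5.0
for frame_error in [True] + [False] * 99:
    target = update_target_sir(target, frame_error)
print(round(target, 6))
```

The step size sets how far the target moves per frame, which is the quantity the abstract describes as having an optimum.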

Voice Source Estimation Using Robust Sequential SVD (견실 순차 특이치분해를 이용한 음원추정)

  • 홍성훈
    • Proceedings of the Acoustical Society of Korea Conference / 1993.06a / pp.75-79 / 1993
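  • This paper proposes a new sequential processing algorithm for estimating rapidly varying voice source waveforms. First, (1) we examine the problems of RLS (recursive least squares), a representative conventional sequential analysis method; (2) to address them, we reconstruct the observation matrix using a reduced-rank SVD (singular value decomposition) of optimal order; and (3) we apply a robustness concept to find the optimal vocal tract parameters, then apply an inverse filter to separate the voice source effectively. With the proposed method, rapidly varying voice source waveforms can be estimated well, and the vocal tract parameters, with the source characteristics separated out, can also be estimated effectively. This work is essential for improving naturalness and realizing speaker individuality in speech synthesis, and can be used to represent various types of voices. It can also be used in speech coding, speaker recognition, and speech recognition.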
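
A minimal sketch of the inverse-filtering step only, assuming the vocal tract is modeled as an all-pole filter with known coefficients. It does not implement the RLS analysis or the robust reduced-rank SVD described in the abstract; the function names and coefficient values are made up for illustration.

```python
def inverse_filter(signal, lpc_coeffs):
    """Apply the all-zero inverse filter A(z) = 1 - sum_k a_k z^-k.

    signal: list of samples s[n]; lpc_coeffs: [a_1, ..., a_p].
    Returns e[n] = s[n] - sum_k a_k * s[n-k], treating samples before
    the start of the signal as zero.
    """
    residual = []
    for n, sample in enumerate(signal):
        predicted = sum(a * signal[n - k]
                        for k, a in enumerate(lpc_coeffs, start=1)
                        if n - k >= 0)
        residual.append(sample - predicted)
    return residual


def synthesize(excitation, lpc_coeffs):
    """All-pole synthesis s[n] = e[n] + sum_k a_k * s[n-k] (toy vocal tract)."""
    out = []
    for n, e in enumerate(excitation):
        out.append(e + sum(a * out[n - k]
                           for k, a in enumerate(lpc_coeffs, start=1)
                           if n - k >= 0))
    return out


# Round trip with made-up values: the inverse filter recovers the source.
source = [1.0, 0.0, 0.0, -0.5, 0.0, 0.25]
coeffs = [0.9, -0.2]
speech = synthesize(source, coeffs)
recovered = inverse_filter(speech, coeffs)
```

In practice the coefficients are not known in advance, which is why the quality of the vocal tract parameter estimate determines how well the source is recovered.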

A Study on Annoyance of Interior Noise on Town-Bus

  • Park, Hyung Woo;Kim, Sung Han
    • International Journal of Internet, Broadcasting and Communication / v.9 no.2 / pp.42-47 / 2017
  • The size of urban areas is growing and the functions of cities are becoming more complicated. Many people live in cities, and urban life is becoming closer and more interlinked with neighboring people in many respects. This is especially so when people use public transportation, where they are exposed to one another even without being acquainted. Seoul is the most crowded place in Korea. In Seoul, village buses serve the narrow streets, and the people who use these buses seek ride comfort, good air quality, and low noise inside the vehicle. In this paper, we determine the degree of annoyance caused by the noise inside the town bus on a dB scale, and confirm the annoyance and its effect in that situation. The interior noise did not differ greatly between new and old vehicles. However, a difference in annoyance according to the skill of the bus driver was confirmed.

Performance Analysis of Voice over ATM using AAL2 based on Packet Delay Evaluation (ATM망에서 AAL2를 이용한 음성패킷 전송에 관한 성능분석)

  • 김원순;김태준;홍석원;오창석
    • The Journal of Korean Institute of Communications and Information Sciences / v.24 no.10B / pp.1852-1860 / 1999
  • This paper studies the performance of AAL2 for variable-rate real-time services in an ATM network using a discrete-time simulation model. The input parameters of the simulation are the packet fill delay for AAL2 PDU generation, the guard time for ATM cell generation, the burstiness, and the number of channels. By varying these parameters, we obtained the end-to-end delay variation and throughput, and analyzed the effect of each parameter on the performance of voice packet service.
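
To make the packet fill delay concrete, here is a small sketch that assumes a constant-rate voice coder filling a fixed-size payload. The paper studies variable-rate sources, so this is a simplification, and the 64 kbit/s and 40-byte values are illustrative only.

```python
def packet_fill_delay_ms(payload_bytes, codec_rate_bps):
    """Time (ms) a constant-rate voice coder needs to fill one packet payload."""
    return payload_bytes * 8 * 1000.0 / codec_rate_bps


# Illustrative values (not from the paper): 64 kbit/s PCM, 40-byte payload.
print(packet_fill_delay_ms(40, 64_000))   # 5.0
```

Larger payloads reduce header overhead but add delay before a packet can be sent, which is the trade-off a fill-delay parameter controls.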

The Change of the Correlation between GRBAS Scales and MDVP Parameters according to the Different Length of Voice Samples for MDVP Analysis (음성 Sample의 길이 변화에 따른 MDVP 측정치와 GRBAS 척도간의 상관관계 변화 비교)

  • Pyo, H.Y.;Sim, H.S.;Lim, S.E.
    • Speech Sciences / v.7 no.2 / pp.71-81 / 2000
  • The present study was performed to find an efficient and useful voice sample length for MDVP analysis by investigating the correlation between perceptual GRBAS scales and objective MDVP measurements, using five different voice sample lengths from 20 patients with vocal polyps: 0.5, 1.0, 1.5, 2.0, and 2.5 seconds. The results are as follows: (1) The 1.5-second sample showed the highest correlation between perceptual judgement and objective measurement, and the 1.0-second sample showed the lowest. The difference between the two samples was found in the number of statistically significant correlated pairs of MDVP parameter and GRBAS scale. (2) The two extremes of length, the 0.5-second and 2.5-second samples, showed no statistically significant difference.

The Effect of Noise on the Normal and Pathological Voice (소음환경이 정상 및 병적음성에 미치는 영향)

  • Hong, Ki-Hwan;Yang, Yoon-Soo;Kim, Hyun-Gi
    • Speech Sciences / v.9 no.4 / pp.27-38 / 2002
  • The purpose of this article is to present the acoustic parameters (VOT, jitter, shimmer, vF0, vAm, NHR, SPI, VTI, DVB, DSH) for consonants (/pipi/, /$p^{h}ip^{h}i$/, /p'ip'i/) and sustained vowels (/a/, /e/, /i/) produced by normal subjects and dysphonia patients at two levels of vocal effort (normal, high), with the high effort elicited through the Lombard effect using 60 dB white noise. The Lombard effect refers to the increase in vocal effort in noisy situations. At normal vocal effort, the acoustic parameter values of patients are generally greater than those of normal subjects. In noisy situations, a significant decrease in acoustic values is seen in normal subjects compared with dysphonia patients. The clinical implication of this finding is that vocal quality in dysphonia is not compensated by vocal effort as well as it is in normal subjects, because of the inefficiency caused by abnormal vocal fold appearance and function. Based on this result, we can counsel patients that voice quality cannot be improved as much as they expect.
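
For readers unfamiliar with two of the listed parameters, the sketch below computes jitter and shimmer using the common "local" definition (mean absolute difference between consecutive cycles, divided by the mean, in percent). The abstract does not state which definition the study used, so this is an assumption, and the cycle values are made up.

```python
def relative_perturbation(values):
    """Mean absolute difference of consecutive values over the mean value, in %."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * (sum(diffs) / len(diffs)) / (sum(values) / len(values))


# Made-up cycle measurements for illustration.
periods_ms = [5.0, 5.1, 4.9, 5.0]        # pitch period of each cycle
amplitudes = [1.00, 0.95, 1.05, 1.00]    # peak amplitude of each cycle
jitter_percent = relative_perturbation(periods_ms)
shimmer_percent = relative_perturbation(amplitudes)
```

Jitter applies the measure to cycle periods and shimmer applies it to cycle amplitudes, so a perfectly periodic voice would score zero on both.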

A Single Channel Voice Activity Detection for Noisy Environments Using Wavelet Packet Decomposition and Teager Energy (웨이블렛 패킷 변환과 Teager 에너지를 이용한 잡음 환경에서의 단일 채널 음성 판별)

  • Koo, Boneung
    • The Journal of the Acoustical Society of Korea / v.33 no.2 / pp.139-145 / 2014
  • In this paper, a feature parameter is obtained by applying the Teager energy to the WPD (Wavelet Packet Decomposition) coefficients. The threshold value is obtained from the means and standard deviations of nonspeech frames. Experimental results using the TIMIT speech and NOISEX-92 noise databases show that the proposed algorithm is superior to the typical VAD algorithm. ROC (Receiver Operating Characteristic) curves are used to compare the performance of VADs for SNR values ranging from 10 to -10 dB.
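
A minimal sketch of the two ingredients named in the abstract, under stated assumptions: the discrete Teager energy operator is applied directly to a list of samples (standing in for the WPD coefficients, whose computation is omitted), and the threshold is taken as mean plus `k` standard deviations of the nonspeech-frame features. The multiplier `k` and the function names are hypothetical; the paper's exact rule is not given in the abstract.

```python
import math
import statistics


def teager_energy(x):
    """Discrete Teager energy: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def frame_feature(frame):
    """Mean Teager energy of one frame (stand-in for the per-frame feature)."""
    return statistics.mean(teager_energy(frame))


def threshold_from_noise(noise_features, k=3.0):
    """Assumed rule: mean + k * standard deviation of nonspeech-frame features."""
    return statistics.mean(noise_features) + k * statistics.pstdev(noise_features)


# For a pure sinusoid A*cos(w*n), the Teager energy is the constant A^2 * sin(w)^2.
A, w = 2.0, 0.3
tone = [A * math.cos(w * n) for n in range(50)]
psi = teager_energy(tone)
```

Because the operator responds to both amplitude and frequency, a frame is flagged as speech when its feature exceeds the noise-derived threshold.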

Synthetic Speech Quality Improvement By Glottal parameter Interpolation - Preliminary study on open quotient interpolation in the speech corpus - (성대특성 보간에 의한 합성음의 음질향상 - 음성코퍼스 내 개구간 비 보간을 위한 기초연구 -)

  • Bae, Jae-Hyun;Oh, Yung-Hwa
    • Proceedings of the KSPS conference / 2005.11a / pp.63-66 / 2005
  • For large-corpus-based TTS, the consistency of the speech corpus is very important, because inconsistency of speech quality in the corpus may result in distortion at the concatenation points. Because of this inconsistency, a large corpus must be tuned repeatedly. One of the reasons for the inconsistency of the speech corpus is the differing glottal characteristics of the speech sentences in the corpus. In this paper, we adjust the glottal characteristics of the speech in the corpus to prevent this distortion, and experimental results are shown.

Design and Implementation of Speaker Verification System Using Voice (음성을 이용한 화자 검증기 설계 및 구현)

  • 지진구;윤성일
    • Journal of the Korea Society of Computer and Information / v.5 no.3 / pp.91-98 / 2000
  • In this paper, we design and implement a speaker verification system that verifies personal identity using voice. Filter bank magnitude is used as the feature parameter, and a codebook is built using the LBG algorithm. The codebook converts feature parameters into a code sequence. The difference between the reference pattern and the input pattern is measured using DTW (Dynamic Time Warping). The similarity measured by DTW and a threshold value derived from the deviation are used to discriminate impostors from the client speaker.
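
The DTW comparison and threshold decision can be sketched as follows. This is a textbook DTW over scalar sequences with an absolute-difference local cost, chosen for brevity; the paper compares code sequences from an LBG codebook, and its local distance and threshold derivation are not given in the abstract. The names `dtw_distance` and `is_client` are illustrative.

```python
def dtw_distance(reference, test):
    """Classic DTW cost between two numeric sequences (absolute difference)."""
    inf = float("inf")
    rows, cols = len(reference), len(test)
    # cost[i][j] = best cumulative cost aligning reference[:i] with test[:j]
    cost = [[inf] * (cols + 1) for _ in range(rows + 1)]
    cost[0][0] = 0.0
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            local = abs(reference[i - 1] - test[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # repeat test frame
                                     cost[i][j - 1],      # repeat reference frame
                                     cost[i - 1][j - 1])  # advance both
    return cost[rows][cols]


def is_client(reference, test, threshold):
    """Accept the speaker when the DTW cost does not exceed the threshold."""
    return dtw_distance(reference, test) <= threshold
```

For example, `dtw_distance([1, 2, 3], [1, 2, 2, 3])` is zero because warping absorbs the repeated frame, which is why DTW suits utterances spoken at different speeds.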

Analysis of Voice and Swallowing Symptoms after Thyroidectomy in Patients without Recurrent Laryngeal Nerve Injury in Early Postoperative Period (반회후두신경 손상을 동반하지 않은 갑상선 절제술 환자에서 수술 초기의 음성 및 연하 기능의 변화에 대한 분석)

  • Kim, Heejin;Keum, Bo-Ram;Kim, Geun Hee;Jeon, Seung Sik;Kim, Hyejeen;Kim, Sung Kyun;Hong, Seok Jin;Hong, Seok-Min;Kim, Yong-Bok;Park, Il-Seok
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.27 no.2 / pp.108-113 / 2016
  • Background and Objectives: After thyroidectomy, many patients report problems such as reduced voice range, vocal fatigue, and swallowing difficulty even without superior or recurrent laryngeal nerve injury. The purpose of this study was to evaluate voice and swallowing problems before and after thyroid surgery without laryngeal nerve injury. Materials and Methods: Ninety-three patients who underwent thyroidectomy without laryngeal nerve injury and completed the follow-up evaluations were studied between June 2013 and December 2015. Evaluations were performed preoperatively and at 1 week and 1 month postoperatively. The analysis included the voice handicap index (VHI), dysphagia handicap index (DHI), and acoustic voice analysis. Results: Patients showed significant variation in fundamental frequency (F0), maximal phonation time (MPT), shimmer, jitter, and soft phonation index (SPI) early after the operation, and most of these parameters had recovered 1 month after the operation. Perceptual complaints of voice and swallowing also showed a significant decrease after the operation (p<0.005). One month after the operation, MPT, highest frequency, and frequency range were still significantly decreased. In a comparison of the acoustic and perceptual parameters between total thyroidectomy and lobectomy, there were no significant differences except in highest frequency (p=0.042). Conclusion: The results of both subjective and objective evaluations show voice and swallowing disturbance after thyroidectomy even in the absence of laryngeal nerve injury, and provide patients with information about the recovery process after surgery. The highest frequency parameter showed the most significant change after the operation.
