• Title/Summary/Keyword: Voice pitch

Search Result 265, Processing Time 0.021 seconds

On a Speech Coding Algorithm for Low Cost Implementation of Voice Telegram System (보이스 전보 시스템 구현을 위한 저가형 음성파형 부호화 알고리즘)

  • 나덕수;민소연;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.2
    • /
    • pp.101-105
    • /
    • 2000
  • A telegram has been used to transmit the emergency news or celebration message. So, it has been very important media in our life. Although the telegram processing is more and more convenient, on the other hand, the telegram service contains only text message. The voice telegram is that delivering user's voice with text message. So, the voice telegram can be delivered sender's emotions and feelings. However, since voice information contains lots of data, large memory size and high cost processor are needed to deliver itself. In this paper, we proposed a new speech waveform coding method that has low complexity and low cost implementation for the voice telegram system. First, we fixed one basic speech waveform per pitch period and measured the waveform similarity between basic and neighbor speech waveform. Second, if the similarity satisfied threshold values, we compress the neighbor speech waveform with pitch and magnitude value per pitch period and if not, we save speech waveform. When the compression is about 45%, we obtained about 4 point in MOS.

  • PDF

On a Processing Time Reduction of Cepstrum-Based Pitch Alteration in Time-Frequency Hybrid Domain (켑스트럼 기반 혼성영역 피치변경법의 처리시간 단축에 관한 연구)

  • Jo, Wang-Rae;Kim, Jong-Kuk;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.41-47
    • /
    • 2010
  • The pitch alteration technique for voice conversion is classified in time domain, frequency domain and hybrid domain. The Hybrid domain method has a merit of clearness and natural-ness of pitch altered speech but has the major drawback of long processing time. In this paper, we proposed a new method that can reduce the processing time of pitch alteration in time-frequency hybrid domain. We omitted the bit-reversing process of FFT and IFFT in changing the processing domain. Therefore we can reduce the processing time by 86.26% to the conventional method with same quality.

Comparison of subjective voice symptoms in elite vocal performers and professional voice users (전문 음성사용자와 직업적 음성사용자의 주관적 음성증상 비교)

  • Ji-sung Kim
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.27-34
    • /
    • 2023
  • This study aimed to provide knowledge helpful for understanding voice problems related to occupations in the clinical field through an investigation and comparison of subjective vocal symptoms of 12 professional actors and 12 speech-language pathologists Among the 11 symptoms, "Difficulty with high pitch when singing," "Hypertension in the neck when speaking," and "Feel voice fatigue" were the most frequent symptoms in both groups. Additionally, the professional voice users reported a higher frequency of "Difficulty with high pitch when singing" (p=.049), "Hoarse voice" (p=.021), "Difficulty (requiring effort) when speaking" (p=.032), "Pain in the neck when speaking" (p=.009), and "Feel vocal fatigue" (p=.018) than the elite vocal performer group. This may be due to the different voice-related environments and differences in voice demands during occupational activities between the two groups.

On a Reduction of Pitch Search Time for IMBE Vocoder by Using the Spectral AMDF (SAMDF를 이용한 IMBE VOCODER의 피치 검색 시간 단축에 관한 연구)

  • 홍성훈
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.155-158
    • /
    • 1998
  • IMBE(Improved Multi-Band Excitation) vocoders exhibit good performance at low data rates. The major drawback to IMBE coders is their large computational requirements. In this paper, thus, we propose a new pitch search method that preserves the quality of the IMBE vocoder with reduced complexity. The basic idea is to reduce computation complexity of the pitch searching by using the SAMDF. Applying the proposed method to the IMBE vocoder, we can get approximately 52.02% searching time reduction in the pitch search. There is no difference in voice quality between conventional IMBE and proposed IMBE.

  • PDF

The Effect of Helium Gas Intake on the Characteristics Change of the Acoustic Organs for Voice Signal Analysis Parameter Application (음성신호 분석 요소의 적용으로 헬륨가스 흡입이 음성 기관의 특성 변화에 미치는 영향)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The KIPS Transactions:PartB
    • /
    • v.18B no.6
    • /
    • pp.397-404
    • /
    • 2011
  • In this paper, we were carried out experiments to apply parameter of voice analysis to measure changing characteristic articulator according to inhale the helium gas. The helium gas was used to overcome air embolism nitrogen gas to deal a fatal blow in body nitrogen gas by diver. However, the helium gas has been much trouble interpretation about abnormal voice of diver to cause squeaky voice of low articulation. Therefor, we was carried out experiments about pitch and spectrogram measurement, analysis based on to influence in acoustic organs before and after of inhaled helium gas.

The Change of the Voice Parameters in Long-term Sensorineural Hearing Loss Patients (장기간의 양측 감각신경성 난청환자에서 음성지표의 변화)

  • 윤자복;조경래;정상원;최정환;유영삼;우훈영;이강수
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.12 no.2
    • /
    • pp.140-144
    • /
    • 2001
  • Backgrounds & Objectives : Prolonged hearing loss was considered as one of the factors which have the potential to cause vocal changes. However, the analysis of quality of phonation in hearing loss patients has not been achieved enough. The purpose of the study was to evaluate the difference in objective acoustic parameters between long-term hearing impaired patients and normal control group. Material & Methods : The material of this investigation comprised a group of 20 patients (M : F=10 : 10) with moderate or profound hearing loss(over 50dB). The duration of all hearing loss was over 1 year. All of them underwent the acoustic examinations comprising electroglottography, multidimensional voice program and formant analysis during phonation of the bowels /a/ with free confortable tone and /i/ with voluntary high tone. The results of the acoustic examinations were compared with those of a control group, composed of 20 sex- and age-matched normal hearing subjects. Results : In the male hearing loss subjects, the significant increase was detected in pitch and shimmer during phonation of /a/ and in pitch during phonation of /i/. In addition, this group was characterized by decreased fundamental frequency during phonation of /i/. In female, there was no difference between hearing loss group and normal control group except a decreased formant 1 frequency. Conclusion : Long-term moderate and profound sensorineural hearing loss could affect the objective voice parameters.

  • PDF

Voice Conversion Using Linear Multivariate Regression Model and LP-PSOLA Synthesis Method (선형다변회귀모델과 LP-PSOLA 합성방식을 이용한 음성변환)

  • 권홍석;배건성
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.15-23
    • /
    • 2001
  • This paper presents a voice conversion technique that modifies the utterance of a source speaker as if it were spoken by a target speaker. Feature parameter conversion methods to perform the transformation of vocal tract and prosodic characteristics between the source and target speakers are described. The transformation of vocal tract characteristics is achieved by modifying the LPC cepstral coefficients using Linear Multivariate Regression (LMR). Prosodic transformation is done by changing the average pitch period between speakers, and it is applied to the residual signal using the LP-PSOLA scheme. Experimental results show that transformed speech by LMR and LP-PSOLA synthesis method contains much characteristics of the target speaker.

  • PDF

A Study on Speech Period and Pitch Detection for Continuous Speech Recognition (연속음성인식을 위한 음성구간과 피치검출에 관한 연구)

  • Kim Tai Suk;Chang jong chil
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.56-61
    • /
    • 2005
  • In this thesis, propose speech period and pitch detection for continuous speech recognition. This mathod is distinguishes between vowel and consonant to frame unit in continuous speech, for distinguishable voice. Powerful extraction of speech period could threshold energy make use of input signal to real noise environment. Also algorithm of this method distinguish between vowel and consonant at the same time in voice make use of zero crossing rate and short time energy to extractible speech period.

  • PDF

The Study for Advancing the Performance of Speaker Verification Algorithm Using Individual Voice Information (개별 음향 정보를 이용한 화자 확인 알고리즘 성능향상 연구)

  • Lee, Je-Young;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.253-263
    • /
    • 2002
  • In this paper, we propose new algorithm of speaker recognition which identifies the speaker using the information obtained by the intensive speech feature analysis such as pitch, intensity, duration, and formant, which are crucial parameters of individual voice, for candidates of high percentage of wrong recognition in the existing speaker recognition algorithm. For testing the power of discrimination of individual parameter, DTW (Dynamic Time Warping) is used. We newly set the range of threshold which affects the power of discrimination in speech verification such that the candidates in the new range of threshold are finally discriminated in the next stage of sound parameter analysis. In the speaker verification test by using voice DB which consists of secret words of 25 males and 25 females of 8 kHz 16 bit, the algorithm we propose shows about 1% of performance improvement to the existing algorithm.

  • PDF

A Case Study on Vocal Aerobic Treatment Voice Therapy Development and Application for Classical Singers (성악가를 위한 VAT 음성치료 개발 및 적용 사례연구)

  • Yoo, Jae-Yeon;Lee, Ha-Na
    • 재활복지
    • /
    • v.22 no.1
    • /
    • pp.157-168
    • /
    • 2018
  • The purpose of this study is to investigate the impact of semi-closed vocal training-based Vocal Aerobic Treatment on the voice improvement of soprano. Study subject was one soprano who appealed to the suffering of her voice problem due to vocal cord nodule. A study method of conducting pre/post acoustic evaluation and subjective voice evaluation to compare the measures was used; Vocal Aerobic Treatment was carried out twice a week for a total of 32 session. In the acoustic evaluation, MDVP (multi-dimensional voice program) and VRP (voice range profile) were used to evaluate the pitch, voice quality, and voice range; in the subjective voice evaluation, SVHI (singing voice handicap index) was used to assess voice satisfaction. As a result of the pitch evaluation, the soprano maintained a proper Fo. As a result of the voice quality evaluation, the jitter, shimmer, and the noise harmonic ratio numbers decreased compared to the numbers shown before the treatment. As a result of the voice range evaluation, the scope of the range was broadened, with the number of semitone increasing from 30 to 35. As for the subjective voice evaluation, the result of the total score obtained after the survey report divided by the number of questions showed a decrease from 3.6 to 0.6. The soprano herself reported of having a minor extent of a voice problem. The summary of the above results reflects that Vocal Aerobic Treatment is useful in the voice improvement of vocalists However, as this study is case research regarding the Vocal Aerobic Treatment effect on one soprano, further research on the treatment effect covering many other vocalists is necessary. Also, there is a need for follow-up studies regarding voice management and voice treatment program on not only the vocalists but also the voice users in many other professions.