• Title/Summary/Keyword: Synthesized Voice


Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science (Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석)

  • 최홍식;김성수
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.12 no.2 / pp.115-120 / 2001
  • In this paper, we synthesized the vowel /ae/ with the voice synthesis program of Dr. Speech Science. We also synthesized pathologic versions of the vowel /ae/ by varying parameters such as high frequency gain (HFG), low frequency gain (LFG), pitch flutter (PF), which represents the jitter value, and flutter of amplitude (FA), which represents the shimmer value, with each graded as mild, moderate, or severe. We then analyzed all the pathologic voices with the analysis program of Dr. Speech Science. We expect these synthesized pathologic voices to be useful for understanding parameters such as noise, jitter, and shimmer, and for giving feedback to patients with voice disorders.

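The abstract above treats jitter and shimmer as cycle-to-cycle perturbations of pitch period and amplitude. The following is a minimal sketch of the widely used "local" definitions (mean absolute difference between consecutive cycles, divided by the mean). The abstract does not say how Dr. Speech Science computes these values, so the formulas, function names, and sample measurements here are assumptions for illustration only.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


def jitter_percent(periods):
    """Local jitter (%) from a list of consecutive pitch periods in seconds."""
    return 100.0 * local_perturbation(periods)


def shimmer_percent(amplitudes):
    """Local shimmer (%) from a list of consecutive peak amplitudes (linear scale)."""
    return 100.0 * local_perturbation(amplitudes)


# Hypothetical cycle-by-cycle measurements, not taken from the paper.
periods = [0.0100, 0.0102, 0.0099, 0.0101, 0.0100]
amplitudes = [1.00, 0.95, 1.02, 0.97, 1.00]

print(f"jitter  = {jitter_percent(periods):.2f} %")
print(f"shimmer = {shimmer_percent(amplitudes):.2f} %")
```

A perfectly periodic, constant-amplitude signal gives 0 % for both measures; larger cycle-to-cycle variation raises them.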

Prediction of Post-Treatment Outcome of Pathologic Voice Using Voice Synthesis (음성합성을 이용한 병적 음성의 치료 결과에 대한 예측)

  • 이주환;최홍식;김영호;김한수;최현승;김광문
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.14 no.1 / pp.30-39 / 2003
  • Background and Objectives: Patients with pathologic voice are often concerned about the recovery of their voice after surgery. In this investigation, we controlled the values of three parameters of the voice synthesis program of Dr. Speech Science, namely jitter, shimmer, and NNE (normalized noise energy), which distinguish one person's voice from another's, and devised a method to synthesize the voice predicted after surgery. Subjects and Method: Vocal jitter, vocal shimmer, and glottal noise were measured in the voices of 10 vocal cord paralysis patients and 10 vocal polyp patients 1 week prior to and 1 month after surgery. With the Dr. Speech Science voice synthesis program, we synthesized an 'ae' vowel closely matching each patient's preoperative and postoperative voice by controlling the values of jitter, shimmer, and glottal noise. We then analyzed the synthesized voices and compared them with the pre- and postoperative voices. Results: 1) After inputting the preoperative and corrected values of jitter, shimmer, and glottal noise into the voice synthesis program, we synthesized voices identical, within statistical significance, to the vocal polyp patients' pre- and postoperative voices. 2) After eliminating synergistic effects among the three parameters, we were able to synthesize voices identical to the vocal cord paralysis patients' preoperative voices. 3) After inputting only slightly increased jitter and shimmer into the synthesis program, we were able to synthesize voices identical to the vocal cord paralysis patients' postoperative voices. Conclusion: Voices synthesized with the Dr. Speech Science program were identical to the patients' actual pre- and postoperative voices. Clinicians will be able to give patients more information, and increased patient cooperation can therefore be expected.


Ergonomic Design of Voice Warning Sounds Used in Utility Helicopter (기동헬기 음성 경고음의 인간공학적 설계에 관한 연구)

  • Jung, Jonghyuk;Kim, Taekon;Koh, Jinhwan
    • Journal of IKEEE / v.17 no.2 / pp.189-201 / 2013
  • This paper presents an experimental study of the factors that modulate the perceived urgency of voice alarms generated by concatenative synthesizers. Four experiments were conducted using a psychophysical approach, in which 112 participants made magnitude estimations of the perceived urgency of various voice alarm stimuli. Experiment 1 identified six acoustic and non-acoustic factors that modulate the perceived urgency of synthesized voice alarms. Experiments 2, 3, and 4 quantified the relations between objective changes in each quantifiable parameter and subjective changes in perceived urgency. This research has implications for the design and implementation of synthesized voice alarm systems in which urgency mapping is required.

Quantifying the Urgency Perception of Voice Alarm Generated by Concatenative Synthesizer (연결형 합성음성을 이용한 경보음의 주관적 위급도 정량화)

  • Jang, Pil-Sik;Lee, Gyeong-Tae
    • Journal of the Ergonomics Society of Korea / v.25 no.2 / pp.63-70 / 2006
  • This paper presents an experimental study of the factors that modulate the perceived urgency of voice alarms generated by concatenative synthesizers. Four experiments were conducted using a psychophysical approach, in which 105 participants made magnitude estimations of the perceived urgency of various voice alarm stimuli. Experiment 1 identified six acoustic and non-acoustic factors that modulate the perceived urgency of synthesized voice alarms. Experiments 2, 3, and 4 quantified the relations between objective changes in each quantifiable parameter and subjective changes in perceived urgency. This research has implications for the design and implementation of synthesized voice alarm systems in which urgency mapping is required.
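The abstract above relates objective parameter changes to magnitude estimates of urgency. Magnitude-estimation data are commonly summarized with Stevens' power law, response = k * stimulus^n, fitted by linear regression in log-log coordinates. The abstract does not state which model the authors used, so the sketch below is only an assumed illustration; the function name and the data (generated from a known power law, not measured) are hypothetical.

```python
import math


def fit_power_law(stimulus, response):
    """Least-squares fit of response = k * stimulus**n in log-log coordinates.

    Returns (k, n).
    """
    xs = [math.log(s) for s in stimulus]
    ys = [math.log(r) for r in response]
    n_pts = len(xs)
    mean_x = sum(xs) / n_pts
    mean_y = sum(ys) / n_pts
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx                       # slope of the log-log line
    k = math.exp(mean_y - exponent * mean_x)   # intercept, back on the linear scale
    return k, exponent


# Hypothetical example: speech rate (syllables/s) against urgency estimates
# generated from a known power law, so the fit should recover k=3 and n=1.5.
speech_rate = [2.0, 4.0, 6.0, 8.0]
urgency = [3.0 * r ** 1.5 for r in speech_rate]

k, exponent = fit_power_law(speech_rate, urgency)
print(f"k = {k:.3f}, exponent = {exponent:.3f}")
```

A larger fitted exponent means that a given relative change in the parameter produces a larger relative change in perceived urgency.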

A Design of ADPCM CODEC Core for Digital Voice and Image Processing SOC (디지털 음성 및 영상 처리용 SOC를 위한 ADPCM CODEC 코어의 설계)

  • 정중완;홍석일;한희일;조경순
    • Proceedings of the IEEK Conference / 2001.06b / pp.333-336 / 2001
  • This paper describes the design and implementation of 40, 32, 24, and 16 kbps ADPCM encoder and decoder circuits based on CCITT Recommendation G.726. We verified the ADPCM algorithm in C and designed the RTL circuit in Verilog HDL. The circuit was simulated with Verilog-XL, synthesized with Design Compiler, and verified on a Xilinx FPGA. Because the synthesized circuit contains a small number of gates, it is expected to serve as a core module in SOCs for digital voice and image processing.

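The four bit rates named in the abstract above follow from the G.726 code word sizes: at the 8 kHz telephony sampling rate, 5, 4, 3, and 2 bits per sample give 40, 32, 24, and 16 kbps. The short sketch below only reproduces that arithmetic; it is not an ADPCM codec, and the function and constant names are illustrative.

```python
SAMPLE_RATE_HZ = 8000  # narrowband telephony sampling rate used by G.726


def adpcm_bitrate_kbps(bits_per_sample, sample_rate_hz=SAMPLE_RATE_HZ):
    """Bit rate in kbps for a fixed number of ADPCM bits per sample."""
    return bits_per_sample * sample_rate_hz / 1000


# Map each G.726 code word size to its bit rate.
rates = {bits: adpcm_bitrate_kbps(bits) for bits in (5, 4, 3, 2)}

for bits, kbps in rates.items():
    print(f"{bits} bits/sample -> {kbps:.0f} kbps")
```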

Voice Source Modeling Using Harmonic Compensated LF Model (LF 모델에 고조파 성분을 보상한 음원 모델링)

  • 이건웅;김태우;홍재근
    • Proceedings of the IEEK Conference / 1998.10a / pp.1247-1250 / 1998
  • In speech synthesis, the LF model is widely used to generate the excitation signal in voice source coding systems. However, the LF model does not represent the harmonic frequencies of the excitation signal. We propose an effective method that uses sinusoidal functions to represent the harmonics of the voice source signal. The proposed method achieved a more accurate voice source waveform and better synthesized speech quality than the LF model.


Voice quality transform using jitter synthesis (Jitter 합성에 의한 음질변환에 관한 연구)

  • Jo, Cheolwoo
    • Phonetics and Speech Sciences / v.10 no.4 / pp.121-125 / 2018
  • This paper describes procedures for changing and measuring voice quality in terms of jitter. A jitter synthesis method was applied to the TD-PSOLA analysis system of the Praat software. The jitter component is synthesized based on a Gaussian random noise model. The TD-PSOLA resynthesis process is used to synthesize the modified voice with artificial jitter. Various vocal jitter parameters are used to measure the change in quality caused by the artificial, systematic change in jitter. Synthetic vowels, natural vowels, and short sentences are used to check the change in voice quality through the synthesizer model. The results show that the suggested method is useful, in a limited way, for voice quality control and can be used to alter the jitter component of a voice.
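As a rough illustration of the Gaussian random noise model mentioned in the abstract above, the sketch below perturbs each nominal pitch period with zero-mean Gaussian noise and accumulates the periods into pitch-mark times. It is not the author's implementation and does not call Praat; the TD-PSOLA resynthesis step is omitted, and all function names, parameter values, and the clamping rule are assumptions.

```python
import random


def jittered_periods(f0_hz, n_cycles, sigma_fraction, seed=0):
    """Pitch periods (s) with Gaussian perturbation of std sigma_fraction * nominal period."""
    rng = random.Random(seed)  # fixed seed so the example is reproducible
    base = 1.0 / f0_hz
    periods = []
    for _ in range(n_cycles):
        delta = rng.gauss(0.0, sigma_fraction * base)
        # Clamp so an extreme draw cannot produce a period shorter than half the nominal one.
        periods.append(max(base + delta, 0.5 * base))
    return periods


def pitch_marks(periods, start=0.0):
    """Cumulative pitch-mark times (s) that a PSOLA-style resynthesis could use."""
    marks = [start]
    for p in periods:
        marks.append(marks[-1] + p)
    return marks


def local_jitter_percent(periods):
    """Mean absolute consecutive-period difference over mean period, in percent."""
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    return 100.0 * (sum(diffs) / len(diffs)) / (sum(periods) / len(periods))


mild = jittered_periods(f0_hz=120.0, n_cycles=200, sigma_fraction=0.005)
severe = jittered_periods(f0_hz=120.0, n_cycles=200, sigma_fraction=0.02)
print(f"mild   jitter = {local_jitter_percent(mild):.2f} %")
print(f"severe jitter = {local_jitter_percent(severe):.2f} %")
```

Raising `sigma_fraction` increases the measured jitter, which is the kind of controlled change the abstract describes measuring.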

A Study on the Subjective Urgency of Synthesized Voice Warnings (합성음성 경보의 주관적 위급도에 관한 연구)

  • 박경수;장필식
    • Proceedings of the ESK Conference / 1996.10a / pp.191-196 / 1996
  • This paper presents an experimental study of the relationship between the sound parameters of synthesized voice warnings and perceived (psychoacoustic) urgency. Eighteen subjects participated in two experimental sessions to evaluate and quantify the effects of the voice parameters. The experiments showed that speech rate, fundamental frequency, and voice type have clear and consistent effects on perceived urgency. The results of these experiments can be applied to the improvement of existing auditory warning systems and the design of new systems.


Quantifying the Subjective Urgency of Computer-Synthesized Voice Warnings (컴퓨터 합성음성 경보의 주관적 위급도 정량화)

  • 박경수;장필식;이경태
    • Proceedings of the ESK Conference / 1997.10a / pp.339-345 / 1997
  • This paper presents an experimental study of the relationship between the sound parameters of synthesized voice warnings and perceived (psychoacoustic) urgency. Twenty-four subjects participated in two experimental sessions to evaluate and quantify the effects of the voice parameters. The experiments showed that speech rate, fundamental frequency, fundamental frequency contour type, and voice type have clear and consistent effects on perceived urgency. The results of these experiments can be applied to the improvement of existing auditory warning systems and the design of new systems.


Pitch Modification based on a Voice Source Model (음원 모델에 기초한 합성음의 피치 조절)

  • Choi, Yong-Jin;Yeo, Su-Jin;Kim, Jin-Young;Sung, Koeng-Mo
    • Speech Sciences / v.3 / pp.132-147 / 1998
  • Previously developed methods for pitch modification have not been based on a voice source model. As a result, the synthesized speech often sounds unnatural even though it may be highly intelligible. The purpose of this paper is to analyze how the voice source signal changes with pitch period and to establish a pitch-modification rule based on this analysis. Using the excitation waveform, we examine how the intervals of the closing phase, closed phase, and open phase change as pitch increases. Compared with previous methods, which operate directly on the speech signal, the pitch modification method based on a voice source model shows high intelligibility and naturalness. This study may benefit applications such as speaker identification and voice color conversion, and the proposed method is expected to provide high-quality synthetic speech.
