Search | Korea Science

A Study on Voice Color Control Rules for Speech Synthesis System (음성합성시스템을 위한 음색제어규칙 연구)

Kim, Jin-Young;Eom, Ki-Wan
- Speech Sciences / v.2 / pp.25-44 / 1997
When listening to the various speech synthesis systems developed and in use in our country, we find that although their quality has improved, they still lack naturalness. Moreover, since the voice color of each system is limited to a single recorded speech DB, another speech DB must be recorded to create a different voice color. 'Voice color' is an abstract concept that characterizes voice personality, so speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for a text-to-speech system that produces natural and varied voice types in synthetic speech. To derive such rules from natural speech, the glottal source parameters and the frequency characteristics of the vocal tract were studied for several voice colors. In this paper, voice colors were categorized as deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. The LF-model was used as the voice source model, and the formant frequencies, bandwidths, and amplitudes were used as the frequency characteristics of the vocal tract. These acoustic parameters were tested through multiple regression analysis to obtain the general relation between the parameters and the voice colors.

Study on Improvement for selecting the optimum voice channels in the radio voice communication (무전기 음성통신에서 최적음성채널 선택을 위한 개선방안에 관한 연구)

Lew, Chang-Guk;Lee, Bae-Ho
- The Journal of the Korea institute of electronic communication sciences / v.11 no.2 / pp.171-178 / 2016
An aircraft in flight and the air traffic controllers (ATC) working in the ground control center communicate by voice over radio. The voice signal transmitted from the aircraft is received simultaneously at multiple terrestrial sites around the country. The ATC receives voice signals of varying quality depending on the distance, speed, weather conditions, and the adjustment of the antenna and the radio, and communicates with the aircraft under the best available conditions by finding the best voice signal. However, the present system selects the optimal channel from the CD (Carrier Detect) value judged to be superior, which is based on the input voice level. The system therefore cannot be said to select the optimal channel, because it does not consider the effect of noise on communication quality. In this paper, after removing the noise from the voice signal, we provide digitized information and an improved voice signal quality so that users can select an optimal channel. When this is used in operating a training eavesdropping system or in aircraft control, we can expect accident prevention and improved training performance through selection of the improved-quality channel.
https://doi.org/10.13067/JKIECS.2016.11.2.171

Policy and Managerial Issues of Voice over Internet Protocol(VoIP) (인터넷전화의 정책 및 경영이슈측면에서의 이용자분석)

Kim, Ji-Hee;Sung, Yoon-Young;Kweon, O-Sang;Kim, Jin-Ki
- Journal of Information Technology Applications and Management / v.14 no.4 / pp.221-233 / 2007
Which factors influence consumers' consideration of subscribing to Voice over Internet Protocol (VoIP)? Policy issues, managerial concerns, and demographic variables are possible factors. This paper discusses policy and managerial issues regarding VoIP adoption. A model that explains VoIP adoption is proposed and tested. This study analyzes a survey of 750 prospective VoIP users in Korea. The model is tested with logistic regression and discriminant analysis. The results show that trust in VoIP, quality relative to fixed service, the numbering plan, and satisfaction with call quality and customer service on both fixed and mobile services affect the adoption of VoIP. Implications for VoIP providers and policy makers are presented.

An Integrated E-model Implementation for Speech Quality Measurement in VoIP and VoLTE (VoIP와 VoLTE 음성 품질 측정을 위한 통합 E-model 구현)

Kim, Bog-Soon;Baek, Kwang-Hyun;Cho, Gi-Hwan
- Journal of the Institute of Electronics and Information Engineers / v.50 no.7 / pp.10-18 / 2013
With the advance of mobile communication services and the commercialization of VoLTE (Voice over LTE), the QoS of VoLTE is receiving growing attention. This paper proposes an integrated E-model that considers factors influencing the service quality of VoIP- and VoLTE-based voice communication systems when calculating the voice quality of a wideband codec. The model aims to calculate an R value that reflects the conditions of the access network, network characteristics, terminal usage, and mobility. We mainly deal with the structure of the integrated E-model, the related algorithms, and the optimal parameters for VoLTE. Experiments show that the voice quality difference between VoIP and VoiceChecker, and between VoLTE and POLQA, is below 10%. With the proposed model, voice quality can be calculated from the factors that directly affect service quality and from the environment of the VoLTE terminal and network. As a result, service quality can be estimated in advance, without measuring it in a real wireless environment.
https://doi.org/10.5573/ieek.2013.50.7.010
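For readers unfamiliar with the R value mentioned in the abstract above, the following is a minimal sketch of how an E-model rating is commonly converted to an estimated MOS. It uses the narrowband mapping published in ITU-T G.107; whether the paper's integrated wideband model uses this exact mapping is not stated in the abstract, so treat the function as an illustration, not as the authors' implementation. The name `r_to_mos` is hypothetical.

```python
def r_to_mos(r: float) -> float:
    """Convert an E-model rating factor R to an estimated MOS.

    Uses the narrowband mapping from ITU-T G.107 (an assumption here;
    the abstract does not say which mapping the integrated model uses).
    """
    if r <= 0:
        return 1.0   # lower bound of the MOS scale
    if r >= 100:
        return 4.5   # upper bound of the narrowband mapping
    # Cubic mapping between the two bounds.
    return 1.0 + 0.035 * r + r * (r - 60.0) * (100.0 - r) * 7.0e-6


if __name__ == "__main__":
    for r in (50.0, 70.0, 93.2):
        print(f"R = {r:5.1f} -> MOS = {r_to_mos(r):.2f}")
```

Because the mapping is a closed-form function of R, quality can be estimated from network and terminal parameters alone, which is the property the abstract relies on when it speaks of estimating quality without field measurement.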

The Effects of Nasalance on Quality of Voice (비성이 음질에 미치는 영향에 대한 음향학적 연구)

Ahn, Jong-Bok;Shin, Myung-Sun;Noh, Dong-Woo;Paik, Eun-A;Jeong, Ok-Ran
- Speech Sciences / v.9 no.3 / pp.133-140 / 2002
The purpose of this study was to investigate any changes in the acoustic qualities of voice as a function of nasalance, in order to determine the relationship between vocal quality and nasalance. Twenty normal subjects (10 males and 10 females) vocalized /a/, /$\tilde{a}$/, and /a$\eta$/. The changes in nasalance and in the acoustic characteristics of the voice were analyzed with a Nasometer (Model 6200-3, Kay Elemetrics Co.) and Dr. Speech 4.0 (Tiger Electronics Co.), respectively. One-way ANOVA was used to examine any changes in jitter, shimmer, harmonics-to-noise ratio (HNR), and normalized noise energy (NNE) relative to nasalance across the 3 types of vocalization. The Pearson r correlation coefficient was used to identify the relationship between nasalance and vocal quality. There were no statistically significant changes in jitter, shimmer, HNR, or NNE. Jitter, however, tended to increase as the nasalance score increased, compared to the other vocal parameters. In addition, NNE showed an increase on /$\tilde{a}$/ and /a$\eta$/, more so on /a$\eta$/. Thus, it was speculated that NNE could be used to identify or screen resonance disorders with hypernasality.
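As a rough illustration of the perturbation measures named in the abstract above, the sketch below computes a "local" jitter or shimmer percentage: the mean absolute difference between consecutive cycles, divided by the mean. Applied to glottal period durations it gives jitter; applied to per-cycle peak amplitudes it gives shimmer. This is one common definition and is assumed here; the abstract does not say which variant Dr. Speech 4.0 computes. The function name and the sample values are hypothetical.

```python
def percent_perturbation(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean.

    Pass glottal period durations to get local jitter (%), or per-cycle
    peak amplitudes to get local shimmer (%).
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


if __name__ == "__main__":
    # Hypothetical period durations in milliseconds, for illustration only.
    periods_ms = [8.00, 8.05, 7.98, 8.02, 8.01]
    print(f"jitter = {percent_perturbation(periods_ms):.2f} %")
```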

Effects of communication environment on VoIP capacity using WiFi (통신환경이 WiFi를 이용한 VoIP 서비스 용량에 미치는 영향)

Choi, Dae-Woo
- Journal of the Korea Institute of Information and Communication Engineering / v.19 no.6 / pp.1327-1332 / 2015
In this paper, we studied several factors that affect the quality of VoIP over a WiFi network. It is clear that the background data traffic within an AP, the end-to-end delay, and the traffic loss of the TCP/IP network seriously affect voice quality. Some form of access control for VoIP connections within an AP is needed to maintain acceptable voice quality.
https://doi.org/10.6109/jkiice.2015.19.6.1327

FMC Performance and Voice Quality of Enterprise Type connectable to IP-PBX (IP-PBX와 연동 가능한 기업 형 FMC 성능 및 음성품질)

Kim, Sam-Taek
- The Journal of the Institute of Internet, Broadcasting and Communication / v.15 no.6 / pp.89-94 / 2015
FMS, based on the concept that a wireless terminal can replace wired terminal services, is a technology that can provide service at the same cost as a wired terminal within a special zone. Enterprise-type FMC, developed to make up for its weak points, must improve voice quality and FMC performance in the soft phone. This paper measures voice quality based on the one-way total estimated delay time of FMC when carrying out IMS services between the IP-PBX and the FMC soft phone, in order to operate its controller optimally, and puts forward evidence that the delay lies between 120 ms and 150 ms for VoIP FMC voice quality. FMC performance is measured through trials evaluated in four categories, which demonstrate its performance.
https://doi.org/10.7236/JIIBC.2015.15.6.89

Playout Scheduling Method Based on Adaptive Jitter Estimation for Enhancing VoIP Speech Quality (VoIP 음질향상을 위한 적응적 지터추정 기반의 플레이아웃 스케줄링 방법)

Ryu, Sang-Hyeon;Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea / v.33 no.2 / pp.133-138 / 2014
Packet arrival-delay variation, so-called 'jitter', is one of the main factors that degrade voice quality in Voice over Internet Protocol (VoIP) on mobile devices. To resolve this issue, a playout scheduling method based on adaptive jitter estimation is proposed for enhancing VoIP speech quality. The proposed algorithm copes with the effect of transmission jitter by expanding or compressing each packet according to the predicted network delay and its variation. Additionally, the active network jitter estimation incorporates rapid detection of delay spikes and reacts to changes in network conditions. The experimental results show that the proposed algorithm delivers high voice quality in unstable network environments.
https://doi.org/10.7776/ASK.2014.33.2.133
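The general idea behind adaptive playout scheduling, as described in the abstract above, can be sketched with a classic exponentially weighted delay and variation estimator plus a simple spike rule. This is a generic textbook-style scheme, not the algorithm proposed in the paper, whose details the abstract does not give. The class name, the parameter values (`alpha`, `fast_alpha`, `spike_ms`, `safety`), and the sample delays are all assumptions chosen for illustration.

```python
class JitterEstimator:
    """Adaptive playout-delay estimator with a simple delay-spike rule."""

    def __init__(self, alpha=0.95, fast_alpha=0.5, spike_ms=100.0, safety=4.0):
        self.alpha = alpha            # slow smoothing weight for normal operation
        self.fast_alpha = fast_alpha  # faster weight used while a spike is detected
        self.spike_ms = spike_ms      # deviation from the mean that counts as a spike
        self.safety = safety          # multiplier on the variation estimate
        self.mean_delay = None
        self.variation = 0.0

    def update(self, delay_ms):
        """Feed one packet's network delay and return the new playout delay."""
        if self.mean_delay is None:
            self.mean_delay = delay_ms
            return self.playout_delay()
        # Track the network faster when the delay jumps far from the estimate.
        is_spike = abs(delay_ms - self.mean_delay) > self.spike_ms
        a = self.fast_alpha if is_spike else self.alpha
        self.mean_delay = a * self.mean_delay + (1.0 - a) * delay_ms
        self.variation = a * self.variation + (1.0 - a) * abs(delay_ms - self.mean_delay)
        return self.playout_delay()

    def playout_delay(self):
        """Mean delay plus a safety margin proportional to the variation."""
        return self.mean_delay + self.safety * self.variation


if __name__ == "__main__":
    est = JitterEstimator()
    for d in (50.0, 52.0, 49.0, 200.0, 55.0, 51.0):  # hypothetical delays in ms
        print(f"delay {d:6.1f} ms -> playout {est.update(d):6.1f} ms")
```

A real scheduler would then time-scale (expand or compress) the audio of each packet to meet the chosen playout delay; that step is omitted here.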

Effects of Laryngeal Massage on Muscle Tension Dysphonia: A Systematic Review and Meta-Analysis (근긴장성 발성장애의 후두마사지 효과: 체계적 고찰 및 메타분석)

Kim, Jaeock
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.32 no.2 / pp.64-74 / 2021
Background and Objectives: This study investigated the voice quality and articulation effects of laryngeal massage on muscle tension dysphonia (MTD). Materials and Method: A systematic review was conducted of articles published between January 2000 and December 2020 in Cochrane, PubMed, ScienceDirect, SpringerLink, ERIC, and Naver Academic. From the total of 2094 articles identified, 10 peer-reviewed articles were included in a meta-analysis. Mean effect sizes of the variables related to voice quality (jitter, shimmer, harmonic-to-noise ratio or noise-to-harmonic ratio, high-F0, low-I, cepstral peak prominence) and articulation (F1, F2, F1 slope, F2 slope) were calculated with Hedges' g. Results: Meta-analysis of the selected articles showed that laryngeal massage had medium to large effects on all variables of voice quality and articulation except high-F0 and F1 slope in the MTD patients. Conclusion: This study provided comprehensive clinical evidence that it is highly desirable to apply laryngeal massage to MTD patients.
https://doi.org/10.22469/jkslp.2021.32.2.64
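The effect size named in the abstract above, Hedges' g, is a standardized mean difference with a small-sample bias correction. The sketch below shows the standard formula for two independent groups; the meta-analysis may have used a pre/post or other variant, which the abstract does not specify. The function name and the input numbers are hypothetical and are not taken from the study.

```python
import math


def hedges_g(mean1, sd1, n1, mean2, sd2, n2):
    """Hedges' g for two independent groups (bias-corrected Cohen's d)."""
    df = n1 + n2 - 2
    # Pooled standard deviation of the two groups.
    pooled_sd = math.sqrt(((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df)
    d = (mean1 - mean2) / pooled_sd
    # Small-sample correction factor J = 1 - 3 / (4 * df - 1).
    j = 1.0 - 3.0 / (4.0 * df - 1.0)
    return j * d


if __name__ == "__main__":
    # Hypothetical group statistics, for illustration only.
    g = hedges_g(10.0, 2.0, 10, 8.0, 2.0, 10)
    print(f"g = {g:.3f}")
```

By the usual convention, magnitudes near 0.2, 0.5, and 0.8 are read as small, medium, and large effects, which is the scale behind the abstract's "medium to large" wording.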

The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing (음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성)

Choi, Seong Hee;Choi, Chul-Hee
- Phonetics and Speech Sciences / v.6 no.3 / pp.63-72 / 2014
The current study assessed the utility of the acoustic analyses most commonly used in routine clinical voice assessment, including perturbation, nonlinear dynamic, and spectral/cepstral analysis, based on signal typing of dysphonic voices, and investigated the clinical applicability of these methods. A total of 70 dysphonic voice samples were classified by signal type using narrowband spectrograms. The traditional parameters %jitter, %shimmer, and signal-to-noise ratio (SNR) were calculated for the signals using TF32, and the correlation dimension (D2), a nonlinear dynamic parameter, and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were also calculated with ADSV (Analysis of Dysphonia in Speech and Voice™). Auditory-perceptual analysis was performed by two blinded speech-language pathologists using GRBAS. The results showed that the nearly periodic Type 1 signals were all from functional dysphonia, while Type 4 signals comprised neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant signal-type-related differences were found in all acoustic and auditory-perceptual measures. SNR, CPP, and L/H ratio values for Type 4 were significantly lower than those of the other voice signals, and significantly higher %jitter and %shimmer were observed in Type 4 voice signals (p<.001). Additionally, as the signal type increased, D2 values increased significantly and more complex, nonlinear patterns appeared. Nevertheless, D2 could not be obtained for voice signals with a high noise component associated with breathiness. In particular, CPP was more sensitive to the voice qualities 'G', 'R', and 'B' than any other acoustic measure. Thus, spectral and cepstral analyses may be applied to more severely dysphonic voices such as Type 4 signals, and CPP can be a more accurate and predictive acoustic marker for measuring voice quality and severity in dysphonia.
https://doi.org/10.13064/KSSS.2014.6.3.063
