• Title/Summary/Keyword: speech quality

Recent Trends in the Treatment of Voice Disorders: Evidence-based Practice and Translational Biology Research (음성 장애 치료 연구의 최근 동향: 증거에 기초한 임상 치료 및 전이 생물학적 연구)

  • Choi, Seong-Hee
    • Phonetics and Speech Sciences / v.2 no.1 / pp.99-112 / 2010
  • This study reviews recent, high-quality, evidence-based research on the treatment effectiveness of voice disorders, focusing on randomized controlled trials (RCTs) and on translational research in vocal fold tissue engineering for vocal fold regeneration. For the RCTs, the methodology is summarized using PICO information (P: populations or patients; I: interventions; C: comparison group (control, placebo, gold standard); O: outcomes or measures made). For the translational research, the animal models (species), regenerative therapy methods, and outcomes relevant to clinical application are summarized. Both are discussed with a view to future voice disorder research.

Experimental study of reverberation time in ship's public area (선박의 공용구역 잔향시간의 실험적 연구)

  • Kim, Taemoo;Choi, Choongyoung;Park, Nojun;Park, JeanHyung;Kwun, Hyuk
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2013.04a / pp.242-242 / 2013
  • Recently, the importance of the working environment has increased in commercial vessels and offshore structures. Marine facility design and ambient environmental conditions influence the enhancement of human performance and the reduction of human error. Consequently, the quality of the accommodation where offshore facility crews sleep, eat and relax will influence their job performance and overall sense of comfort and well-being. Therefore, adequate acoustic isolation between adjacent spaces is normally required to achieve satisfactory internal noise levels, acoustic privacy and speech intelligibility. In this study, the reverberation time is investigated in public areas for which the noise reduction coefficient (NRC) of the materials is not provided. Reverberation time experiments have rarely been performed for the various types of public areas in a marine structure. Therefore, the reverberation time in a vessel is investigated in order to evaluate the room's NRC in a public area.

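As a rough illustration of how a measured reverberation time can be turned into an absorption estimate, the sketch below uses Sabine's formula, RT60 = 0.161 V / A in metric units. The abstract does not say which relation the authors used, so Sabine's formula, the function names, and the room dimensions are all assumptions made for illustration. NRC is conventionally the mean of the absorption coefficients at 250, 500, 1000 and 2000 Hz, usually reported rounded to the nearest 0.05; the rounding is omitted here.

```python
# Hypothetical sketch: room-average absorption from measured RT60 via Sabine's formula.
# This is not the authors' procedure; it only illustrates the RT60-to-absorption link.

SABINE_CONSTANT = 0.161            # s/m, metric form of Sabine's formula
NRC_BANDS_HZ = (250, 500, 1000, 2000)


def mean_absorption(rt60_s, volume_m3, surface_m2):
    """Return the room-average absorption coefficient for one frequency band."""
    if rt60_s <= 0 or volume_m3 <= 0 or surface_m2 <= 0:
        raise ValueError("RT60, volume and surface area must be positive")
    # Sabine: RT60 = 0.161 * V / (S * alpha)  =>  alpha = 0.161 * V / (S * RT60)
    return SABINE_CONSTANT * volume_m3 / (surface_m2 * rt60_s)


def room_average_nrc(rt60_by_band, volume_m3, surface_m2):
    """Average the per-band absorption over the four NRC bands (no rounding)."""
    alphas = [
        mean_absorption(rt60_by_band[band], volume_m3, surface_m2)
        for band in NRC_BANDS_HZ
    ]
    return sum(alphas) / len(alphas)


# Example with made-up values: a 100 m^3 room with 161 m^2 of surface area.
measured_rt60 = {250: 0.6, 500: 0.5, 1000: 0.5, 2000: 0.4}
estimated_nrc = room_average_nrc(measured_rt60, 100.0, 161.0)
```

Sabine's formula assumes a diffuse sound field and fairly low absorption, which may not hold in small or heavily furnished cabins, so the result should be read as a first estimate only.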

A CELP Speech Coder Using Dispersed-Pulse and Random Codebook (분산펄스와 랜덤 코드북을 이용한 CELP 음성 부호화기)

  • 황윤성;문인섭;이행우;김종교
    • Proceedings of the IEEK Conference / 2001.06d / pp.115-118 / 2001
  • This paper presents a dispersed-pulse and random codebook for a CELP coder. The coder operates on speech frames of 20 ms and generates an excitation vector by convolving dispersion vectors with the signed pulses of an algebraic codevector. The pulse-based fixed codebook is thereby improved at a low bit rate. A high-performance fixed codebook consisting of a partial algebraic codebook and a random codebook is used in unvoiced and stationary noise regions. The proposed CELP coder operates at 4 kb/s and is compared with G.729 (8 kb/s CS-ACELP). Subjective testing shows better quality than the reference coders under some background noise conditions.

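The excitation step described in the abstract above (signed pulses convolved with a dispersion vector) can be sketched as follows. This is a minimal illustration, not the paper's coder: the function name, the use of a single dispersion vector for all pulses, and truncation at the subframe boundary are assumptions.

```python
# Hypothetical sketch of dispersed-pulse excitation: each signed unit pulse of a
# sparse algebraic codevector is replaced by a scaled copy of a dispersion vector.

def dispersed_excitation(pulse_positions, pulse_signs, dispersion, subframe_len):
    """Convolve signed unit pulses with a dispersion vector, truncated to the subframe."""
    excitation = [0.0] * subframe_len
    for position, sign in zip(pulse_positions, pulse_signs):
        for offset, tap in enumerate(dispersion):
            index = position + offset
            if index >= subframe_len:
                break  # assumption: the tail past the subframe end is discarded
            excitation[index] += sign * tap
    return excitation


# Example: two pulses (+1 at sample 0, -1 at sample 3) and a two-tap dispersion vector.
example = dispersed_excitation([0, 3], [1, -1], [1.0, 0.5], 5)
```

Because the codevector is sparse, the convolution reduces to adding one shifted, signed copy of the dispersion vector per pulse, which keeps the cost proportional to the number of pulses rather than to the subframe length squared.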

Voice conversion using low dimensional vector mapping (낮은 차원의 벡터 변환을 통한 음성 변환)

  • Lee, Kee-Seung;Doh, Won;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics S / v.35S no.4 / pp.118-127 / 1998
  • In this paper, we propose a voice personality transformation method that makes one person's voice sound like another person's voice. To transform the voice personality, the vocal tract transfer function is used as the transformation parameter. Compared with previous methods, the proposed method can obtain high-quality transformed speech with low computational complexity. Conversion between the vocal tract transfer functions is implemented by a linear mapping based on soft clustering. In this process, the mean LPC cepstrum coefficients and the mean-removed LPC cepstrum, modeled by a low-dimensional vector, are used as transformation parameters. To evaluate the performance of the proposed method, mapping rules are generated from 61 Korean words uttered by two male speakers and one female speaker. These rules are then applied to 9 sentences uttered by the same speakers, and objective evaluations and subjective listening tests are performed on the transformed speech.

Adaptive TCX Windowing Technology for Unified Structure MPEG-D USAC

  • Lee, Tae-Jin;Beack, Seung-Kwon;Kang, Kyeong-Ok;Kim, Whan-Woo
    • ETRI Journal / v.34 no.3 / pp.474-477 / 2012
  • The MPEG-D unified speech and audio coding (USAC) standardization process was initiated by MPEG to develop an audio codec that provides consistent quality for mixed speech and music content. The current USAC reference model structure consists of frequency domain (FD) and linear prediction domain (LPD) core modules and is controlled using a signal classifier tool. In this letter, we propose an LPD single-mode USAC structure using an adaptive windowing-based transform-coded excitation module. We tested our system using the official test items for all mono evaluation modes. The experimental results show that the objective and subjective performance of the proposed single-mode USAC system is better than that of the FD/LPD dual-mode USAC system.

Dialog System Using Multimedia Techniques for the Elderly with Dementia

  • 김성일;정현열
    • The Journal of the Acoustical Society of Korea / v.21 no.4 / pp.170-170 / 2002
  • The goal of the present research is to improve the quality of life of elderly people with dementia. To this end, we developed a dialog system controlled by three modules: a speech recognition engine, a graphical agent, and a database organized by nursing schedule. The system was evaluated in the actual environment of a nursing facility by introducing it to an older male patient with dementia. A comparison study between the dialog system and professional caregivers was then carried out at the nursing home for 5 days in each case. The evaluation results showed that the dialog system was more responsive to the needs of the dementia patient than the professional caregivers were. Moreover, the proposed system led the patient to talk more than the caregivers did.

The relation between phonetic differences of Korean learners' production of English vowels, pronunciation intelligibility and speaking proficiency test scores (한국인 학습자 영어 모음 발화의 음성학적 차이와 발음 이해도, 말하기 점수와의 관계)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences / v.9 no.2 / pp.1-7 / 2017
  • The purpose of this study is to investigate the relations between phonetic differences in Korean learners' production of English front vowels, pronunciation intelligibility, and speaking proficiency test scores. To do so, thirty Korean university students were asked (1) to read English textbook paragraphs and (2) to describe a picture. Two native English raters and one Korean rater evaluated the Korean subjects' English pronunciation intelligibility and speaking. In addition, the subjects' English vowel productions were acoustically analyzed (F0, F1, F2, vowel duration, intensity). The results show that the vowel quality and pitch of the unstressed vowels and the lax vowel are related to pronunciation intelligibility. In addition, the pronunciation intelligibility scores and the speaking scores are highly related.

ON A REDUCTION OF PITCH SEARCHING TIME BY PREPROCESSING IN THE CELP VOCODER

  • Kim, Daesik;Bae, Myungjin;Kim, Jongjae;Byun, Kyungjin;Han, Kichun;Yoo, Hahyoung
    • Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.904-911 / 1994
  • Code Excited Linear Prediction (CELP) speech coders exhibit good performance at data rates below 4.8 kbps. The major drawback of CELP-type coders is their heavy computational load. In this paper, we propose a new pitch search method that preserves the quality of the CELP vocoder while reducing complexity. The basic idea is to apply a preprocessing step that captures the autocorrelation property of the speech waveform before the pitch search. With the proposed method, we obtain approximately 77% complexity reduction in the pitch search.

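For readers unfamiliar with autocorrelation-based pitch search, the sketch below shows a generic open-loop lag estimator using normalized correlation. It is not the preprocessing method proposed in the paper, whose details the abstract does not give; the function name, the lag range, and the synthetic test frame are assumptions for illustration.

```python
import math

# Hypothetical sketch: generic open-loop pitch lag estimation by normalized
# correlation. This is a textbook estimator, not the paper's proposed method.

def estimate_pitch_lag(frame, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the highest normalized correlation."""
    best_lag = min_lag
    best_score = -math.inf
    for lag in range(min_lag, max_lag + 1):
        cross = 0.0
        energy_now = 0.0
        energy_past = 0.0
        for n in range(lag, len(frame)):
            cross += frame[n] * frame[n - lag]
            energy_now += frame[n] ** 2
            energy_past += frame[n - lag] ** 2
        if energy_now == 0.0 or energy_past == 0.0:
            continue  # skip lags where either segment is silent
        score = cross / math.sqrt(energy_now * energy_past)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Example: a synthetic frame with a period of 40 samples.
frame = [math.sin(2.0 * math.pi * n / 40.0) for n in range(240)]
lag = estimate_pitch_lag(frame, 20, 60)
```

The nested loop makes the cost grow with both the frame length and the number of candidate lags, which is the kind of load that a preprocessing step restricting the candidate lags would aim to reduce.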

Using Highly Secure Data Encryption Method for Text File Cryptography

  • Abu-Faraj, Mua'ad M.;Alqadi, Ziad A.
    • International Journal of Computer Science & Network Security / v.21 no.12 / pp.53-60 / 2021
  • Many standard methods are used for the cryptography of secret text files and secret short messages. These methods are efficient when the text to be encrypted is small, but their efficiency decreases rapidly as the text size increases. They also sometimes provide a low level of security, which depends on the PK length, and they may sometimes be hacked. In this paper, a new method is introduced to improve the level of data protection by using a changeable secret speech file to generate the PK. The Highly Secure Data Encryption (HSDE) method is implemented and tested for data quality levels to ensure that HSDE destroys the data in the encryption phase and recovers the original data in the decryption phase. Some standard data cryptography methods are also implemented, and comparisons are made to justify the enhancements provided by the proposed method.

Grammatical Quality Estimation for Error Correction in Automatic Speech Recognition (문법성 품질 예측에 기반한 음성 인식 오류 교정)

  • Mintaek Seo;Seung-Hoon Na;Minsoo Na;Maengsik Choi;Chunghee Lee
    • Annual Conference on Human and Language Technology / 2022.10a / pp.608-612 / 2022
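  • Since the advance of deep learning, many fields have used it to solve previously difficult tasks and provide convenience to users. However, it is still difficult to deliver an ideal service through deep learning. In particular, in speech recognition, Speech-To-Text (STT), which converts speech into text and thereby broadens the ways in which the speech modality can be used, produces errors because its sentence output falls short of the ideal. In this paper, we recast STT output correction as grammatical error correction and, to improve performance by combining the correct tokens at the final stage, apply a model that performs quality estimation for each token to Korean and confirm the improvement in performance.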