• Title/Summary/Keyword: Voice Feature

Search Result 232, Processing Time 0.023 seconds

Design and Implementation of Home Network Information Appliance Remote Control System Using Voice XML Technology (VoiceXML기술을 이용한 홈네트워크 정보기기 원격 제어 시스템의 설계 및 구현)

  • 이진구;정문상
    • Proceedings of the IEEK Conference
    • /
    • 2002.06a
    • /
    • pp.133-136
    • /
    • 2002
  • VoiceXML is designed for creating audio dialogs that feature synthesized speech, degitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony, and mixed-initiative conversations. Uses the VoiceXML and there is a Place objective which does information home appliance machinery and tools control. When it uses tile VoiceXML, il will be able to provide a bias characteristic to the user The XML base the gearing with different civil official system is possible. With studying YoiceXML and OSGi, this paper has designed and implemented the control architecture of Information home appliances.

  • PDF

A Systematic Review on Voice Characteristics and Risk Factors of Voice Disorder of Korea Teachers (우리나라 교사의 음성 특성과 음성장애 위험 요인에 관한 체계적 문헌고찰)

  • Cha, Seulki;Byeon, Haewon
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.8
    • /
    • pp.149-154
    • /
    • 2018
  • As the range of professional voice users are expanding, interest towards voice increases as well. Especially as teachers compose the occupational group, exposed to high risk of voice disorder, it is necessary to identify the cause of speech problems and speech disorders. The purpose of this study is to analyze the voice characteristics of teachers and to investigate the causes of voice disorders. From 2000 to 2018, 414 studies were found under a combinated set search words of 'profession', 'Teacher', 'Professional Voice User', 'Voice', 'Voice disorders', 'Risk' and out of them, 8 studies were selected as final focus analysis subjects. The qualitative evaluation was carried out by modifying the Quality: checklist for assessing the Risk of bias. The study confirmed that voice misuse frequently occurred to teachers when they used their voice and this feature was affected by the environment. These results suggest that environment improvement of teachers' speech abuse and consistent voice education are necessary.

Design and Implementation of Korean Voice Web Browser (한국어 음성 웹브라우저 설계 및 구현)

  • Jang, Young-Gun;Jo, Kyoung-Hwan
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.5
    • /
    • pp.458-466
    • /
    • 2001
  • This paper is addressed to a design and implementation of Korean voice web browser using voice technologies for controling web browser and selecting contents in the web document, and converting them to voice after HTML analysis. Main feature of this web browser is universal design which considers both of normal person and visual disabled, allows multi-modal interface. As voice interface for visual disabled, it supports tree structure which allows to recognize web document structure easily by only voice guidance regardless of frame usage, can handle all elements described as tag in the web document, identify them as predefined different voice property according to element property. This method gets rid of additional guidance voice for element property without audio style sheet or additional programming effort.

  • PDF

Voice Activity Detection Based on SVM Classifier Using Likelihood Ratio Feature Vector (우도비 특징 벡터를 이용한 SVM 기반의 음성 검출기)

  • Jo, Q-Haing;Kang, Sang-Ki;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.8
    • /
    • pp.397-402
    • /
    • 2007
  • In this paper, we apply a support vector machine(SVM) that incorporates an optimized nonlinear decision rule over different sets of feature vectors to improve the performance of statistical model-based voice activity detection(VAD). Conventional method performs VAD through setting up statistical models for each case of speech absence and presence assumption and comparing the geometric mean of the likelihood ratio (LR) for the individual frequency band extracted from input signal with the given threshold. We propose a novel VAD technique based on SVM by treating the LRs computed in each frequency bin as the elements of feature vector to minimize classification error probability instead of the conventional decision rule using geometric mean. As a result of experiments, the performance of SVM-based VAD using the proposed feature has shown better results compared with those of reported VADs in various noise environments.

Speech Enhancement Based on Voice/Unvoice Classification (유성음/무성음 분리를 이용한 잡음처리)

  • 유창동
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.374-379
    • /
    • 2002
  • In this paper, a nobel method to reduce noise using voice/unvoice classification is proposed. Voice and unvoice are an important feature of speech and the proposed method processes noisy speech differently for each voice/unvoice part. Speech is classified into voice/unvoice using zero-crossing rate and energy, and a modified speech/noise dominant-decision is proposed based on voice/unvoice classification. The proposed method was tested on conditions of white noise and airplane noise, and on the basis of comparing segmental SNR with the existing method and listening to the enhanced speech, a performance of the proposed method was superior to that of the existing method.

Implementation of the Timbre-based Emotion Recognition Algorithm for a Healthcare Robot Application (헬스케어 로봇으로의 응용을 위한 음색기반의 감정인식 알고리즘 구현)

  • Kong, Jung-Shik;Kwon, Oh-Sang;Lee, Eung-Hyuk
    • Journal of IKEEE
    • /
    • v.13 no.4
    • /
    • pp.43-46
    • /
    • 2009
  • This paper deals with feeling recognition from people's voice to fine feature vectors. Voice signals include the people's own information and but also people's feelings and fatigues. So, many researches are being progressed to fine the feelings from people's voice. In this paper, We analysis Selectable Mode Vocoder(SMV) that is one of the standard 3GPP2 codecs of ETSI. From the analyzed result, we propose voices features for recognizing feelings. And then, feeling recognition algorithm based on gaussian mixture model(GMM) is proposed. It uses feature vectors is suggested. We verify the performance of this algorithm from changing the mixture component.

  • PDF

A study on Voice Recognition using Model Adaptation HMM for Mobile Environment (모델적응 HMM을 이용한 모바일환경에서의 음성인식에 관한 연구)

  • Ahn, Jong-Young;Kim, Sang-Bum;Kim, Su-Hoon;Hur, Kang-In
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.3
    • /
    • pp.175-179
    • /
    • 2011
  • In this paper, we propose the MA(Model Adaption) HMM that to use speech enhancement and feature compensation. Normally voice reference data is not consider for real noise data. This method is not to use estimated noise but we use real life environment noise data. And we applied this contaminated data for recognition reference model that suitable for noise environment. MAHMM is combined with surround noise when generating reference patten. We improved voice recognition rate at mobile environment to use MAHMM.

A Study on Voice Recognition using Noise Cancel DTW for Noise Environment (잡음환경에서의 Noise Cancel DTW를 이용한 음성인식에 관한 연구)

  • Ahn, Jong-Young;Kim, Sung-Su;Kim, Su-Hoon;Koh, Si-Young;Hur, Kang-In
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.4
    • /
    • pp.181-186
    • /
    • 2011
  • In this paper, we propose the Noise Cancel DTW that to use a kind of feature compensation. This method is not to use estimated noise but we use real life environment noise data for Voice Recognition. And we applied this contaminated data for recognition reference model that suitable for noise environment. NCDTW is combined with surround noise when generating reference patten. We improved voice recognition rate at mobile environment to use NCDTW.

The Voice of Mathematics Teacher Guides from Their Use of Pronouns, Modality, and Imperatives

  • Suh, Heejoo
    • Research in Mathematical Education
    • /
    • v.22 no.3
    • /
    • pp.223-243
    • /
    • 2019
  • Researchers have been attending to the potential of curriculum materials as resources for professional development. In order for a curriculum material to fulfil such purpose, curriculum authors should intentionally attend to educativeness of the material. A feature of educative material is that its voice speaks to teachers. In this study, I explore educativeness of Algebra teacher guides by attending to their voice. In particular, I focused on the use of pronouns, modality, and imperatives. Findings indicate that some teacher guides have more educative voice than the others and that the amount each guide talk to teachers were less than sufficient. Implications for future research and practice are discussed.

Acoustic Evidence for the Development of Aspiration Feature in Putonghua Stops

  • Han, Ji-Yeon
    • Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.201-209
    • /
    • 2005
  • This study was investigated developmental temporal features in Putonghua-speaking children. The total of 212 children between the ages 2;6 and 6;5 participated in Shanghai. Speech materials were constructed according to aspiration feature in stop sounds of Putonghua. Six words were selected in this study. A voice onset time was measured. Non-parametric procedures were employed for all the analyses. The VOT value across bilabial, alveolar, and velar stops was significantly differed between aspirated and unaspirated stops for each age group. Effect of age is. significant for unaspirated stops. It is clear that each of Putonghua stops showed decreasing mean and standard deviation. The overshoot phenomenon of VOT was apparent from the age of 2;6-2;11 to 4;6-4;11. There was high variability in the production of lag time for aspirated stops.

  • PDF