• Title/Summary/Keyword: Voice problem


The Development of Personal Computer Control System Using Voice Command (음성 명령을 이용한 개인용 컴퓨터 조작 시스템의 구현)

  • Lee, Tae Jun;Kim, Dong Hyun
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.101-102
    • /
    • 2018
  • Users who work at a computer for long periods may experience wrist fatigue or pain from prolonged keyboard and mouse use, and people with physical disabilities may find the keyboard and mouse difficult to use. Existing substitute products are either limited in function or expensive. In this paper, we develop a system for controlling a personal computer with voice commands using the Amazon Echo and Amazon Web Services Lambda functions. The implemented system processes the user's voice commands on the Amazon web server and sends them to the personal computer. The personal computer processes each received command and uses it to operate the application program.


Diction Problem of Student Singers Based on the Vocal Tract Resonance (성도 공명을 중심으로 한 성악 전공 대학생의 발음법 연구)

  • Kim, Sun-Suk
    • Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.59-72
    • /
    • 2000
  • Vocal tract resonances are of paramount importance to voice sounds. Resonance frequencies determine vowel quality and personal voice timbre. The aim of this study was to develop an effective diction program based on tuning formant frequencies by adjusting the vocal tract shape in professional voice users. Twelve male and eleven female student singers participated in this study. The subjects repeated the five simple vowels /a, e, i, o, u/ in normal speech and in singing. Formant frequencies and the singer's formant frequencies of the spoken and sung vowels were measured using CSL and DSP Sona-Graph. Separately, the Plot formants program was used to draw the vowel chart. The results were as follows. (1) Total formant frequencies of female singers were 11% higher than those of male singers in singing. (2) The F1 and F3 of sung vowels increased compared to the F1 and F3 of spoken vowels. However, the F2 of sung vowels decreased in comparison with the F2 of spoken vowels. (3) The posterior vowel /u/ was moved anteriorly. This phenomenon seemed to be due to head voice singing training. (4) The singer's formant frequencies of the student singers varied by voice part: 2560 Hz for baritone, 2760 Hz for tenor, 2821 Hz for mezzo-soprano, and 3420 Hz for soprano.


Transmission of Channel Error Information over Voice Packet (음성 패킷을 이용한 채널의 에러 정보 전달)

  • 박호종;차성호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.394-400
    • /
    • 2002
  • In digital speech communications, the quality of service can be increased by a speech coding scheme that adapts to the error rate of voice packet transmission. However, current communication protocols in cellular and Internet communications do not provide a function for transmitting channel error information. To solve this problem, this paper proposes a new method for real-time transmission of channel error information, in which the channel error information is embedded in the voice packet. The proposed method utilizes the pulse positions of the codevector in the ACELP speech codec, which results in little degradation in speech quality and a low false alarm rate. Simulations with various speech data show that the proposed method meets the requirements for speech quality, detection rate, and false alarm rate.

Deep Learning based Singing Voice Synthesis Modeling (딥러닝 기반 가창 음성합성(Singing Voice Synthesis) 모델링)

  • Kim, Minae;Kim, Somin;Park, Jihyun;Heo, Gabin;Choi, Yunjeong
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.127-130
    • /
    • 2022
  • This paper studies singing voice synthesis modeling using a generator loss function. It analyzes various factors that may arise when applying BEGAN, a deep learning algorithm optimized for image generation, to the audio domain, and conducts experiments to derive optimal quality. We focus on the problem that the L1 loss proposed in BEGAN-based models degrades the meaning of the hyperparameter gamma (𝛾), which was defined to control the diversity and quality of generated audio samples. Our experiments show that the proposed method, combined with tuning to find the optimal values, can contribute to improving the quality of the synthesized singing.


Study on Assessment and Treatment Patterns of Speech-Language Pathologists in Pediatric Vocal Problem Through Multicenter Survey (다기관 설문조사를 통한 국내 소아 음성질환 환자의 검사 및 치료 유형 연구)

  • Lee, Jong-Geun;Bang, Seung-Hwan;Jeon, Jae-Min;Lee, Jung-Kyu;Kim, Angela Yun;Woo, Jeong-Soo;Cho, Jae-Gu
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.30 no.1
    • /
    • pp.39-47
    • /
    • 2019
  • Background and Objectives: Pediatric vocal health problems are relatively common. However, it has not yet been well studied whether diagnosis and treatment are carried out uniformly and properly in South Korea. The purpose of this study was to investigate the methods that Korean speech therapists use to diagnose and treat pediatric voice problems. Materials and Method: An anonymous online questionnaire covering demographics, employment institution, and general management of pediatric patients with vocal problems, including assessment and treatment procedures, was administered to 32 speech-language therapists registered at the Korean laryngeal speech linguistics society. Results: Current practice patterns were analyzed for the 32 speech-language therapists providing services in South Korea, most of whom worked at tertiary university hospitals. One third of pediatric patients were assessed without proceeding to treatment. One fifth of patients were treated without assessment. Perceptual assessment was the main pretreatment assessment method used. Treatments were used in the following order: voice rest, SOVT, yawn-sigh, and resonant voice. Post-treatment evaluations were used in the following order: instrumental assessment, clinical judgment, and recording comparison. Conclusion: The practice of speech-language therapists in South Korea mostly follows the ASHA practice guidelines. However, there is still a large number of cases in which only the evaluation was done, without appropriate treatment. Further research is needed to make SLPs more systematic and efficient in evaluating and treating pediatric vocal patients.

A Proposal of Eye-Voice Method based on the Comparative Analysis of Malfunctions on Pointer Click in Gaze Interface for the Upper Limb Disabled (상지장애인을 위한 시선 인터페이스에서 포인터 실행 방법의 오작동 비교 분석을 통한 Eye-Voice 방식의 제안)

  • Park, Joo Hyun;Park, Mi Hyun;Lim, Soon-Bum
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.4
    • /
    • pp.566-573
    • /
    • 2020
  • Computers are the most common tool for using the Internet, and a mouse is typically used to select and execute objects. Eye tracking is welcomed as an alternative technology that helps users who cannot use their hands due to disabilities control computers. However, the pointer execution methods of existing eye tracking techniques cause many malfunctions. Therefore, in this paper, we developed a gaze tracking interface combined with voice commands to solve the malfunction problem that arises when users with upper limb disabilities use existing gaze tracking technology to execute computer menus and objects. Usability was verified through comparative experiments on the reduction of malfunctions. Users with upper limb disabilities move the pointer with eye tracking and issue a voice command, such as "okay", while browsing the computer screen to click instantly. Comparing pointer execution against existing gaze interfaces, we verified that our system, Eye-Voice, reduced the malfunction rate of pointer execution and is effective for users with upper limb disabilities.
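
The interaction described in this abstract (gaze moves the pointer, a spoken word such as "okay" triggers the click) can be illustrated with a minimal sketch. The event tuples and the function name `eye_voice_clicks` are hypothetical and chosen only for illustration; this is not the authors' implementation, and a real system would read from an eye tracker and a speech recognizer.

```python
from typing import Iterable, List, Optional, Tuple


def eye_voice_clicks(events: Iterable[tuple], click_word: str = "okay") -> List[Tuple[int, int]]:
    """Return the pointer positions at which a click would be issued.

    Assumed (hypothetical) event format:
      ("gaze", x, y)   -> the gaze tracker reports a new pointer position
      ("voice", word)  -> the speech recognizer reports a recognized word
    """
    pointer: Optional[Tuple[int, int]] = None  # last known gaze position
    clicks: List[Tuple[int, int]] = []
    for event in events:
        if event[0] == "gaze":
            # Gaze only moves the pointer; it never clicks by itself.
            pointer = (event[1], event[2])
        elif event[0] == "voice":
            # Only the click word triggers a click, and only once the
            # pointer position is known.
            if event[1].lower() == click_word and pointer is not None:
                clicks.append(pointer)
    return clicks
```

Separating pointer movement from click execution in this way means that looking at an object never activates it on its own, which is one plausible reading of how a voice trigger could reduce unintended clicks.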

Singing Voice Synthesis Using HMM Based TTS and MusicXML (HMM 기반 TTS와 MusicXML을 이용한 노래음 합성)

  • Khan, Najeeb Ullah;Lee, Jung-Chul
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.5
    • /
    • pp.53-63
    • /
    • 2015
  • Singing voice synthesis is the generation of a song by a computer given its lyrics and musical notes. Hidden Markov models (HMMs) have proved to be the models of choice for text-to-speech synthesis. HMMs have also been used in singing voice synthesis research; however, a huge database is needed to train HMMs for singing voice synthesis. In addition, commercially available singing voice synthesis systems, which use piano roll music notation, need to adopt easy-to-read standard music notation to be suitable for singing learning applications. To overcome these problems, we use a speech database to train context-dependent HMMs for singing voice synthesis. Pitch and duration control methods have been devised to modify the parameters of the HMMs trained on speech so that they can be used as synthesis units for the singing voice. This work describes a singing voice synthesis system that uses a MusicXML-based music score editor as the front-end interface for entering the notes and lyrics to be synthesized, and an HMM-based text-to-speech synthesis system as the back-end synthesizer. A perceptual test shows the feasibility of the proposed system.
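
As a rough illustration of the kind of pitch and duration targets a MusicXML score provides, the sketch below converts a note's `<step>`, `<alter>`, and `<octave>` values to a frequency in equal temperament, and its `<duration>` to seconds. The function names, the 440 Hz reference for A4, and the assumption that the tempo counts quarter notes per minute are choices made here for illustration; the abstract does not describe how the authors' system computes these values.

```python
# Semitone offsets of the natural note names within an octave.
SEMITONES = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}


def pitch_to_midi(step: str, octave: int, alter: int = 0) -> int:
    """Map MusicXML <step>, <octave>, <alter> values to a MIDI note number."""
    return 12 * (octave + 1) + SEMITONES[step] + alter


def midi_to_hz(midi: int, a4_hz: float = 440.0) -> float:
    """Equal-temperament frequency, with MIDI note 69 (A4) as the reference."""
    return a4_hz * 2.0 ** ((midi - 69) / 12.0)


def note_seconds(duration: int, divisions: int, bpm: float) -> float:
    """Note length in seconds.

    MusicXML expresses <duration> in units of <divisions> per quarter note;
    bpm is assumed here to count quarter notes per minute.
    """
    return (duration / divisions) * 60.0 / bpm
```

For example, `midi_to_hz(pitch_to_midi("A", 4))` gives the 440 Hz reference, and a note with `duration=2` and `divisions=1` at 120 bpm lasts one second.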

A Study on support QoS using Traffic Engineering in WDM Network (WDM에서 트래픽 엔지니어링을 이용한 QoS 보장에 관한 연구)

  • 김용성;김장복
    • Proceedings of the IEEK Conference
    • /
    • 2000.11a
    • /
    • pp.41-44
    • /
    • 2000
  • Because of the Internet's growth, today's networks have a serious bandwidth problem. WDM (Wavelength Division Multiplexing) is a solution to this problem. In WDM networks, QoS (Quality of Service) is as important as bandwidth. Today's voice over IP technology generates a lot of delay-sensitive Internet traffic, and as this traffic grows, more QoS is needed. We propose an effective solution for assigning QoS.


Voice sensor based PSP timelog collection

  • Ibrahim, Ahmad;Choi, Ho-Jin
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2008.06b
    • /
    • pp.87-90
    • /
    • 2008
  • The purpose of this research is to solve the problem of automating time and schedule management for users in an office or development environment. Maintaining a timelog manually is a difficult task for users who follow the Personal Software Process (PSP). In this paper, we discuss the difficulties in automating this task and propose a solution to this problem.


COMMUNICATION NETWORK DESIGN PROBLEMS USING THE FUZZY SET APPROACH

  • Jin, Chan-yong;Park, Ryun-;Kim, Sam-Soo-
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.1334-1337
    • /
    • 1993
  • In this study, we newly formulated the link capacity allocation problem and the link capacity allocation and routing problem in a voice/data integrated network using the fuzzy set concept. We developed efficient algorithms for these fuzzified problems and showed that fuzzy set theory is a powerful tool for design problems in communication networks.
