• Title/Summary/Keyword: Voice communication

Search Result 1,029, Processing Time 0.022 seconds

A Study on Stable Motion Control of Humanoid Robot with 24 Joints Based on Voice Command

  • Lee, Woo-Song;Kim, Min-Seong;Bae, Ho-Young;Jung, Yang-Keun;Jung, Young-Hwa;Shin, Gi-Soo;Park, In-Man;Han, Sung-Hyun
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.21 no.1
    • /
    • pp.17-27
    • /
    • 2018
  • We propose a new approach to control a biped robot motion based on iterative learning of voice command for the implementation of smart factory. The real-time processing of speech signal is very important for high-speed and precise automatic voice recognition technology. Recently, voice recognition is being used for intelligent robot control, artificial life, wireless communication and IoT application. In order to extract valuable information from the speech signal, make decisions on the process, and obtain results, the data needs to be manipulated and analyzed. Basic method used for extracting the features of the voice signal is to find the Mel frequency cepstral coefficients. Mel-frequency cepstral coefficients are the coefficients that collectively represent the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. The reliability of voice command to control of the biped robot's motion is illustrated by computer simulation and experiment for biped walking robot with 24 joint.

MAC Protocol based on Spreading Code Status-Sensing Scheme for Integrated Voice/Data Services (확산코드 상태 감지 기법에 의한 통합 음성/데이터 서비스 MAC 프로토콜)

  • 임인택
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.5 no.5
    • /
    • pp.916-922
    • /
    • 2001
  • A medium access control protocol is proposed for integrated voice and data services in the packet CDMA network with a small coverage. Uplink channels are composed of time slots and multiple spreading codes for each slot. This protocol gives higher access priority to the delay-sensitive voice traffic than to the data traffic. During a talkspurt, voice terminals reserve a spreading code to transmit multiple voice packets. On the other hand, whenever generating a data packet, data terminals transmit a packet based on the status Information of spreading codes in the current slot, which is received from base station. In this protocol, voice packet does not come into collision with data packet. Therefore, this protocol can increase the maximum number of voice terminals.

  • PDF

A Study on the design of voice cryptograph system (음성암호시스템 설계에 관한 연구)

  • Choi, Tae-Sup;Ahn, In-Soo
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.2
    • /
    • pp.51-59
    • /
    • 2002
  • In this paper, we studied the voice cryptograph system designed by the SEED algorithm for the safe transmission and receipt on the voice communication. Voice band signal converts to digital signal by the CODEC and DSP that applied the improved SEED algorithm encrypt the digital signal. The CODEC convert Encryption signal into analog voice signal. This voice signal is transmitted safely because of encryption signal even if someone wiretap. Receiver can hear the source voice, because the encryption signal decrypted using the SEED algorithm. In this paper, We designed the 32 round key instead of 16 round key in the SEED algorithm so that we improve the truncated differential probability from $2^{-143.1}$ to $2^{-286.6}$

Discussions on Auditory-Perceptual Evaluation Performed in Patients With Voice Disorders (음성장애 환자에서 시행되는 청지각적 평가에 대한 논의)

  • Lee, Seung Jin
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.3
    • /
    • pp.109-117
    • /
    • 2021
  • The auditory-perceptual evaluation of speech-language pathologists (SLP) in patients with voice disorders is often regarded as a touchstone in the multi-dimensional voice evaluation procedures and provides important information not available in other assessment modalities. Therefore, it is necessary for the SLPs to conduct a comprehensive and in-depth evaluation of not only voice but also the overall speech production mechanism, and they often encounter various difficulties in the evaluation process. In addition, SLPs should strive to avoid bias during the evaluation process and to maintain a wide and constant spectrum of severity for each parameter of voice quality. Lastly, it is very important for the SLPs to perform a team approach by documenting and delivering important information pertaining to auditory-perceptual characteristics in an appropriate and efficient way through close communication with the laryngologists.

Service Quality Criteria for Voice Services over a WiBro Network (와이브로 네트워크를 통한 음성 서비스의 측정 기반 품질 기준 수립)

  • Kim, Beom-Joon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.6 no.6
    • /
    • pp.823-829
    • /
    • 2011
  • This paper covers the service quality of packet-based voice service that is provided over a wireless broadband (WiBro) network. Using a measurement software that has been developed in the course of preparing a advanced service quality management scheme for the packet-based voice service over a wireless network[2][3], a huge scale of experiment is conducted to measure the real quality of the voice service. Based on our analysis of the measurement result, the service quality of the voice service is supposed to be quite good over WiBro networks. In addition, another experiment to investigate the effect of degradation of wireless transmission conditions on the service quality of the voice service shows the values of wireless service metris in which mean opinion score (MOS) starts to decrease.

Service Quality Criteria for Voice Services over a HSDPA System (HSDPA 시스템을 통한 음성 서비스의 측정 기반 품질 기준 수립)

  • Kim, Beom-Joon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.7 no.2
    • /
    • pp.249-255
    • /
    • 2012
  • This paper covers the service quality of packet-based voice service that is provided over a high speed downlink packet access (HSDPA) system. Using the measurement software that has been developed in the course of preparing a advanced service quality management scheme for the packet-based voice service over a wireless network[2][3], a huge scale of experiment is conducted to measure the real quality of the voice service. Based on our analysis of the measurement result, the service quality of the voice service is supposed to be quite good over HSDPA system. In addition, another experiment to investigate the effect of degradation of wireless transmission conditions on the service quality of the voice service shows the values of wireless service metrics in which mean opinion score (MOS) starts to decrease.

Effects of EAI and VAS on perceptual judgement and confidence rating by listeners for voice disorders (청지각적 평가 방식에 따른 음성장애 심한 정도 판단과 자가 신뢰도에 대한 차이)

  • Lee, Ok-Bun;Kim, Sun-Hee;Jeong, Hanjin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.5
    • /
    • pp.3046-3050
    • /
    • 2014
  • The purpose of the present study was to evaluate the effect of 7-point interval scale(EAI) and visual analogue scale(VAS) on perceptual judgement and the reliability of severity on voice problems by dysphonic speakers. 30 undergraduate students studying communication disorder were enrolled in the perceptual evaluation. Those listeners judged overall voice severity within the anchored(condition 1) and non-anchored scales(condition 2) for vowel prolongation and reading tasks by 25 speakers with voice disorder. The results of this study showed that the scores by VAS was significantly higher than EAI in both condition 1 and condition 2 for vowel prolongation and reading task. However, the scores by EAI method was higher than by VAS method on voice severity of vowel prolongation (condition 1) and reading task(condition 2). These results suggest auditory-perceptual scaling procedures must be more studied in the aspects of clinical application of voice disorder.

Kiosk for the Visually Impaired using Voice Recognition (음성인식 기능을 이용한 시각장애인용 키오스크)

  • Kim, Dae-Young;Lee, Ah-Hyun;Lee, Gun-Haeng;Kim, Se-Hyun;Lee, Boong-Joo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.5
    • /
    • pp.873-882
    • /
    • 2022
  • In this paper, we studied the voice recognition system kiosk for convenience, thinking that the kiosk widely used in modern society should compensate for the inconvenience of using by the visually impaired. Using ultrasonic sensor and PIR(Passive Infrared), it recognizes the visually impaired within the range of 80cm-40cm, introduces the kiosk through the MP3 module and induces them to come closer. Also, when the visually impaired within 40cm is recognized, the product description and order are guided through the MP3 module. A recording-based data voice recognition system and a kiosk that outputs desired items through servo motors were studied. A kiosk for the convenience of the visually impaired was manufactured through operation and optimization experiments of PIR, ultrasonic, voice recognition, and shock sensor for the manufactured voice recognition kiosk. Finally, it was confirmed that security can be strengthened by using shock sensors and emergency bells to enhance security.

Study on Motivation and Satisfaction of Voice Chat Service (음성채팅서비스사용자의이용동기와만족감)

  • Eunji Lee
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.1
    • /
    • pp.205-210
    • /
    • 2024
  • Nowadays, online messengers are the main communication tool of modern people. Currently, not only messengers that communicate based on text and images, but also services that can interact in real time through voice or screen sharing are actively used by the MZs. This study aims to figure out 1) the motivation of users of voice chat services, and 2) to explore the influence of motivation for use on satisfaction that one of the factors that determine the user's experience. As a result, five major motivations for using voice chat service(Relationship formation, Usefulness, Relationship maintenance, communication supplementation, and distance overcoming) were found. Among them 'Usefulness' and 'Relationship maintenance had a positive effect on user satisfaction. This study, highlighted the various needs of users who communicate in a non-face-to-face environments as well as factors to be satisfied for their positive experiences. These results should be actively used in the online communications market.

Design and Development of Open-Source-Based Artificial Intelligence for Emotion Extraction from Voice

  • Seong-Gun Yun;Hyeok-Chan Kwon;Eunju Park;Young-Bok Cho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.9
    • /
    • pp.79-87
    • /
    • 2024
  • This study aims to improve communication for people with hearing impairments by developing artificial intelligence models that recognize and classify emotions from voice data. To achieve this, we utilized three major AI models: CNN-Transformer, HuBERT-Transformer, and Wav2Vec 2.0, to analyze users' voices in real-time and classify their emotions. To effectively extract features from voice data, we applied transformation techniques such as Mel-Frequency Cepstral Coefficient (MFCC), aiming to accurately capture the complex characteristics and subtle changes in emotions within the voice. Experimental results showed that the HuBERT-Transformer model demonstrated the highest accuracy, proving the effectiveness of combining pre-trained models and complex learning structures in the field of voice-based emotion recognition. This research presents the potential for advancements in emotion recognition technology using voice data and seeks new ways to improve communication and interaction for individuals with hearing impairments, marking its significance.