• Title/Summary/Keyword: Digital Voice

Search Result 386, Processing Time 0.024 seconds

Recognition of Individual Cattle by His and /or Her Voice

  • Yoshio, Ikeda;Yohei, Ishii
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 1998.06b
    • /
    • pp.270-275
    • /
    • 1998
  • It was assumed that the voice of cattle is generated with the virtual white noise through the digital filter called the linear prediction filter, and filter parameters (prediction coefficients) were estimated by the maximum entropy method (MEM) , using the sound signal of the animal . The feature planes were defined by the pairs of two parameters selected appropriately from these parameters. The cattle voices were divided into three levels, that is the high, medium and low levels according to their total power equivalent to the variances of the sound signal . It was found that the straight lines could be used for recognizing tow cow and one calf for high level voices. For high and medium level voices, however, it was difficult or impossible to recognize individual cattle on the parameters planes.

  • PDF

Design and Implementation of Speech-Training System for Voice Disorders (발성장애아동을 위한 발성훈련시스템 설계 및 구현)

  • 정은순;김봉완;양옥렬;이용주
    • Journal of Internet Computing and Services
    • /
    • v.2 no.1
    • /
    • pp.97-106
    • /
    • 2001
  • In this paper, we design and implement complement based speech training system for voice disorder. The system consists of three level of training: precedent training, training for speech apprehension and training for speech enhancement. To analyze speech of voice disorder, we extracted speech features as loudness, amplitude, pitch using digital signal processing technique. Extracted features are converted to graphic interface for visual feedback of speech by the system.

  • PDF

A study for maximum channelizing by FIR filter in voice band (음성대역에서 FIR필터에 의한 최대 채널화에 관한 연구)

  • Kim, Seong-Cheol;Park, Kyung-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.11 no.8
    • /
    • pp.1472-1477
    • /
    • 2007
  • Users are offered by the multimedia service of various information on current information-oriented society. The digitize became essential that process of various data is not to selected. Also, Filter technology is required to use the lacking frequency resources efficiently. This paper designs FIR digital band-pass filter of the voice band by narrow band pass filter md verify the characteristics of filter to use by the DSP practice SET.

A Study on Inspection Reliability Evaluation of Electric Rice Cooker FCT Inspection Automation System (전기밥솥 FCT 검사 자동화 System의 검사 신뢰성 평가에 관한 연구)

  • Jeong, Hae-Jin;Lee, Jong-Chan
    • Journal of the Korean Society of Manufacturing Process Engineers
    • /
    • v.21 no.6
    • /
    • pp.30-35
    • /
    • 2022
  • This study has focused on the reliability evaluation of FCT inspection automation equipment for electric rice. To evaluate the reliability of FCT inspection automation equipment, voice analysis, Gray/R/G/B channel experiment, FND segment experiment, and robot position repeatability were performed. In the voice analysis experiment, the comparison value between the recorded and digital output waves was over 99%, indicating a very high result. It was confirmed that both the gray/R/G/B experiment using vision and the FND segment could confirm the output value of the product through vision. The position repeatability of the robot is also excellent, so it is concluded that the inspection effect through the FCT automation system will be excellent.

Capacity Evaluation of a Digital Cellular CDMA System for Reverse Link (역방향 링크에 대한 디지털 셀룰러 CDMA 시스템의 용량 평가)

  • Park, Yong-Seo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.1
    • /
    • pp.48-55
    • /
    • 1995
  • The capacity of a digital cellular CDMA system is evaluated by computer simulation for reverse link (mobile-to-base) with 37 hexagonal cells including 3 rings from a center cell. It is assumed that the channels have shadow fading and the system is ideally power controlled. In this paper the capacity of CBMA system is evaluated for various propagation exponents, voice activity factors and neighboring cell traffics. The following results are obtained. The capacity of CDMA system is increased according to the increase of the propagation exponents and the decrease of the voice activity factors. Its capacity is about 15 and 5 times larger thari that of analog cellular FM/FDMA and digital cellular TDMA for $\gamma=4$ respectively, and is very sensitive to the neighboring cell traffics.

  • PDF

Verification of AI Voice User Interface(VUI) Usability Evaluation : Focusing on Chinese Navigation VUI (인공지능 음성사용자 인터페이스 사용성 평가 기준 검증 : 중국 내비게이션 VUI를 중심으로)

  • Zhou, Yi Mou;Shang, Lin Rru;Lim, Hyun Chan;Hwang, Mi Kyung
    • Journal of Korea Multimedia Society
    • /
    • v.24 no.7
    • /
    • pp.913-921
    • /
    • 2021
  • After arranging the general usability evaluation criteria of existing VUI researchers, this study verified how appropriate these criteria are for AI VUI specialized in navigation and the priority of their suitability. The VUI used in this study was analyzed through a survey from a total of 195 Chinese users after analyzing the navigation VUI used in China. As a result of the analysis, the usability evaluation criteria of the navigation VUI were extracted from three sub-factors of 'task accuracy', 'function satisfaction', and 'information reliability' in verifying conformance with general VUI evaluation criteria. With the recent advent of self-driving cars, safety and response speed are becoming very important, so Chinese users also ranked responsiveness as the top priority in VUI design, and the importance was also found to be high. Also, both men and women have the highest reactivity and the lowest multiplicity. VUI requires a convenient and natural interface to understand the intention between two objects through usability evaluation and verification in order to have effective interaction between humans and machines.

Vocal acoustic characteristics of speakers with depression (우울증 화자 음성의 음향음성학적 특성)

  • Baek, Yeon-Sook;Kim, Se-Joo;Kim, Eun-Yeon;Choi, Yae-Lin
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.91-98
    • /
    • 2012
  • The purposes of this paper is to study the characteristics of compared to the speakers voice without depression and speakers with depression, and to propose a objective method for the measurement of the therapeutic effects as well as for diagnostics of depression based on the characteristics. The voice samples obtained from 11 female speakers with depression, aged from 20 to 40, diagnosed as having major depressive disorder by an psychiatrist were compared with those from 12 normal controls with matched sex, age, height, weight, education, smoking, and drinking. The voice samples are taken by a portable digital recorder(TASCAM DR-07, Japan) and analysed using the MDVP(Multi-Dimentional Voice Program) software module from CSL(Computerized Speech Lab, kay elemetrics, co, model 4100). The result of the investigation are as following. First, the average speaking fundamental frequency and loudness range of the speakers with depression group was statistically significantly lower than that of the control group. The pitch range of the control group was rather higher than that of the speakers with depression group, but without statistical significance. Overall speech rates have no statistical difference between two groups. Second, the average speaking fundamental frequency and loudness range have statistically significant negative correlation with Beck Depression Inventory, i. e. more severe depression exhibits lower average speaking fundamental frequency and loudness range. Other vocal parameters such as pitch range and overall speech rate have no statistically meaningful correlations with Beck Depression Inventory.

Transmission of Channel Error Information over Voice Packet (음성 패킷을 이용한 채널의 에러 정보 전달)

  • 박호종;차성호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.394-400
    • /
    • 2002
  • In digital speech communications, the quality of service can be increased by speech coding scheme that is adaptive to the error rate of voice packet transmission. However, current communication protocol in cellular and internet communications does not provide the function that transmits the channel error information. To solute this problem, in this paper, new method for real-time transmission of channel error information is proposed, where channel error information is embedded in voice packet. The proposed method utilizes the pulse positions of codevector in ACELP speech codec, which results in little degradation in speech quality and low false alarm rate. The simulations with various speech data show that the proposed method meets the requirement in speech quality, detection rate, and false alarm rate.

Implementation of QoS-Measuring System for Voice over IP (VoIP(Voice over Internet Protocol) 품질 측정을 위한 UA(User Agent) 및 서버 기능 연구)

  • Kang, Hyun-Joong;Nam, Heung-Woo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.1 s.45
    • /
    • pp.137-144
    • /
    • 2007
  • Advances in networking technology digital media, and codecs have made it possible for the Internet evolves into a Broadband convergence Network (BcN) and provides various services including Voice over Internet Protocol (VoIP) and IPTV over their high-speed IP networks. In order for the Internet to make a profit as traditional Public Switched Telephone Network (PSTN), it must provide high qualify VoIP services. Therefore, real time qualify measurement framework is the most important requisite to provide VoIP service. For this, IETF (Internet Engineering Task Force) defined RTCP-Extended Reports (RTCP-XR) that extend RTCP (Real-Time Transport Protocol Control Protocol). However, procedure and method tot actually VoIP qualify measurement did not recommended nothing but defined item to measure voice quality. Our objective in this paper is to describes a practical measuring framework for end-to-end QoS of switched voice packet in an IP environment. It includes concepts as well as step-by-step procedures for measuring packetized voice streams. It also proposes new formats that extend RTCP-XR's concept.

  • PDF

The Advanced Digital Special Images and Technology

  • Nakajima, Masayuki
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1996.06b
    • /
    • pp.50-55
    • /
    • 1996
  • Multimedia boom has happened worldwide these days. In multimedia, we use several kinds of media such as character, figure, voice, music, still images, moving picture etc.. Then I think image including moving picture is the most effective and important media for human being. Creating digital images using a computer has the following two main approaches, depending on how the computer is used. 1. CG Technology. Created images, produced through computer graphics. 2. Digital Image Processing. Images processed through digital image processing technologies. Approach (1) is very popular as Computer Graphics. Two-dimensional and three-dimensional computer graphics techniques are used over wide applications today. On the other hand, Approach (2), which uses digital image processing technology, has been attracting attention lately, in the filed of movies and television. In this report, I will introduce these approaches of CG and digital image processing, and show some application fields such as current movies.

  • PDF