• 제목/요약/키워드: Digital Voice

검색결과 384건 처리시간 0.026초

디지털 음성 및 영상 처리용 SOC를 위한 ADPCM CODEC 코어의 설계 (A Design of ADPCM CODEC Core for Digital Voice and Image Processing SOC)

  • 정중완;홍석일;한희일;조경순
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2001년도 하계종합학술대회 논문집(2)
    • /
    • pp.333-336
    • /
    • 2001
  • This paper describes the design and implementation results of 40, 32, 24 and 16kbps ADPCM encoder and decoder circuit, based on the protocol CCITT G.726. We verified the ADPCM algorithm using C language and designed the RTL circuit with Verilog HDL. The circuit has been simulated by Verilog-XL, synthesized by Design Compiler and verified using Xilinx FPGA. Since the synthesized circuit includes a small number of gates, it is expected to be used as a core module in the digital voice and image processing SOC.

  • PDF

Pathological Vibratory patterns of the Vocal Folds Observed by the High Speed Digital Imaging System

  • Niimi, Seiji
    • 대한음성언어의학회:학술대회논문집
    • /
    • 대한음성언어의학회 1998년도 제10회 학술대회 심포지움
    • /
    • pp.208-209
    • /
    • 1998
  • It is generally known that many cases of pathological rough voice are characterized not by simple random perturbations but by quasi-periodic perturbations in the speech wave. However, there are few studies on the characteristics of perturbations in vocal fold vibrations associated with this type of voice. We have been conducting studies of pathological vocal fold vibration using a high-speed digital image recording system developed by our institute, Compared to the ordinary high-speed-motion picture system, the present system is compact and simple to operate and thus, it suited for pathological data collection. (omitted)

  • PDF

직업적 음성사용자의 음성증상 및 '음성건강' 관련 서비스 인지도 조사 (A Survey on the voice symptoms and vocal-health service related experience of occupational voice users)

  • 이은정
    • 디지털융복합연구
    • /
    • 제13권1호
    • /
    • pp.397-405
    • /
    • 2015
  • 본 연구는 직업적 음성사용자들의 음성증상 및 음성건강 관련 서비스 인지도를 알아보기 위해 실시되었다. 교사, 텔레마케터, 치료사들을 대상으로 음성증상의 유무 및 유형, 음성건강 관련 서비스 인지도를 알아본 결과 교사(91.8%), 텔레마케터(97.9%), 치료사(86%)들은 한 가지 이상의 음성증상을 보고하였다. 증상 유형은 '열감, 마름, 마른기침, 통증, 가래생김, 따끔거림, 쉼, 목소리 갈라짐, 부어오름'의 9가지로 분류되었고, 세 집단 모두에서 '마름' 증상이 가장 많았다. 교사의 85.7%, 텔레마케터의 87.8%, 치료사의 66%는 음성사용 관련 전문가의 도움을 받은 경험이 없었으며, '음성언어치료사'와 '언어치료사' 모두를 아는 경우는 각각 19.6%, 19.9%, 72%였다. 음성의 효율적 사용법에 대해 교사의 36.8%, 텔레마케터의 43.6%가 잘 알지 못한다고 하였으며, 교사의 45.3%, 텔레마케터의 43.6%, 치료사의 28%는 음성전문가의 도움이 필요하다고 답했다. 조사 결과, 직업적 음성사용자들의 상당수가 음성증상을 경험하지만 음성건강 관련 전문적 서비스에 대한 인지도는 낮은 것으로 나타났다.

음성인식 기술을 이용한 대화식 언어 학습기 개발 (Development of Language Study Machine Using Voice Recognition Technology)

  • 유재택;윤태섭
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2005년도 학술대회 논문집 정보 및 제어부문
    • /
    • pp.201-203
    • /
    • 2005
  • The best method to study language is to talking with a native speaker. A voice recognition technology can be used to develope a language study machine. SD(Speaker dependant) and SI(speaker independant) voice recognition method is used for the language study machine. MP3 Player. FM Radio. Alarm clock functions are added to enhance the value of the product. The machine is designed with a DSP(Digital Signal Processing) chip for voice recognition. MP3 encoder/decoder chip. FM tumer and SD flash memory card. This paper deals with the application of SD ad SD voice recognition. flash memory file system. PC download function using USB ports, English conversation text function by the use of SD flash memory. LCD display control. MP3 encoding and decoding, etc. The study contents are saved in SD flash memory. This machine can be helpful from child to adult by changing the SD flash memory.

  • PDF

The Movements of Vocal Folds during Voice Onset Time of Korean Stops

  • Hong, Ki-Hwan;Kim, Hyun-Ki;Yang, Yoon-Soo;Kim, Bum-Kyu;Lee, Sang-Heon
    • 음성과학
    • /
    • 제9권1호
    • /
    • pp.17-26
    • /
    • 2002
  • Voice onset time (VOT) is defined as the time interval from the oral release of a stop consonant to the onset of glottal pulsing in the following vowel. VOT is a temporal characteristic of stop consonants that reflects the complex timing of glottal articulation relative to supraglottal articulation. There have been many reports on efforts to clarify the acoustical and physiological properties that differentiate the three types of Korean stops, including acoustic, fiberscopic, aerodynamic and electromyographic studies. In the acoustic and fiberscopic studies for stop consonants, the voice onset time and glottal width during the production of stops has been known as the longest and largest in the heavily aspirated type followed by the slightly aspirated type and unaspirated types. The thyroarytenoid and posterior cricoarytenoid muscles were physiologically inter-correlated for differentiating these types of stops. However, a review of the English literature shows that the fine movement of the mucosal edges of the vocal folds during the production of stops has not been well documented. In recent. years, a new method for high-speed recording of laryngeal dynamics by use of a digital recording system allows us to observe with fine time resolution. The movements of the vocal fold edges were documented during the period of stop production using a fiberscopic system of high speed digital images. By observing the glottal width and the visual vibratory movements of the vocal folds before voice onset, the heavily aspirated stop was characterized as being more prominent and dynamic than the slightly aspirated and unaspirated stops.

  • PDF

Development of Compact Auto Focus Actuator for Camera Phone by Applying New Electromagnetic Configuration

  • Chung, Myung-Jin;Son, Sung-Yong
    • Journal of Mechanical Science and Technology
    • /
    • 제20권12호
    • /
    • pp.2087-2093
    • /
    • 2006
  • In this paper, auto focus actuator, which is used to move a lens module in the mobile phone having a camera module, is developed. Camera module containing auto focus actuator requires to minimize total size because of characteristics of the application area such as mobile phone, digital camera, and personal digital assistant. There are stepping motor, voice coil motor, and piezoelectric motor as auto focus actuator. In this paper, voice coil motor having new electromagnetic configuration is proposed. And actuator using proposed voice coil motor is developed by optimal design method using magnetic circuit analysis. The sectional area of the developed actuator is reduced to 32.4% compared with actuator using general electromagnetic configuration. From the performance test, the developed actuator has moving stroke of 0.64 mm for 2.1 volt, hysteresis of 40 $\mu$m, full stroke current of 54 mA, and unit step motion of 3 $\mu$m.

장애인을 위한 멀티모달 인터페이스 기반의 홈 네트워크 제어 (Home Automation Control with Multi-modal Interfaces for Disabled Persons)

  • 박희동
    • 디지털융복합연구
    • /
    • 제12권2호
    • /
    • pp.321-326
    • /
    • 2014
  • 최근 장애인을 위한 IT 접근성 향상 기술에 대한 요구가 증대되고 있다. 따라서 장애인 IT 사용자를 위하여 음성 인식, 영상 인식, TTS 등과 같은 멀티모달 인터페이스를 지원하는 것이 매우 중요하다. 본 논문에서는 홈 네트워크 제어에 있어서 장애인 IT 접근성 향상 기술의 적용 방안에 대하여 서술한 후, 장애인이 쉽게 홈 네트워크를 제어할 수 있도록 음성 인식 및 애니메이션 UI (User interfaces)등과 같은 멀티모달 인터페이스 기반의 홈 네트워크 제어 시스템 모델을 구현하였다.

Terminal-Assisted Hybrid MAC Protocol for Differentiated QoS Guarantee in TDMA-Based Broadband Access Networks

  • Hong, Seung-Eun;Kang, Chung-Gu;Kwon, O-Hyung
    • ETRI Journal
    • /
    • 제28권3호
    • /
    • pp.311-319
    • /
    • 2006
  • This paper presents a terminal-assisted frame-based packet reservation multiple access (TAF-PRMA) protocol, which optimizes random access control between heterogeneous traffic aiming at more efficient voice/data integrated services in dynamic reservation TDMA-based broadband access networks. In order to achieve a differentiated quality-of-service (QoS) guarantee for individual service plus maximal system resource utilization, TAF-PRMA independently controls the random access parameters such as the lengths of the access regions dedicated to respective service traffic and the corresponding permission probabilities, on a frame-by-frame basis. In addition, we have adopted a terminal-assisted random access mechanism where the voice terminal readjusts a global permission probability from the central controller in order to handle the 'fair access' issue resulting from distributed queuing problems inherent in the access network. Our extensive simulation results indicate that TAF-PRMA achieves significant improvements in terms of voice capacity, delay, and fairness over most of the existing medium access control (MAC) schemes for integrated services.

  • PDF

포커스 / 다양한 기능 지원 통해 기업 경쟁력 제고 한몫

  • 한국데이터베이스진흥센터
    • 디지털콘텐츠
    • /
    • 9호통권100호
    • /
    • pp.90-91
    • /
    • 2001
  • 최근 들어 우리는 VoiceXML에 관해 많은 기업들이 관심을 가지는 경우를 볼 수 있다. 많은 기업들은 이 기술을 통해 얻을 수 있는 이익이 과연 무엇인지 의문을 가지고 있는 것도 사실이다. 기존 기업들은 대부분이 자동 주문과 주문 추적 등 상거래 관리 기능을 담당하는 IVR 시스템과 웹 서버를 갖추고 있다. 만약 이러한 기업들이 VoiceXML을 사용하여 기존 IVR시스템을 재정비한다면 어떤 이익을 얻을 수 있을 것인가라는 질문에 대해 많은 VoiceXML업체들의 대답은 다음과 같다.

  • PDF

쉰목소리 완화를 위한 주파수 영역 음성 강조 필터 설계 (Voice Boosting Filter Design in Frequency Domain for Relief of Husky Voice)

  • 김현태;이상협
    • 한국멀티미디어학회논문지
    • /
    • 제19권12호
    • /
    • pp.1919-1926
    • /
    • 2016
  • The people who complain of pain due to voice causes such as vocal cord nodules is increasing year by year. If the voice is changed, it is possible to give to colleagues discomfort or inconvenience during conversation. In this paper, we propose a way to reduce discomfort by improving the husky voice during the conversation. A VBF (voice boosting filter) is firstly designed to improve the husky voices. This filter may further emphasize the formant frequency components than the frequency components around the formant frequency, because the value is relatively greater than the other frequency. And a fixed-point type DSP chipset, TMS320F2812 is applied to the system, the operating frequency is 150MHz. The system was implemented as a compact for use as a portable, its size is $2.5cm{\times}10cm$. Through the test using three husky voices with some type of statement, it was satisfactory in processing speed and sound quality improvement.