• Title/Summary/Keyword: 음성활동 검출기

Search Result 6, Processing Time 0.021 seconds

Boll's Spectral Subtraction Algorithm by New Voice Activity Detection (새로운 음성 활동 검출법에 의한 Boll의 스펙트럼 차감 알고리즘)

  • 류종훈;김대경;박장식;손경식
    • Journal of Korea Multimedia Society
    • /
    • v.4 no.1
    • /
    • pp.46-55
    • /
    • 2001
  • In this paper, a new voice activity detection method estimating SNR of enhanced speech with extended spectral subtraction (ESS) is proposed. Voice activity detection is performed by putting an second Wiener filter behind an Wiener filter used in the ESS to estimate speech and noise power of output signal of first Wiener filter. The proposed voice activity detection method does not require many computational loads and performs well under severe input SNR. Boll's spectral substraction algorithm with proposed voice activity detection was compared to ESS under several noise environment having different time-frequency distributions. During speech and non-speech activity, performance of Boll's spectral substraction algorithm with proposed voice activity detection is superior to that of ESS.

  • PDF

Improvement of VAD Performance for the Reduction of the Bit Rate Under the Noise Environment in the G.723.1 (잡음 환경에서의 전송률 감소를 위한 G.723.1 음성활동 검출기 성능 개선에 관한 연구)

  • 김정진;장경아;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.42-47
    • /
    • 2001
  • This paper improves the performance of VAD (Voice Activity Detector) in G.723.1 Annex A 6.3kbps/5.3kbps dual rate speech coder, which is developed for Internet Phone and videoconferencing. The VAD decision is based on a three-level energy threshold. We evaluates for processing time, speech quality, and bit rate. The processing time is reduced due to the accuracy of VAD decision on the silence period. On subjective quality test there is almost no difference compared with the G.723.1. In order to measure the bit rate we count the active speech frame (VAD=1) and we can reduce more bit rate as silence periods are shown.

  • PDF

A Study on a Robust Voice Activity Detector Under the Noise Environment in the G,723.1 Vocoder (G.723.1 보코더에서 잡음환경에 강인한 음성활동구간 검출기에 관한 연구)

  • 이희원;장경아;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.2
    • /
    • pp.173-181
    • /
    • 2002
  • Generally the one of serious problems in Voice Activity Detection (VAD) is speech region detection in noise environment. Therefore, this paper propose the new method using energy, lsp varation. As a result of processing time and speech quality of the proposed algorithm, the processing time is reduced due to the accurate detection of inactive period, and there is almot no difference in the subjective quality test. As a result of bit rate, proposed algorithm measures the number of VAD=1 and the result shows predominant reduction of bit rate as SNR of noisy speech is low (about 5∼10 dB).

New Speech Enhancement Method using Psychoacoustic Criteria (심리 음향 기준을 이용한 새로운 음질 개선 방법)

  • 김대경;박장식;손경식
    • Journal of Korea Multimedia Society
    • /
    • v.4 no.1
    • /
    • pp.56-66
    • /
    • 2001
  • The spectral subtraction algorithm using a criterion based on the human perception has been recently developed. The speech processed with Virag's algorithm sounds more pleasant to a human listener than those obtained by the classical methods. However, Virag's algorithm requires a robust voice activity detector (VAD). In the ESS (extended spectral subtraction) algorithm without VAD, the residual noise becomes more noticeable as the SNR decrease. In this paper we propose a new speech enhancement method, the combination of Wiener filter and spectral subtraction based on noise masking characteristics in the human auditory system. There is no need of VAD because the noise can be successively updated even during speech activity using Wiener filter. The adjustment of the subtraction parameter based on the masking threshold makes the residual noise inaudible. The proposed method has been compared with conventional spectral subtraction algorithms. Objective and subjective evaluation of the proposed system is performed with several noise types having different time-frequency distributions. The application of objective measures, the study of the speech spectrograms, as well as subjective listening tests, confirm that the enhanced speech with proposed algorithm is more pleasant to a human listener.

  • PDF

Development of energy expenditure measurement device based on voice and body activity (음성과 활동량을 이용한 에너지 소모량 측정기기 개발)

  • Im, Jae Joong
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.6
    • /
    • pp.303-309
    • /
    • 2012
  • Energy expenditure values were estimated based on the voice signals and body activities. Voice signals and body activities were obtained using PVDF contact vibration sensor and 3-axis accelerometer, respectively. Vibration caused by voices, activity signals, and actual energy consumption were acquired using data acquisition system and gas analyzer. With the use of power values from the voice signals and weight as independent variables, R-square of 0.918 appeared to show the highest value. For activity outputs, use of signal vector magnitude, body mass index, height, and age as independent variables revealed to provide the highest correlation with actual energy expenditure. Estimation of energy expenditure based on voice and activity provides more accurate results than based on activity only.

A Study on the Radio Transmission of Bio-Signal for Tele-Medicine (원격진료를 위한 생체신호의 무선전송에 대한 연구)

  • 김정년;곽준혁;최조천;조학현
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.3
    • /
    • pp.379-385
    • /
    • 2002
  • Tele-medicine and emergency medical system are necessary for moving from an accidental point or far distance to a hospital and emergency treatment or home treatment before a hospital. Emergency treatment is extremely important in the case of death before arriving a hospital and deformed of disabled by medical treatment delay. A necessary element for this medical system is the emergency communication system. This system is on preparing for an ability of furnishing patient status to a corresponding health service by monitoring the patient at an ambulance of the accident place. This is the transportation of basic biological information of a patient to a medical center by wireless communication system and the corresponding hospital of medical center examine the patient by monitoring, then they can send emergency medical order to the patient for emergency treatment. The TRS is most efficient way of emergency medical communication system, which is currently used with popularity. In this paper studied simultaneously a way of detecting and transporting bio-logical signals, and monitoring of transporting data with communication of voice in the accident place of ambulance.