• 제목/요약/키워드: Speech Enhancement

검색결과 340건 처리시간 0.026초

Impact of Voice Activity Detection on Channel Allocation in Cellular Networks

  • Limsaksri, Wichan;Thipchaksurat, Sakchai;Varakulsiripunth, Ruttikorn
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.1067-1071
    • /
    • 2004
  • In this paper, the performance enhancement algorithm of channel allocation for voice and data transmission in cellular networks is proposed. The voice activity detection has been applied to dynamic channel allocation procedure to detect and separate the silence and speech among conversation periods. Hence a data user can use the silent period of an active voice channel to transmit its information. To control the selecting of channel allocation policies, the information of number of data in transmission waiting queue has been determined in order to accept the performance measurement. In the simulation results, the improvement of the performance shows via the quality of services, which are an average delay in queue, a blocking probability, and an impact of the proposed scheme is presented in the system.

  • PDF

텔레메틱스 기반의 통화음질향상을 위한 잡음제거 알고리즘의 성능비교 (Performance Comparison of Noise Reduction Algorithms for Enhancing Voice Quality based on Telematics)

  • 김형국;최홍재
    • 한국ITS학회 논문지
    • /
    • 제11권1호
    • /
    • pp.86-91
    • /
    • 2012
  • 다양한 잡음환경에 노출되는 텔레메틱스 기반의 음성 통화 시스템에서 고품질의 통화 품질을 제공하기 위해서는 저연산량을 가지며 효과적으로 배경 잡음을 제거할 수 있는 잡음제거 알고리즘이 요구된다. 본 논문에서는 Mel-Filter 기반의 잡음제거 알고리즘을 제안하며, 제안된 알고리즘을 기존 잡음제거 알고리즘들과 비교하여 설명한다. 자동차 잡음과 배블 잡음 환경에서 잡음제거 알고리즘의 성능 측정 결과, 제안된 Mel-Filter 기반의 잡음제거 알고리즘이 기존 잡음제거 알고리즘들에 비해 비슷한 PESQ 성능에 적은 연산량을 가지는 장점을 가지고 있으며, 제안된 잡음제거 알고리즘이 텔레메틱스 단말기에서 효과적으로 잡음을 제거할 수 있음을 입증하였다.

Effects of Instructional Intervention in Low-Level College Students' Learning of Request Acts

  • Yang, Eun-Mi
    • 영어어문교육
    • /
    • 제12권2호
    • /
    • pp.215-235
    • /
    • 2006
  • This paper explores the effects of two different methods of instruction for 106 low-level Korean learners of English at a college in learning request expressions. Both of the methods contained the focus-on-form and function characteristics, while the degree of explicitness for input enhancement was differentiated. Abundant email samples written by English native speakers for the input were provided and email writing practice for the output was proceeded for both groups of the students in the treatment sessions. The numbers of target forms used in pretest and posttest results were compared quantitatively: The tests included email writing and open-ended Discourse Completion Test (DCT). The results indicated that the target pragmatic features were slightly better learned under the condition of relatively high degree of explicit instruction with metapragmatic information, even though the difference was statistically insignificant. In addition, the students' use of request strategies both in email and DCT was affected positively by the treatment with email input and output. That is, the students applied the request strategies they learned through email into their oral production (open-ended DCT) as well as their email writing. Further study on the output effect of target features in advancing pragmatic competence is suggested.

  • PDF

치과기공의 악안면 보철분야 도입을 위한 이론적 고찰 (A study of introduction for Maxillofacial prosthesis in Dental Technology)

  • 이희경
    • 대한치과기공학회지
    • /
    • 제29권2호
    • /
    • pp.105-117
    • /
    • 2007
  • As a dental technician, the aim of the present study on maxillofacial prosthesis was to research its relation with dental technology and further development aspects by looking into its history, kinds, production materials and process. Dental technicians are to expect a great potential to work as maxillofacial prosthetist if having an interest in education of maxillofacial prosthesis field, and developing and operating the education process by expanding the range of dental technology. This article is to present overall history of maxillofacial prosthesis and some background information on the materials which have been used from the past. The maxillofacial field plays essential functions of mastication and speech, as well as performs appearance, which evokes good or bad feelings as an instant and instinctive response. The use of maxillofacial prostheses is not merely the replacement of a missing part of the face, resulted from injuries, but a rehabilitation process to help individuals come back to society. Rehabilitation includes both patient's physical and psychological recovery, such as self-esteem and selfconfidence. There has been a rapid development in application potentials of maxillofacial prosthesis technology which include implant, which can penetrate skin, and new materials. In order to produce maxillofacial prosthesis, general procedures of maxillofacial laboratory work should be understood first. Maxillofacial prosthesis and the dental prosthesis have many similarities in its academic perspective and originality. Maxillofacial prosthesis should be added into the curriculum for dental technology to achieve co-enhancement of the two fields.

  • PDF

하이드로폰 송신 어레이를 이용한 수중 음향 통신 시스템의 성능 향상 (Performance Enhancement of Underwater Acoustic Communication System Using Hydrophone Transmit Array)

  • 이외형;손윤준;김기만
    • 한국음향학회지
    • /
    • 제21권7호
    • /
    • pp.606-613
    • /
    • 2002
  • 본 논문에서는 수중에서 송신 빔 형성기를 이용한 고속 데이터 전송 기법을 연구하였다. 또한 범용 디지털 신호처리 프로세서와 다수의 디지탈-아날로그 변환기를 이용한 시험용 송신단을 설계 및 구현하였으며, 구현된 시스템을 이용하여 수조에서 실험을 수행하여 그 성능을 분석하였다. 이때 실험 과정을 단순화하기 위하여 채널 코딩 및 등화기 (equalizer) 등과 같은 과정은 생략하였고, 간장 간단한 디지털 통신 변조 기법인 OOK(On-Off keying) 기법을 사용하였다. 실험 결과 5개의 하이드로폰 송신 어레이를 사용한 경우에 1개만 사용했을 때보다 오차율 10/sup -2/을 기준으로 전송 속도가 약 3배 향상되었으며, 실험에 사용된 수조에서 음성 신호 전송을 위해 400 bps 정도까지 가능함을 확인하였다.

An Adaptive Utterance Verification Framework Using Minimum Verification Error Training

  • Shin, Sung-Hwan;Jung, Ho-Young;Juang, Biing-Hwang
    • ETRI Journal
    • /
    • 제33권3호
    • /
    • pp.423-433
    • /
    • 2011
  • This paper introduces an adaptive and integrated utterance verification (UV) framework using minimum verification error (MVE) training as a new set of solutions suitable for real applications. UV is traditionally considered an add-on procedure to automatic speech recognition (ASR) and thus treated separately from the ASR system model design. This traditional two-stage approach often fails to cope with a wide range of variations, such as a new speaker or a new environment which is not matched with the original speaker population or the original acoustic environment that the ASR system is trained on. In this paper, we propose an integrated solution to enhance the overall UV system performance in such real applications. The integration is accomplished by adapting and merging the target model for UV with the acoustic model for ASR based on the common MVE principle at each iteration in the recognition stage. The proposed iterative procedure for UV model adaptation also involves revision of the data segmentation and the decoded hypotheses. Under this new framework, remarkable enhancement in not only recognition performance, but also verification performance has been obtained.

지연 추정 기능을 갖는 적응 마이크로폰 어레이 알고리즘 (Adaptive Microphone Array System with Self-Delay Estimator)

  • 정양원;강홍구;이충용;윤대희
    • 한국통신학회논문지
    • /
    • 제30권1C호
    • /
    • pp.54-60
    • /
    • 2005
  • 본 논문은 지연 추정 기능을 갖는 적응 마이크로폰 어레이 알고리즘을 제안한다. Generalized sidelobe canceller (GSC)의 적응 차단 행렬이 각 센서간의 상호 시간 지연을 추정할 수 있다는 것을 보임으로써, 제안한 시스템은 적응 차단 행렬을 목적 신호의 차단 뿐 아니라 각 센서의 시간 지연 추정을 위해 사용한다. 이로 인해, 제안한 시스템은 GSC 구조만을 사용하면서. 시간 지연 추정기를 외부의 전처리기로 사용하는 기존의 시스템과 같은 성능을 얻을 수 있다. 실제 환경에서의 실험 결과를 통해 제안한 시스템의 성능이 기존의 시스템과 유사함을 확인하였다.

확산필터뱅크를 전처리기로 사용한 한국어 단모음인식 (The Recognition of Korean Single vowels by Use of the Diffusion Filter Bank as a Pre-processor)

  • 허만탁;김재창
    • 한국음향학회지
    • /
    • 제16권1호
    • /
    • pp.81-87
    • /
    • 1997
  • 본 논문에서는 스펙트럼 포락선을 이용하여 음성을 인식하기 위한 새로운 전처리 방법을 제안한다. 이는 확산필터뱅크를 사용하여 스펙트럼 포락선을 추출하는 새로운 방법이다. 확산필터뱅크의 분석대역을 몇 개의 작은 대역으로 나눔으로써 확산회수를 줄였으며 차분회수를 늘임으로써 선택도를 높였다. 이 결과, 총처리시간을 대폭 줄였으며 스펙트럼의 변별력을 증가시켰다. 컴퓨터 시뮬레이션을 통하여 간단한 인식 알고리듬으로 실제 음성의 단모음 인식 실험을 해본 결과 3%의 인식율을 얻음으로써 확산필터뱅크가 많은 주파수 성분을 가진 음성의 주파수 분석을 이용하는 음성인식에 대단히 유효하다는 것을 확인하였다.

  • PDF

자발성 두개강내 저혈압성 두통 환자에서 치료 도중 발생한 경막하혈종 - 증례보고 - (A Case of Subdural Hematoma after Epidural Blood Patch in a Spontaneous Intracranial Hypotensive Patient - A case report -)

  • 김의석;한경림;김찬
    • The Korean Journal of Pain
    • /
    • 제20권2호
    • /
    • pp.235-239
    • /
    • 2007
  • Spontaneous intracranial hypotension (SIH) is believed to be a benign disease. However, numerous studies have reported serious complications related to SIH, including subdural hematoma. In this case report, a 54-year-old male patient visited the emergency room with orthostatic headache. A brain magnetic resonance imaging (MRI) study showed diffuse mild thickening and enhancement of pachymeninges, with a suspicious minimal amount of subdural fluid collected in the left posterior parietal area. His orthostatic headache showed no improvement with conservative treatment; but his pain was almost completely relieved after two trials of cervical epidural blood patch. On the 74th day after the onset of his pain, the patient showed a drowsy mental status and slurred speech when he visited the pain clinic. Brain computerized tomography indicated a left subdural hemorrhage, and he underwent emergency operation to drain the SDH. In conclusion, pain clinicians should pay attention to abrupt changes in mental status as well as continuous headache, for the early diagnosis of SDH in SIH patients.

Blind Audio Source Separation Based On High Exploration Particle Swarm Optimization

  • KHALFA, Ali;AMARDJIA, Nourredine;KENANE, Elhadi;CHIKOUCHE, Djamel;ATTIA, Abdelouahab
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권5호
    • /
    • pp.2574-2587
    • /
    • 2019
  • Blind Source Separation (BSS) is a technique used to separate supposed independent sources of signals from a given set of observations. In this paper, the High Exploration Particle Swarm Optimization (HEPSO) algorithm, which is an enhancement of the Particle Swarm Optimization (PSO) algorithm, has been used to separate a set of source signals. Compared to PSO algorithm, HEPSO algorithm depends on two additional operators. The first operator is based on the multi-crossover mechanism of the genetic algorithm while the second one relies on the bee colony mechanism. Both operators have been employed to update the velocity and the position of the particles respectively. Thus, they are used to find the optimal separating matrix. The proposed method enhances the overall efficiency of the standard PSO in terms of good exploration and performance. Based on many tests realized on speech and music signals supplied by the BSS demo, experimental results confirm the robustness and the accuracy of the introduced BSS technique.