• Title/Summary/Keyword: speech source


A Study on the Robust Sound Localization System Using Subband Filter Bank (서브밴드 필터 뱅크를 이용한 강인한 음원 추적시스템에 대한 연구)

  • 박규식;박재현;온승엽;오상헌
    • The Journal of the Acoustical Society of Korea / v.20 no.1 / pp.36-42 / 2001
  • This paper proposes a new sound localization algorithm that detects the bearing of a sound source in a closed office environment using a two-microphone array. The proposed Subband CPSP (Cross Power Spectrum Phase) algorithm extends the previously known CPSP method with a subband approach. It first splits the received microphone signals into subbands and then calculates the CPSP of each subband, which yields candidate source bearings. Subband CPSP can provide a more robust and reliable sound localization system because it confines the effects of environmental noise to individual subbands. To verify the performance of the proposed algorithm, a real-time simulation was conducted and the results were compared with the previous CPSP method. In the simulation, the proposed Subband CPSP outperformed the previous CPSP algorithm by more than 5% in average accuracy of sound source detection.

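
As a rough illustration of the full-band CPSP step that the abstract builds on, the sketch below estimates the inter-microphone delay from the phase of the cross power spectrum and converts it to a bearing. The function names, the far-field bearing formula, and the default speed of sound are assumptions made for this example and are not taken from the paper. The subband variant described above would apply the same delay estimate to each filter bank output, which is omitted here.

```python
import numpy as np

def cpsp_delay(x1, x2, max_lag):
    """Delay of x2 relative to x1, in samples, from the CPSP peak."""
    x1 = np.asarray(x1, dtype=float)
    x2 = np.asarray(x2, dtype=float)
    n = len(x1) + len(x2)                 # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n)
    X2 = np.fft.rfft(x2, n)
    cross = X2 * np.conj(X1)              # cross power spectrum
    cross /= np.abs(cross) + 1e-12        # keep only the phase
    cc = np.fft.irfft(cross, n)           # CPSP coefficients over all lags
    # Reorder so that index 0 corresponds to lag -max_lag.
    cc = np.concatenate((cc[-max_lag:], cc[:max_lag + 1]))
    return int(np.argmax(cc)) - max_lag

def bearing_deg(delay_samples, fs, mic_distance_m, speed_of_sound=343.0):
    """Far-field bearing in degrees (assumed model: sin(theta) = c * tau / d)."""
    s = speed_of_sound * delay_samples / (fs * mic_distance_m)
    return float(np.degrees(np.arcsin(np.clip(s, -1.0, 1.0))))
```

A positive result from `cpsp_delay` means the sound reached the second microphone later than the first. `max_lag` would normally be set to the largest physically possible delay, `fs * mic_distance_m / speed_of_sound` samples.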

Improvement of Packet Loss Concealment Algorithm by Utilizing Next Good Frame Info. (손실이후 프레임 정보에 의한 패킷손실은닉 알고리즘 개선)

  • Kim Jae-Hyun;Hahn Min-Soo
    • MALSORI / no.43 / pp.101-112 / 2002
  • In real-time packetized voice applications, missing packets are a major source of voice quality degradation. Packet loss concealment (PLC) algorithms are therefore needed to guarantee the QoS of VoIP. In this paper, we describe a packet loss concealment scheme that utilizes the next good frame following the lost packets. When this scheme is combined with other PLC algorithms, such as the G.711 pitch waveform replication recommended by ITU-T and LP-based PLC algorithms, additional voice quality improvement is obtained for consecutive packet losses longer than 60 msec.

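
The abstract does not spell out how the next good frame is used, so the following is only a generic sketch of the idea of concealing a lost frame from both of its neighbours. It cross-fades from the last good frame to the next good frame; the function name and the linear weighting are assumptions for illustration, not the authors' scheme.

```python
def conceal_lost_frame(prev_frame, next_frame):
    """Fill one lost frame by cross-fading between its two good neighbours."""
    n = len(prev_frame)
    if len(next_frame) != n:
        raise ValueError("frames must have equal length")
    out = []
    for i in range(n):
        # Weight of the next good frame rises linearly across the lost frame.
        w = (i + 1) / (n + 1)
        out.append((1.0 - w) * prev_frame[i] + w * next_frame[i])
    return out
```

Using the next good frame requires buffering one extra frame at the receiver, so a scheme of this kind trades a small amount of additional delay for a smoother transition out of the loss.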

Oral mucormycosis in an 18-month-old child: a rare case report with a literature review

  • Kalaskar, Ritesh Rambharos;Kalaskar, Ashita Ritesh;Ganvir, Sindhu
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons / v.42 no.2 / pp.105-110 / 2016
  • Oral mucormycosis is a fungal infection observed mainly in elderly immunocompromised patients. In rare instances, the disease occurs in healthy individuals and in patients below preschool age. Although this condition mainly involves the maxilla, it may also manifest in any part of the oral cavity, depending on the source of infection. Mucormycosis of the maxilla spreads rapidly, leading to necrosis of the palatal bone and palatal perforation. Such patients are usually rehabilitated using bone grafting or free flap surgeries. However, when surgeries are delayed, a palatal prosthesis is an interim treatment modality that can prevent nasal regurgitation and aspiration of food or fluids. Palatal prostheses also help with mastication, speech, and swallowing. The present report describes a rare case of oral mucormycosis involving the maxilla in an 18-month-old male that was managed with a palatal prosthesis.

Context-Awareness based Home Assistant using Open Source Home Py (오픈 소스 Home Py를 이용한 상황인식 홈 비서)

  • Lee, Se-Hoon;Kim, Ju-Yeon;Moon, Sung-Hyun;Lim, Su-Young;Lee, Yoon-Su
    • Proceedings of the Korean Society of Computer Information Conference / 2016.07a / pp.135-136 / 2016
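  • This paper presents a project built on the open-source Home Py that enables two-way communication between things and people through a conversational service over the Telegram Service, and that uses context-awareness services to make home system control more flexible. With existing systems, the Smart Home in which appliances are controlled from a smartphone has become a reality, but the difficulty of operating it inconveniences people with disabilities, the elderly, children, and pregnant women. To solve this problem, we propose a more intelligent smart home system that controls the things appropriate to each situation through context awareness.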


CONCERT HALL ACOUSTICS - Physics, Physiology and Psychology fusing Music and Hall - (콘서트홀 음향 - 음악과 홀을 융합시키는 물리학, 생리학, 심리학 -)

  • 안도요이찌
    • Proceedings of the Acoustical Society of Korea Conference / 1992.06a / pp.3-8 / 1992
  • The theory of subjective preference with temporal and spatial factors, which include the sound signals arriving at both ears, is described. Then, auditory evoked potentials that may relate to a primitive subjective response, namely subjective preference, are discussed. Based on these fundamental phenomena, a workable model of the human auditory-brain system is proposed. For example, important subjective attributes such as loudness, coloration, the threshold of perception of a reflection, and echo disturbance, as well as subjective preference in relation to the initial time delay gap between the direct sound and the first reflection and to the subsequent reverberation time, are well described by the autocorrelation function of source signals. Speech clarity, subjective diffuseness, and subjective preference are related to the magnitude of the inter-aural cross-correlation function (IACC). Even the cocktail party effect may be explained by spatialization of the human brain, i.e., independence of temporal and spatial factors.

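
To make the IACC quantity mentioned in the abstract concrete, the sketch below computes it as the maximum magnitude of the normalized cross-correlation between the two ear signals. The lag range of ±1 ms is the customary choice and is an assumption here, as are the function and parameter names; the abstract itself gives no formula.

```python
import math

def iacc(left, right, fs, max_delay_s=0.001):
    """Maximum |normalized cross-correlation| of two ear signals within +/- max_delay_s."""
    n = min(len(left), len(right))
    left, right = left[:n], right[:n]
    norm = math.sqrt(sum(v * v for v in left) * sum(v * v for v in right))
    if norm == 0.0:
        return 0.0
    max_lag = int(round(max_delay_s * fs))
    best = 0.0
    for lag in range(-max_lag, max_lag + 1):
        acc = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                acc += left[i] * right[j]
        best = max(best, abs(acc) / norm)
    return best
```

Identical ear signals give a value of 1, while signals that are unrelated within the lag window give a value near 0.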

Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes(USLAT) (비고정 구간 길이 음향 튜브를 이용한 성도 모델링)

  • Kim, Dong-Jun
    • The Transactions of The Korean Institute of Electrical Engineers / v.59 no.6 / pp.1126-1130 / 2010
  • Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. In linear prediction acoustic tube modeling, the vocal tract is modeled as a chain of cylinders of varying cross-sectional area. The most common implementation of this model assumes tube sections of equal length, so a large number of sections is needed to model complex vocal tract shapes. This paper proposes a new vocal tract model with unfixed section lengths, which uses a reduced lattice filter to model the vocal tract. The model transforms the lattice filter into a reduced structure and the Burg algorithm into a modified version. When the conventional and proposed models are implemented with the same order of linear prediction analysis, the proposed model produces more accurate results than the conventional one. To implement a system with a similar level of accuracy, it may be possible to reduce the number of stages in the lattice filter structure. The proposed model also produces a more similar vocal tract shape than the conventional one.
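
For readers unfamiliar with the conventional equal-length tube model that the abstract contrasts against, the sketch below converts lattice reflection coefficients into relative cross-sectional areas. Sign conventions for reflection coefficients differ between texts; this example assumes k_i = (A_{i+1} - A_i) / (A_{i+1} + A_i), and the function name is hypothetical. It does not implement the paper's unfixed-length model.

```python
def tube_areas(reflection_coeffs, first_area=1.0):
    """Relative tube-section areas from reflection coefficients.

    Assumed convention: k_i = (A_{i+1} - A_i) / (A_{i+1} + A_i),
    which gives A_{i+1} = A_i * (1 + k_i) / (1 - k_i).
    """
    areas = [first_area]
    for k in reflection_coeffs:
        if not -1.0 < k < 1.0:
            raise ValueError("reflection coefficients must lie in (-1, 1)")
        areas.append(areas[-1] * (1.0 + k) / (1.0 - k))
    return areas
```

A coefficient of zero leaves the area unchanged between adjacent sections, which is why a model with equal section lengths can spend many stages on stretches of the tract where the shape barely changes.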

Human-Robot Interaction in Real Environments by Audio-Visual Integration

  • Kim, Hyun-Don;Choi, Jong-Suk;Kim, Mun-Sang
    • International Journal of Control, Automation, and Systems / v.5 no.1 / pp.61-69 / 2007
  • In this paper, we developed not only a reliable sound localization system with three microphones that includes a VAD (Voice Activity Detection) component, but also a face tracking system using a vision camera. Moreover, we proposed a way to integrate the three systems in human-robot interaction to compensate for errors in the localization of a speaker and to effectively reject unnecessary speech or noise signals entering from undesired directions. To verify the performance of our system, we installed the proposed audio-visual system in a prototype robot called IROBAA (Intelligent ROBot for Active Audition) and demonstrated how to integrate the audio-visual system.

Channel Coding Design Combined with Source Coder for Mobile Communication Systems (이동통신시스템을 위한 소스 코더와 결합된 채널코딩 방법 연구)

  • 김종현;이인성;강석봉;이정구
    • Proceedings of the IEEK Conference / 1998.06a / pp.19-22 / 1998
  • In this study, an efficient channel coding method combined with CS-ACELP is proposed. The same convolutional coder and Viterbi decoder as in the CDMA mobile communication system are used as the channel coder. To make the best use of the limited channel coding redundancy, unequal error protection with a punctured convolutional coder is used for variable rate allocation. However, the overall code rate is given by 2. The performance of the proposed coder is analyzed and simulated in a Rayleigh fading channel. Experimental results show that the objective and subjective speech quality of the variable rate channel coding method is superior to that of the non-variable channel coding method.

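
The punctured convolutional coding mentioned in the abstract can be illustrated with a toy encoder. The sketch below uses a rate-1/2 code with constraint length 3 and generators 7 and 5 (octal), chosen only because it is small; it is not the CDMA coder used in the paper, and the puncturing pattern is likewise an assumption for this example.

```python
def conv_encode(bits, generators=(0b111, 0b101), constraint_len=3):
    """Rate-1/len(generators) convolutional encoder (toy generators, not CDMA's)."""
    mask = (1 << constraint_len) - 1
    state = 0
    out = []
    for b in bits:
        state = ((state << 1) | b) & mask          # shift the new bit into the register
        for g in generators:
            out.append(bin(state & g).count("1") % 2)  # parity of the tapped bits
    return out

def puncture(coded, pattern):
    """Drop coded bits wherever the repeating pattern holds a 0."""
    return [c for i, c in enumerate(coded) if pattern[i % len(pattern)]]
```

For unequal error protection, perceptually important speech parameters could be sent with a pattern of all ones (no puncturing), while less important bits use a pattern such as `[1, 1, 1, 0]`, which raises their code rate from 1/2 to 2/3.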

A review of the voice diagnosis studies in Oriental medicine (한의학에서 음성 진단의 현황과 전망에 관한 연구)

  • Cho, Shin-Woong;Park, Young-Bae;Park, Young-Jae
    • The Journal of the Society of Korean Medicine Diagnostics / v.12 no.2 / pp.18-26 / 2008
  • Purpose: To review studies on voice diagnosis in Oriental medicine. Method: The papers reviewed in this study were found through internet search engines. For Chinese studies, China National Knowledge Infrastructure (www.cnki.net) was the main source of information, and the keywords for voice diagnosis studies were "(語聲)", "(聲診)", and (TCM). Conclusions: In Oriental medicine, there are two ways to research voices. One is philological consideration together with subjective and experimental diagnosis and studies that relate the voice to the human bowels, as in traditional studies. The other is research using the Computerized Speech Lab (CSL) for differential diagnosis of Sasang constitution and disease.


A study on implementation digital programmable CNN with variable template memory (가변적 템플릿 메모리를 갖는 디지털 프로그래머블 CNN 구현에 관한 연구)

  • 윤유권;문성룡
    • Journal of the Korean Institute of Telematics and Electronics C / v.34C no.10 / pp.59-66 / 1997
  • Neural networks have been widely used in practical applications such as speech, image processing, and pattern recognition. Among neural network approaches based on voltage-controlled current sources, the key feature of the CNN is that each cell is locally connected only to its neighbors. Because the architecture of the interconnection elements between cells is very simple and space invariant, CNNs are suitable for VLSI implementation. In this paper, a processing element (PE) of a digital programmable CNN with variable template memory was implemented using a CMOS circuit. The CNN PE circuit was designed to control the gain in order to obtain optimal solutions at the CNN output. The operating performance of a 4×4 CNN circuit with a fixed template and with a variable template was analyzed through simulation with the HSPICE tool. The simulations verified that the proposed variable template method improves operating performance compared with the fixed template method.

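
As background for the template-driven behaviour described in the abstract, the sketch below simulates a small cellular neural network in software using the standard state equation dx/dt = -x + A*y + B*u + z with the piecewise-linear output y = 0.5(|x + 1| - |x - 1|). The forward-Euler integration, the zero boundary condition, and all names are assumptions for illustration; this is not the paper's CMOS design.

```python
def cnn_output(x):
    """Piecewise-linear cell output, saturating at -1 and +1."""
    return 0.5 * (abs(x + 1.0) - abs(x - 1.0))

def cnn_step(state, inputs, A, B, z, dt):
    """One forward-Euler step for every cell; A and B are 3x3 templates."""
    rows, cols = len(state), len(state[0])
    new_state = [row[:] for row in state]
    for r in range(rows):
        for c in range(cols):
            total = -state[r][c] + z
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    rr, cc = r + dr, c + dc
                    # Cells outside the grid contribute nothing (zero boundary).
                    if 0 <= rr < rows and 0 <= cc < cols:
                        total += A[dr + 1][dc + 1] * cnn_output(state[rr][cc])
                        total += B[dr + 1][dc + 1] * inputs[rr][cc]
            new_state[r][c] = state[r][c] + dt * total
    return new_state

def run_cnn(inputs, A, B, z=0.0, dt=0.1, steps=200):
    """Iterate from a zero initial state and return the final cell states."""
    state = [[0.0] * len(inputs[0]) for _ in inputs]
    for _ in range(steps):
        state = cnn_step(state, inputs, A, B, z, dt)
    return state
```

Because the templates `A` and `B` are ordinary arguments, the same grid can be rerun with different templates, which is the software analogue of swapping the contents of a template memory.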