• Title/Summary/Keyword: Speech acoustics

Search Result 62, Processing Time 0.027 seconds

Harmonic Structure Features for Robust Speaker Diarization

  • Zhou, Yu;Suo, Hongbin;Li, Junfeng;Yan, Yonghong
    • ETRI Journal
    • /
    • v.34 no.4
    • /
    • pp.583-590
    • /
    • 2012
  • In this paper, we present a new approach for speaker diarization. First, we use the prosodic information calculated on the original speech to resynthesize the new speech data utilizing the spectrum modeling technique. The resynthesized data is modeled with sinusoids based on pitch, vibration amplitude, and phase bias. Then, we use the resynthesized speech data to extract cepstral features and integrate them with the cepstral features from original speech for speaker diarization. At last, we show how the two streams of cepstral features can be combined to improve the robustness of speaker diarization. Experiments carried out on the standardized datasets (the US National Institute of Standards and Technology Rich Transcription 04-S multiple distant microphone conditions) show a significant improvement in diarization error rate compared to the system based on only the feature stream from original speech.

The Effects of Air Conditioner Noise on Classroom Acoustics (교실 음향에 대한 에어컨 소음의 영향)

  • Kim, Su-Yeon;Jeon, Jin-Yong
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2005.05a
    • /
    • pp.176-179
    • /
    • 2005
  • A case-study in classroom acoustics was conducted and the effects of two types(system air conditioner and packaged air conditioner) of air conditioner were investigated. Acoustical measurements were made in two different classrooms. Each classroom has different acoustics showing sound quality of air conditioner. Mental concentration test was conducted to evaluate the effects of air conditioner noise with different sound presure level(dBA). Speech intelligibility test was also planed with adopting Korean phonetic balanced words.

  • PDF

A Study on the Performance of TDNN-Based Speech Recognizer with Network Parameters

  • Nam, Hojung;Kwon, Y.;Paek, Inchan;Lee, K.S.;Yang, Sung-Il
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.2E
    • /
    • pp.32-37
    • /
    • 1997
  • This paper proposes a isolated speech recognition method of Korean digits using a TDNN(Time Delay Neural Network) which is able to recognizc time-varying speech properties. We also make an investigation of effect on network parameter of TDNN ; hidden layers and time-delays. TDNNs in our experiments consist of 2 and 3 hidden layers and have several time-delays. From experiment result, TDNN structure which has 2 hidden-layers, gives a good result for speech recognition of Korean digits. Mis-recognition by time-delays can be improved by changing TDNN structures and mis-recognition separated from time-delays can be improved by changing input patterns.

  • PDF

A Study on the Room Acoustics in Churches (교회 건축물의 실내음향 특성에 관한 연구)

  • 주진수
    • Journal of KSNVE
    • /
    • v.9 no.4
    • /
    • pp.681-686
    • /
    • 1999
  • In a church, speech intelligibility is very important together with the reverberance for musical activities. In order to obtain the primary data of a acoustical design for churches records were refereed and churches were measured in Europe and Japan. And in the base of measurements, those were judged by subjective hearing test. As some results, it has been found that the room acoustics of churches were different in a country and the reverberation time was perferred two seconds for speech intelligibility. However, although personal deviations were admitted, more long echoes were preferred for the music.

  • PDF

Image Data Compression Using Laplacian Pyramid Processing and Vector Quantization (라플라시안 피라미드 프로세싱과 백터 양자화 방법을 이용한 영상 데이타 압축)

  • Park, G.H.;Cha, I.H.;Youn, D.H.
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1347-1351
    • /
    • 1987
  • This thesis aims at studying laplacian pyramid vector quantization which keeps a simple compression algorithm and stability against various kinds of image data. To this end, images are devied into two groups according to their statistical characteristics. At 0.860 bits/pixel and 0.360 bits/pixel respectively, laplacian pyramid vector quantization is compared to the existing spatial domain vector quantization and transform coding under the same condition in both objective and subjective value. The laplacian pyramid vector quantization is much more stable against the statistical characteristics of images than the existing vector quantization and transform coding.

  • PDF

A Study on the Sound Effect for Improving Customer's Speech Recognition in the TTS-based Shop Music Broadcasting Service (TTS를 이용한 매장음원방송에서 고객의 인지도 향상을 위한 음향효과 연구)

  • Kang, Sun-Mee;Kim, Hyun-Deuc;Chang, Moon-Soo
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.105-109
    • /
    • 2009
  • This thesis describes the method for well voice announcement using the TTS(Text-To-Speech) technology in the shop music broadcasting service. Offering a high quality TTS sound service for each shop requires a great expense. According to a report on the architectural acoustics the room acoustic indexes such as reverberation time and early decay time are closely connected with a subjective awareness about acoustics. By using the result the customers will be able to recognize better the voice announcement by applying sound effect to speech files made by TTS. The result of an aural comprehension examination has shown better about almost all of the parameters by applying reverb effect to TTS sound.

  • PDF

The Study on Asymmetry between Acoustics and Perception of the Temporal Cues of English Plosives (영어파열음 시구간신호의 음향과 지각 비대칭성 연구)

  • Kang Seok-Han
    • MALSORI
    • /
    • v.55
    • /
    • pp.15-31
    • /
    • 2005
  • This study tests the hypothesis that the voiced-voiceless distinction is influenced by the relationship between acoustics and perception. Production and perception tests are conducted with temporal cues in different environments(CV, VCV, VC). The result showed that acoustic cues indicating significant difference between voiceless/voiced plosives do not behave just as do in perception. The result also showed that there existed an asymmetry between acoustics and perception.

  • PDF

The Application of 1-Dimensional Diffusers in Classroom Acoustics (1차 단순 확산체를 적용한 교실음향설계)

  • Choi, Young-Ji
    • Journal of the Korean Institute of Educational Facilities
    • /
    • v.18 no.5
    • /
    • pp.3-11
    • /
    • 2011
  • In this study, the effect of treating 1-dimensional diffusers on the classroom acoustics was investigated to determine if the diffuser are beneficial for performing the preferred acoustical conditions for speech. A 1/10 scale model of a classroom was used to measure the acoustical parameters, T30, $C_{50}$, STI and SNR in that room. The room acoustical conditions were varied by treating diffusers either on the front or side walls of the classroom. When the diffusers were treated on the side walls around the student's areas, a shorter reverberation time at low frequencies was obtained and resulted in performing uniform reverberation times across the frequency bands. The $C_{50}$ values at mid- and high-frequencies were increased by treating the diffusers either on front or side wall surfaces. The highest STI and SNR values were obtained when the diffuser was treated on the front wall around the teacher's areas. It is found that diffusers are beneficial to increase the intelligibility of speech for the rear seats of the rooms.

  • PDF

The Invention of Reis Telephone and Its Problem of Speech Quality (라이스의 전화기 발명과 통화 음질의 문제)

  • Ku, Ja-Hyon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.6
    • /
    • pp.395-401
    • /
    • 2010
  • Since Philipp Reis succeeded in sending human voices through electric wires well ahead of Elisha Gray and A. G. Bell etc., he deserves to be acknowledged as the inventor of the telephone. Nevertheless, he did not enjoy any honor for his great invention while he was alive. Since he was working in a scientific community, his work was presented not as a patentable invention but as a scientific discovery. In addition, he used the intermittent electricity in accordance with the experimental tradition in European acoustics, occasioning the speech quality of his telephone to have a fatal shortcoming. On the contrary, Bell, who was a novice in electricity and acoustics, employed variable currents to transmit the sound signals, which guaranteed better speech qualities than Reis's.

Towards better acoustic conditions in school buildings in Korea-a need for Korean standard for classroom acoustics (국내 교육시설의 음향기준 제정의 필요성 제고)

  • Young-Ji Choi
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.2
    • /
    • pp.113-123
    • /
    • 2023
  • This paper describes the acoustical conditions of elementary school and high school classrooms as well as university classrooms in Korea and suggests a need for Korean acoustic standards and guidelines for classroom design. Current standards and guidelines of classroom acoustics in several countries were briefly introduced to understand their acoustical performance criteria for background noise levels and reverberation times, and noise isolation design requirements in various types of classrooms. The results of several acoustic survey of domestic classrooms in elementary school, high school, and university were described and compared to provide information of the acoustic characteristics of Korean school classrooms. The survey includes occupied and unoccupied data on the acoustical conditions, noise levels, and noise isolation performance in the classrooms. Acoustical parameter values for achieving 'good' speech intelligibility in active university classrooms were also presented.