• Title/Summary/Keyword: Speaker characteristics

Search Result 257, Processing Time 0.023 seconds

The Study on the Speaker Adaptation Using Speaker Characteristics of Phoneme (음소에 따른 화자특성을 이용한 화자적응방법에 관한 연구)

  • 채나영;황영수
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2003.06a
    • /
    • pp.6-9
    • /
    • 2003
  • In this paper, we studied on the difference of speaker adaptation according to the phoneme classification for Korean Speech recognition. In order to study of speech adaptation according to the weight of difference of phoneme as recognition unit, we used SCHMM as recognition system. And Speaker adaptation method used in this paper was MAPE(Maximum A Posteriori Probability Estimation), Linear Spectral Estimation. In order to evaluate the performance of these methods, we used 10 Korean isolated numbers as the experimental data. It is possible for the first and the second methods to be carried out unsupervised learning and used in on-line system. And the first method was shown performance improvement over the second method, and hybrid adaptation showed the better recognition results than those which performed each method. And the result of Speaker adaptation using the variable weight according to the phoneme had better than the result using fixed weight.

  • PDF

A Design and Algorithm Implementation of Waveguide for 3way Line Array Speaker (3way 라인어레이 스피커를 위한 웨이브가이드 알고리즘 구현 및 설계)

  • Hwang, Jee Won;Kim, ByunKon;Cho, Juphil
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.1
    • /
    • pp.1-7
    • /
    • 2020
  • Directivity control technology of sound system is a key technology for improving sound quality. Providing a line source rather than a point source in an acoustic system can reduce the effects of attenuation interference at long distances, thereby providing high quality sound. In particular, A line-array speaker system can be used to provide coherent, high-quality sound over long distances. However, high frequencies have shorter wavelengths, so the distance between the speakers of a line array system must be shorter, but there are physical limitations. In this paper, we designed a wave guide and installed it in the speaker's compression driver to solve this problem. We measured and tested various acoustic characteristics to verify the performance of the speaker. As a result, when the line array sound system is constructed using the developed speakers, it is possible to provide a line source in all areas including the treble range, thereby achieving the same effect as a single extended source and providing high quality sound up to far distances.

Acoustic Characteristics of Vowels in Korean Distant-Talking Speech (한국어 원거리 음성의 모음의 음향적 특성)

  • Lee Sook-hyang;Kim Sunhee
    • MALSORI
    • /
    • v.55
    • /
    • pp.61-76
    • /
    • 2005
  • This paper aims to analyze the acoustic effects of vowels produced in a distant-talking environment. The analysis was performed using a statistical method. The influence of gender and speakers on the variation was also examined. The speech data used in this study consist of 500 distant-talking words and 500 normal words of 10 speakers (5 males and 5 females). Acoustic features selected for the analysis were the duration, the formants (Fl and F2), the fundamental frequency and the total energy. The results showed that the duration, F0, F1 and the total energy increased in the distant-talking speech compared to normal speech; female speakers showed higher increase in all features except for the total energy and the fundamental frequency. In addition, speaker differences were observed.

  • PDF

A Study on the improvement of the audio acoustic characteristics by the condition of the duct design (덕트의 설계 조건에 따른 오디오 음향환경 개선에 관한 연구)

  • 김대근
    • Proceedings of the KIPE Conference
    • /
    • 2000.07a
    • /
    • pp.70-73
    • /
    • 2000
  • In this paper we conducted research about the speaker's acoustic characteristics by the condition of the duct. It is expanding the bass ton play frequency as interfere two frequencies each other which originate from the speaker's back and front side using duct as attach the duct of round shaped or square at the encloser. This is not making of bass ton range using interference,. The structure of the duct which using the experiment is round shape. And we confirmed that can expand the limit of bass ton play as compare the actual experimental wave that after simulation of play frequency range as lenth change

  • PDF

Acoustic Analysis and Design of a Direct-Radiator-Type Loudspeaker (직접방사형 스피커의 음향특성 해석및 설계)

  • 김준태;김정호;김진오
    • Journal of KSNVE
    • /
    • v.8 no.2
    • /
    • pp.274-282
    • /
    • 1998
  • A systematic procedure for designing a direct-radiator-type loudspeaker has been developed, based on the numerical vibro-acoustic analysis and the Taguchi method. The finite-element model of the speaker cone has been used to calculated the vibration response of the cone excited by the voice coil. The vibration displacement of the speaker cone has been converted into the vibration velocity and used as a boundary condition for the acoustic analysis. The acoustic frequency characteristics of the loudspeaker have been calculated by the boundary element method. The numerical results have been verified by the experiments carried out in an anechoic chamber. Some design parameters have been selected by using the Taguchi method, and the variations of the acoustic characteristics due to the changes of the parameter values have been examined using the numerical model.

  • PDF

Optimal Design of Acoustical Characteristics of Passenger Compartment (차실 음향 최적 설계에 관한 연구)

  • 김정수;강연준
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2003.11a
    • /
    • pp.183-188
    • /
    • 2003
  • This study is to make the fundamentals of sound quality evaluation in regard of acoustical characteristics of passenger compartment. The deviation of frequency response function level within audible frequency is evaluated at receiving point in the research of room acoustics. In this study, frequency response function is the one between speaker and driver's ear positions. The positions of driver and audio speakers are optimized by analysis of acoustic mode of acoustic cavity. The main reflection planes are determined by analysis sound ray path diffused at optimized speaker positions. Finally, designer selects acoustical material by analysis of absorption effect of acoustical materials on the main reflection planes in order to avoid to distortion and fluctuation of frequency response function..

  • PDF

Characteristics of Laryngeal-Diadochokinesis (L-DDK) in Nonfluent Speakers (비유창성 화자의 후두 교호운동 특성)

  • Han, Ji-Yeon;Lee, Ok-Bun;Park, Hee-Jun;Lim, Hye-Jin
    • Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.55-64
    • /
    • 2007
  • Laryngeal DDK involve with the rate, pattern, and regularity (periodicity) in opening and closing of vocal fold. This study was aimed at investigating the characteristics of laryngeal DDK between nonfluent and fluent speakers. One with an ataxic dysarthria (with cerebellar lesion) and the other with stuttering, and 13 normal speakers were evaluated. L-DDK were analyzed with MSP (motor speech profile, CSL 4400). Measures of DDK included: DDKavr, DDKcvp, DDKjit, DDKavp. An ataxic dysarthric speaker and a stutterer showed more reduced rate and aperiodic L-DDK (both adductory and abductory movement) than normal speakers. But the average L-DDK period (ms) in adductory movement in a speaker with stuttering showed more decreased than the other. Results from this study are preliminary. Nonetheless, results of L-DDK produced by nonfluent speakers suggested the possibility to have relation with slow rate of phonatory initiation and connected speech. In the future, perceptual studies are needed in conjuction with acoustic and speech production.

  • PDF

A Study on Korean and English Speaker Recognitions using the Fuzzy Theory (퍼지 이론을 이용한 한국어 및 영어 화자 인식에 관한 연구)

  • 김연숙;김희주;김경재
    • Journal of the Korea Society of Computer and Information
    • /
    • v.7 no.3
    • /
    • pp.49-55
    • /
    • 2002
  • This paper proposes speaker recognition algorithm which includes both the pitch parameter and the fuzzy. This study proposes a pitch detection method for the peak and valley pitch detection function by means of comparing spectra which utilizes the transform characteristics between time and frequency. It measures the similarity to the original spectrum while arbitrarily varying the period in the time domain. It heavily weights the error due to the changing characteristics of the phonemes, while it is strong against noise. In this paper, makes reference pattern using membership function and performs vocal track recognition of common character using fuzzy pattern matching in odor to include time variation width for non-linear utterance time.

  • PDF

A Study on Korean and Japanese Speaker Recognitions using the Fuzzy Theory (퍼지 이론을 이용한 한국어 및 일어 화자 인식에 관한 연구)

  • 김연숙;김창완
    • Journal of the Korea Society of Computer and Information
    • /
    • v.5 no.3
    • /
    • pp.51-57
    • /
    • 2000
  • This paper proposes speaker recognition algorithm which includes both the pitch and the fuzzy. This study proposes a pitch detection method for the peak and valley pitch detection function by means of comparing spectra which utilizes the transform characteristics between time and frequency. It measures the similarity to the original spectrum while arbitrarily varying the period in the time domain. It heavily weights the error due to the changing characteristics of the phonemes, while it is strong against noise. In this paper, makes reference pattern using membership function and performs vocal track recognition of common character using fuzzy pattern matching in order to include time variation width for non-linear utterance time.

  • PDF

Research on Influencing Factors of Purchasing Behavior of AI Speakers in China based on the UTAUT and TTF Model

  • Wenyan Chang;Jung Mann Lee
    • Journal of Information Technology Applications and Management
    • /
    • v.29 no.5
    • /
    • pp.13-25
    • /
    • 2022
  • The purpose of this study is to explore the factors that influence the purchase of AI speakers in China. We integrate the Unified Theory of Acceptance and Use of Technology (UTAUT) and Task-technology fit (TTF) model into one model and put forward assumptions. According to the characteristics of AI speakers, we selected 6 independent variables, such as Performance Expectation, Effort Expectation, Social Influence, Facilitating Conditions, Task and Technology-characteristics. The final impact on purchase behavior is evaluated through Task-technology fit and purchase intention. After counting 478 samples, through SPSS22.0 and AMOS analysis, hypotheses have been proved by strong experimental data, except facilitating conditions. These results also imply that improving the technical level of AI speakers and enhancing consumers' purchasing intention are the central line of marketing. Based on this, we put forward several suggestions to marketers, including strengthening the research and development of AI speaker technology, and building a circle of friends of AI speakers.