• Title/Abstract/Keywords: Speech discrimination

Search results: 157 items; processing time: 0.023 seconds

Design and Implementation of Speech Music Discrimination System per Block Unit on FM Radio Broadcast

  • 장현종;엄정권;임준식
    • Korean Institute of Intelligent Systems: Conference Proceedings (한국지능시스템학회:학술대회논문집) / Proceedings of the Korean Institute of Intelligent Systems 2007 Fall Conference / pp. 25-28 / 2007
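  • This paper proposes a system that classifies the audio signal of FM radio broadcasts as speech or music on a per-block basis. To build the system, we propose a range of feature parameters and classification algorithms. The feature parameters are those used in signal processing (Centroid, Rolloff, Flux, ZCR, Low Energy), speech recognition (LPC, MFCC), and music analysis (MPitch, Beat). The classification algorithms come from pattern recognition (GMM, KNN, BP) and fuzzy neural networks (ANFIS), and distances are computed with the Mahalanobis distance.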

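As a rough illustration of a few of the ingredients named in the abstract above, the following Python sketch computes the zero-crossing rate (ZCR) and a Low Energy feature and assigns a block to the class whose mean is nearest under the Mahalanobis distance. It is not the authors' system: the Low Energy definition (share of frames whose RMS is below half the block mean), the two-feature vector, the nearest-mean rule, and the class statistics in the usage lines are all assumptions made for illustration.

```python
import math

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def low_energy_ratio(frames):
    """Fraction of frames whose RMS is below half the mean RMS of the block
    (one common definition; assumed here, not taken from the paper)."""
    rms = [math.sqrt(sum(s * s for s in f) / len(f)) for f in frames]
    mean_rms = sum(rms) / len(rms)
    return sum(1 for r in rms if r < 0.5 * mean_rms) / len(rms)

def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance of a 2-D point from a class with the given
    mean and 2x2 covariance matrix (inverted in closed form)."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    inv = ((d / det, -b / det), (-c / det, a / det))
    dx = (x[0] - mean[0], x[1] - mean[1])
    q = (dx[0] * (inv[0][0] * dx[0] + inv[0][1] * dx[1])
         + dx[1] * (inv[1][0] * dx[0] + inv[1][1] * dx[1]))
    return math.sqrt(q)

def classify(x, class_stats):
    """Return the label whose (mean, covariance) is nearest to x."""
    return min(class_stats, key=lambda label: mahalanobis_2d(x, *class_stats[label]))

# Hypothetical class statistics over (ZCR, Low Energy); not from the paper.
STATS = {
    "speech": ((0.12, 0.55), ((0.002, 0.0), (0.0, 0.02))),
    "music": ((0.06, 0.20), ((0.001, 0.0), (0.0, 0.01))),
}
print(classify((0.11, 0.50), STATS))
```

A GMM, KNN, BP, or ANFIS classifier, as listed in the abstract, would replace the nearest-mean rule used here.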

Japanese Adults' Perceptual Categorization of Korean Three-way Distinction

  • 김지현;김정오
    • Korean Society for Cognitive Science: Conference Proceedings (한국인지과학회:학술대회논문집) / Korean Society for Cognitive Science 2005 Spring Conference / pp. 163-167 / 2005
  • Current theories of cross-language speech perception claim that patterns of perceptual assimilation of non-native segments to native categories predict relative difficulties in learning to perceive (and produce) non-native phones. Perceptual assimilation patterns by Japanese listeners of the three-way voicing distinction in Korean syllable-initial obstruent consonants were assessed directly. According to the Speech Learning Model (SLM) and the Perceptual Assimilation Model (PAM), the resulting perceptual assimilation pattern predicts relative difficulty in discriminating between lenis and aspirated consonants, and relative ease in discriminating fortis consonants. This study compared the effects of two different training conditions on Japanese adults' perceptual categorization of the Korean three-way distinction. In one condition, participants were trained to discriminate lenis and aspirated consonants, which were predicted to be problematic, whereas in another condition participants were trained with all three classes of 'learnability' did not seem to depend lawfully on the perceived cross-language similarity of Korean and Japanese consonants.

Phonological Awareness Ability of Students with Down Syndrome

  • 황보명
    • Speech Sciences (음성과학) / Vol. 15, No. 3 / pp. 79-94 / 2008
  • The purpose of this study was to compare the phonological awareness ability of students with Down syndrome (DS) and typically developing children (TD). The TD and DS groups were matched on reading ability (reading recognition). The subjects were 10 students with DS and 10 TD children, who were examined with a test of phonological awareness. The test was organized by phonological unit (word, syllable, phoneme) and task type (deletion, discrimination, blending). The results were as follows. The total phonological awareness score of the DS group was significantly lower than that of the TD group, and the DS group's scores by phonological unit and by task type were also significantly lower. However, both groups performed better on the deletion and blending tasks than on the discrimination task. The two groups showed different correlations between task types and phonological units. This means that the TD group performed better than the DS group on all task types and phonological units.

A Preliminary Study for Development of the Aphasia Screening Test

  • 김향희;이현정;김덕용;허지회;김용욱
    • Speech Sciences (음성과학) / Vol. 13, No. 2 / pp. 7-18 / 2006
  • The main purpose of an aphasia screening test is to differentiate aphasic from non-aphasic patients quickly and efficiently. As a preliminary study for developing a standardized aphasia screening test for Korean patients, we constructed a screening test consisting of items from the Paradise Korean version of the Western Aphasia Battery (P K-WAB). All test items were analyzed in order to extract items with optimal item discrimination and adequate item difficulty indices. From the results, we were able to select items from each subtest that gave optimal results in a discriminant function analysis of the aphasic and normal control groups. This item analysis is therefore expected to be useful in developing a Korean aphasia screening test.

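The item analysis mentioned in the abstract above can be illustrated with a short Python sketch. The abstract does not say which indices were used, so this assumes two textbook definitions: item difficulty as the proportion of examinees who answer the item correctly, and item discrimination as the difference in that proportion between the top-scoring and bottom-scoring groups. The function names, the 27% group size, and the response matrix are hypothetical.

```python
def item_difficulty(responses, item):
    """Proportion of examinees who answered the item correctly (0/1 scores)."""
    return sum(row[item] for row in responses) / len(responses)

def item_discrimination(responses, item, fraction=0.27):
    """Upper-lower discrimination index: proportion correct in the top group
    minus proportion correct in the bottom group, ranked by total score."""
    ranked = sorted(responses, key=sum, reverse=True)
    n = max(1, round(len(ranked) * fraction))
    upper, lower = ranked[:n], ranked[-n:]
    p_upper = sum(row[item] for row in upper) / n
    p_lower = sum(row[item] for row in lower) / n
    return p_upper - p_lower

# Hypothetical data: four examinees (rows) by three items (columns).
RESPONSES = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
for j in range(3):
    print(j, item_difficulty(RESPONSES, j), item_discrimination(RESPONSES, j, fraction=0.5))
```

Items with a discrimination index near zero separate strong and weak examinees poorly, which is the usual reason for dropping them from a screening instrument.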

Teaching Pronunciation Using Sound Visualization Technology to EFL Learners

  • Min, Su-Jung;Pak, Hubert H.
    • English Language & Literature Teaching (영어어문교육) / Vol. 13, No. 2 / pp. 129-153 / 2007
  • When English language teachers are deciding on their priorities for teaching pronunciation, it is imperative to know what kinds of differences and errors are most likely to interfere with communication, and what special problems speakers of particular first languages will have with English pronunciation. In other words, phoneme discrimination skill is an integral part of speech processing for EFL learners learning to converse in English. Training with a sound visualization technique can be effective in improving second language learners' perception and production of segmental and suprasegmental speech contrasts. This study assessed the efficacy of pronunciation training that provided visual feedback for EFL learners acquiring pitch and durational contrasts to produce and perceive English phonemic distinctions. The subjects' ability to produce and to perceive novel English words was tested in two contexts before and after training: words in isolation and words in sentences. In comparison with an untrained control group, trainees showed improved perceptual and productive performance, transferred their knowledge to new contexts, and maintained their improvement three months after training. These findings support the feasibility of learner-centered programs using a sound visualization technique for English pronunciation instruction.

A Study on the Simple Algorithm for Discrimination of Voiced Sounds

  • 장규철;우수영;박용규;유창동
    • The Journal of the Acoustical Society of Korea (한국음향학회지) / Vol. 21, No. 8 / pp. 727-734 / 2002
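  • This paper proposes a simple algorithm for detecting voiced and unvoiced segments. The proposed method uses three parameters: low-band energy and zero-crossing rate, which complement the periodicity characteristics of voiced and unvoiced speech, and pitch variation, which is used to judge the stability of the periodicity. Treating voiced/unvoiced segment detection as phoneme-level detection, we obtained detection rates for phoneme groups and for the phonemes within each group. In experiments using the TIMIT corpus as the database, the detection rate for voiced phonemes improved by about 13%.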
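
The pitch-variation parameter described in the abstract above can be sketched in Python as follows. The abstract does not say how pitch is estimated or how stability is judged, so this assumes an autocorrelation pitch estimator and a simple rule on the mean frame-to-frame change in pitch lag. The function names and thresholds are hypothetical.

```python
import math

def pitch_lag(frame, min_lag, max_lag):
    """Return (lag, score): the lag in [min_lag, max_lag] that maximizes the
    autocorrelation normalized by frame energy, or (None, 0.0) if no lag
    gives a positive score."""
    energy = sum(s * s for s in frame)
    if energy == 0:
        return None, 0.0
    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag)) / energy
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score

def pitch_variation(lags):
    """Mean absolute change in pitch lag between consecutive frames."""
    if len(lags) < 2:
        return 0.0
    steps = [abs(b - a) for a, b in zip(lags, lags[1:])]
    return sum(steps) / len(steps)

def looks_voiced(lags, scores, min_score=0.5, max_variation=2.0):
    """Hypothetical rule: every frame is clearly periodic and the pitch is stable."""
    if any(lag is None for lag in lags):
        return False
    return min(scores) >= min_score and pitch_variation(lags) <= max_variation

# Synthetic check: a sine wave with a period of 20 samples.
frame = [math.sin(2 * math.pi * i / 20) for i in range(200)]
print(pitch_lag(frame, 10, 30))
```

In a fuller detector, the low-band energy and zero-crossing rate named in the abstract would be combined with this stability check before a segment is labeled voiced.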

A Study on Acoustic Masking Effect by Frame-Based Formant Enhancement

  • 전유용;김규성;이상민
    • Korean Society of Medical and Biological Engineering: Journal of Biomedical Engineering Research (대한의용생체공학회:의공학회지) / Vol. 30, No. 6 / pp. 529-534 / 2009
  • One characteristic of the hearing impaired is that their frequency selectivity is poorer than that of people with normal hearing. To compensate for this, formant enhancement algorithms and spectral contrast enhancement algorithms have been developed. In some cases, however, these algorithms fail to improve the frequency selectivity of the hearing impaired. One reason is acoustic masking among the enhanced formants. In this study, we tried to enhance the formants based on the individual masking characteristic of each subject. The masking characteristic used was the minimum level difference (MLD) between the first and second formants at which acoustic masking occurred. If the level difference between the two formants in a frame was larger than the MLD, the gain of the first formant was decreased to reduce the acoustic masking among formants. In a speech discrimination test using formant-enhanced speech, the speech discrimination score (SDS) for speech with differently enhanced formants was significantly higher than the SDS for speech with equally enhanced formants. This means that suppressing acoustic masking among formants improves the frequency selectivity of the hearing impaired.
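
The per-frame rule described in the abstract above can be sketched in Python. The abstract states only that the first-formant gain is decreased when the level difference exceeds the MLD. This sketch assumes that the difference is the first-formant level minus the second-formant level in dB, and that the gain is lowered by exactly the excess over the MLD. Both assumptions, and the function names, are illustrative rather than the authors' method.

```python
def f1_gain_db(f1_level_db, f2_level_db, mld_db):
    """Gain in dB (zero or negative) for the first formant in one frame.
    Assumption: reduce F1 by exactly the amount by which the F1-F2 level
    difference exceeds the subject's MLD."""
    excess = (f1_level_db - f2_level_db) - mld_db
    return -excess if excess > 0 else 0.0

def apply_gain(amplitude, gain_db):
    """Scale a linear amplitude by a gain given in dB."""
    return amplitude * 10 ** (gain_db / 20)

# Hypothetical frame: F1 at 70 dB, F2 at 50 dB, subject MLD of 15 dB.
gain = f1_gain_db(70.0, 50.0, 15.0)
print(gain, apply_gain(1.0, gain))
```

Because the MLD is measured per subject, the same frame can receive different first-formant gains for different listeners.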

A Preliminary Study on Development Korean Version of the Modified Erickson Scale of Communication Attitudes(S-24)

  • 김효정;권도하
    • Speech Sciences (음성과학) / Vol. 12, No. 4 / pp. 227-236 / 2005
  • For the exact assessment and diagnosis of stuttering, not only speech disfluency but also attitudes toward stuttering have to be considered. However, clinical research on stuttering has tended to center on disfluency, and relatively little attention has been paid to communication attitudes. In this paper, we investigate whether the Modified Erickson Scale of Communication Attitudes (S-24) is applicable to Korean stutterers. The S-24 was administered to 27 adults with stuttering and 27 normal adults. Based on the item analysis of the S-24, four items that had low item discrimination coefficients and showed little difference between the stutterer group and the normal group were excluded from the scale. To test the validity of the reconstructed communication attitude scale, we estimated its internal consistency and carried out correlation analyses and discriminant analyses. We found that the reconstructed scale had high internal consistency (α = .8701), consisted of six components (explanatory power = 66.59% of total variation), correlated with the PSI at .439 and with the SESAS at -.527, and correctly classified 92.6% of stutterers and normal adults. Consequently, the reconstructed communication attitude scale is a useful scale for assessing the communication attitudes of stutterers in Korea.

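The internal consistency figure reported in the abstract above is presumably Cronbach's alpha, although the abstract does not name the coefficient. Under that assumption, the Python sketch below shows how alpha is computed from a respondents-by-items score matrix using the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The data are made up for illustration.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (one row per respondent,
    one column per item), using sample variances."""
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical scores: three respondents, two items.
SCORES = [
    [1, 2],
    [2, 1],
    [3, 3],
]
print(cronbach_alpha(SCORES))
```

Alpha approaches 1 when items rise and fall together across respondents, which is why dropping weakly discriminating items can raise it.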

A Study on Implementation of Real Time Voiced/Unvoiced/Silence Discrimination System

  • 방만원;최갑석
    • 대한전자공학회논문지 (Journal of the Korean Institute of Electronics Engineers) / Vol. 23, No. 4 / pp. 565-570 / 1986
  • In this paper, the implementation of a voiced/unvoiced/silence discrimination system is presented. The algorithm is based on the zero-crossing rate and the spectral energy distribution of speech. To measure the zero-crossing rate, a new frequency-to-voltage conversion type interval filter is used. Experimental results show that the proposed algorithm can remove the effect of impulse noise in voiced intervals.

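A software analogue of the decision described in the abstract above can be sketched in Python. The paper describes a hardware implementation whose details are not given here, so everything below is an assumption: the spectral energy distribution is summarized as the fraction of energy below a cutoff frequency (computed with a naive DFT), and the three-way rule and its thresholds are hypothetical.

```python
import cmath
import math

def low_band_energy_ratio(frame, sample_rate, cutoff_hz):
    """Fraction of spectral energy at or below cutoff_hz, using a naive DFT
    over the non-negative frequency bins."""
    n = len(frame)
    total = low = 0.0
    for k in range(n // 2 + 1):
        coeff = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        power = abs(coeff) ** 2
        total += power
        if k * sample_rate / n <= cutoff_hz:
            low += power
    return low / total if total > 0 else 0.0

def classify_frame(energy, zcr, low_ratio,
                   silence_energy=1e-4, zcr_max=0.25, low_ratio_min=0.6):
    """Hypothetical rule: very low energy is silence; low zero-crossing rate
    with energy concentrated in the low band is voiced; the rest is unvoiced."""
    if energy < silence_energy:
        return "silence"
    if zcr <= zcr_max and low_ratio >= low_ratio_min:
        return "voiced"
    return "unvoiced"

# Synthetic check at 8 kHz: a 100 Hz tone sits entirely below a 1 kHz cutoff.
tone = [math.sin(2 * math.pi * 100 * t / 8000) for t in range(80)]
print(low_band_energy_ratio(tone, 8000, 1000))
```

The zero-crossing rate is passed in as a number here; any estimator of it, including the interval-filter circuit described in the paper, could supply that value.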

Enhancement of Processing Capabilities of Hippocampus Lobe: A P300 Based Event Related Potential Study

  • Benet, Neelesh;Krishna, Rajalakshmi;Kumar, Vijay
    • Journal of Audiology & Otology (대한청각학회지) / Vol. 25, No. 3 / pp. 119-123 / 2021
  • Background and Objectives: The influence of music training on different areas of the brain has been extensively researched, but the underlying neurobehavioral mechanisms remain unknown. In the present study, the effects of training for more than three years in Carnatic music (an Indian form of music) on the discrimination ability of different areas of the brain were tested using P300 analysis at three electrode placement sites. Subjects and Methods: A total of 27 individuals, including 13 singers aged 16-30 years (mean±standard deviation, 23±3.2 years) and 14 non-singers aged 16-30 years (mean age, 24±2.9 years), participated in this study. The singers had 3-5 years of formal training experience in Carnatic music. Cortical activities in areas corresponding to attention, discrimination, and memory were tested using P300 analysis, and the tests were performed using the Intelligent Hearing System. Results: The mean P300 amplitude of the singers at the Fz electrode placement site (5.64±1.81) was significantly higher than that of the non-singers (3.85±1.60; t(25)=3.3, p<0.05). The amplitude at the Cz electrode placement site in singers (5.90±2.18) was significantly higher than that in non-singers (3.46±1.40; t(25)=3.3, p<0.05). The amplitude at the Pz electrode placement site in singers (4.94±1.89) was significantly higher than that in non-singers (3.57±1.50; t(25)=3.3, p<0.05). Among singers, the mean P300 amplitude was significantly higher in the Cz site than the other placement sites, and among non-singers, the mean P300 amplitude was significantly higher in the Fz site than the other placement sites, i.e., music training facilitated enhancement of the P300 amplitude at the Cz site. Conclusions: The findings of this study suggest that more than three years of training in Carnatic singing can enhance neural coding to discriminate subtle differences, leading to enhanced discrimination abilities of the brain, mainly in the generation site corresponding to Cz electrode placement.
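
For readers who want to see how a two-group comparison like the ones in the Results can be computed from summary statistics, the Python sketch below evaluates a pooled-variance (Student) two-sample t statistic. The abstract does not state which test was used or whether the ± values for amplitude are standard deviations. Both are assumed here, so the output need not coincide with the reported t(25)=3.3. The degrees of freedom, n1 + n2 - 2 = 25, do match the reported value.

```python
import math

def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student two-sample t statistic and degrees of freedom from group
    means, standard deviations, and sizes, assuming equal variances."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / se, df

# Fz site values from the abstract, treating the +/- figures as standard deviations.
t_stat, df = pooled_t(5.64, 1.81, 13, 3.85, 1.60, 14)
print(round(t_stat, 3), df)
```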