• Title/Summary/Keyword: dialect

Search Result 159, Processing Time 0.03 seconds

Performance Comparison of Korean Dialect Classification Models Based on Acoustic Features

  • Kim, Young Kook;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.10
    • /
    • pp.37-43
    • /
    • 2021
  • Using the acoustic features of speech, important social and linguistic information about the speaker can be obtained, and one of the key features is the dialect. A speaker's use of a dialect is a major barrier to interaction with a computer. Dialects can be distinguished at various levels such as phonemes, syllables, words, phrases, and sentences, but it is difficult to distinguish dialects by identifying them one by one. Therefore, in this paper, we propose a lightweight Korean dialect classification model using only MFCC among the features of speech data. We study the optimal method to utilize MFCC features through Korean conversational voice data, and compare the classification performance of five Korean dialects in Gyeonggi/Seoul, Gangwon, Chungcheong, Jeolla, and Gyeongsang in eight machine learning and deep learning classification models. The performance of most classification models was improved by normalizing the MFCC, and the accuracy was improved by 1.07% and F1-score by 2.04% compared to the best performance of the classification model before normalizing the MFCC.

Sociolinguistic variation of length in Seoul dialect (서울말 장단의 사회언어학적 변이에 관한 연구 - 연령별 변이를 중심으로 -)

  • Kim, Seon-Cheol;Kwon, Mi-Yeong;Hwang, Yeon-Sin
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.147-159
    • /
    • 2004
  • The aim of this paper is to show what are the sociolinguistic variables of length loss in Seoul dialect. 350 people were inquired to pronunce 40words. Among the informants, 152 were male, and198 were female. In terms of their age, 49 were twenties, 70 were thirties, 69 were forties, 71 were fifties, and 91 were above sixties. According to our statistics, 18 words show sociolinguistic variation by age, and sex was not a variable. So we can conclude that Seoul dialect is undergoing length loss by age at least. But we need to enlarge the number of words and informants and we also need to adopt other variables.

  • PDF

An Acoustic Study of the Pronunciation of English Pitch. Accents Uttered by Korean Speakers (한국인의 영어피치악센트 발음에 관한 연구)

  • Koo, Hee-San
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.223-236
    • /
    • 2003
  • The purpose of this experimental study is to investigate characteristics of English pitch accents uttered by Korean speakers. Six English sentences were uttered five times by fifteen male undergraduate and graduate students from three groups, Seoul, Yongnam and Honam dialect speakers. We compared the subjects' data with the data of a native speaker of English as model pronunciation of English pitch accents. Acoustic features(Fo, duration, amplitude) were measured from sound spectrograms made by the PC Works. Results showed that (1) acoustic features of English pitch accents are Fo and duration for the native speaker and Korean speakers altogether, (2) Seoul dialect speakers uttered English pitch accents more similarly to the English native speaker than the other dialect speakers and (3) Korean speakers generally have difficulties in pronouncing L* accents. It appears that Korean speakers have more problems in pronouncing L* accents than H* accents.

  • PDF

The acoustic realization of the Korean sibilant fricative contrast in Seoul and Daegu

  • Holliday, Jeffrey J.
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.67-74
    • /
    • 2012
  • The neutralization of /$s^h$/ and /$s^*$/ in Gyeongsang dialects is a culturally salient stereotype that has received relatively little attention in the phonetic literature. The current study is a more extensive acoustic comparison of the sibilant fricative productions of Seoul and Gyeongsang dialect speakers. The data presented here suggest that, at least for young Seoul and Daegu speakers, there are few inter-dialectal differences in sibilant fricative production. These conclusions are supported by the output of mixed effects logistic regression models that used aspiration duration, spectral mean of the frication noise, and H1-H2 of the following vowel to predict fricative type in each dialect. The clearest dialect difference was that Daegu speakers' /$s^h$/ and /$s^*$/ productions had overall shorter aspiration durations than those of Seoul speakers, suggesting the opposite of the traditional "/$s^*$/ produced as [$s^h$]" stereotype of Gyeongsang dialects. Further work is needed to investigate whether /$s^h/-/s^*$/ neutralization in Daegu is perceptual rather than acoustic in nature.

Extracting Speech Parameters for intonational Differences between the Seoul Dialect and the other Dialects of Korean (서울말과 방언사이의 억양차이 파라미터 추출)

  • Lee, Kang-Hee
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2016.01a
    • /
    • pp.211-212
    • /
    • 2016
  • 음성 인식 기술은 상당 기간 연구 개발이 계속적으로 되었는데 최근에 이르러 스마트 폰이 급속히 확산되면서 그 필요성, 즉, 고품질의 상용 서비스에 대한 욕구가 널리 확산되고 있다. 이런 환경에서도 사실 한국어는 상대적으로 소홀히 다뤄 질 수 있는 가능성이 매우 높다. 이러한 소외는 기술적인 문제로만 남겨지는 것이 아니라 언어라는 측면에서 문화와 직결된다. 이에 한국어 음성 인식 연구는 꼭 필요한 것이고 많은 부분 국가가 정책적으로 지원을 하는 것이 마땅하나 현 상황은 많이 미흡하나 아마도 곧 그 필요성이 대두 될 것이라 예상하며 그를 준비하는 연구로 특화된 분야, 즉, 표준어와 방언들 그리고 감정 표현 언어에 관한 연구를 한다.

  • PDF

-eullanjira Construction of the Southwestern Dialect in Korea (서남방언의 '-을란지라' 구문 연구)

  • KIM, Ji-eun
    • Korean Linguistics
    • /
    • v.74
    • /
    • pp.1-24
    • /
    • 2017
  • This paper investigated -eullanjira sentence as a kind of construction of the Southwestern dialect in Korea. Five informants were selected to form the main corpus of -eullanjira. Through analyzing the corpus, its semantic, syntactic and morphological characteristics were figured out. Firstly, a view of construction grammar was adopted to capture the semantic and syntactic characteristics of -eullanjira. The construction of -eullanjira was established as "Xdo Yeullanjira Z". Syntactically, -do was found to be a common auxiliary particle, which allowed nouns, adverbs, verbs and adjectives to appear at the position of X, while only verbs and adjectives could appear at the position of Y. Subject-honorific, causative and passive prefinal endings could coexist with Y, while tense and modal prefinal endings could not. Z was an embedded clause, which had the semantic feature of [-DOUBT], meaning 'it should be done undoubtedly'. The formation of -eullanjira was next examined both diachronically and synchronically. It was found there was a conjuntive ending of Middle Korean, corresponding -eullanjira, namely, -landai. Finally, -eullanjira was newly analyzed as [[-eulla-]+[-n-ji-ra]].

A Study on Dialect Expression in Korean-Based Speech Recognition (한국어 기반 음성 인식에서 사투리 표현에 관한 연구)

  • Lee, Sin-hyup
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.333-335
    • /
    • 2022
  • The development of speech recognition processing technology has been applied and used in various video and streaming services along with STT and TTS technologies. However, there are high barriers to clear written expression due to the use of dialects and overlapping of stop words, exclamations, and similar words for voice recognition of actual conversation content. In this study, for ambiguous dialects in speech recognition, we propose a speech recognition technology that applies dialect key word dictionary processing method by category and dialect prosody as speech recognition network model properties.

  • PDF

VOT comparison between Seoul and Kyungsang dialects (경상 방언과 서울 방언의 VOT 지속 시간에 대한 비교 연구)

  • Jo Min-ha;Shin Ji-young
    • MALSORI
    • /
    • no.46
    • /
    • pp.1-11
    • /
    • 2003
  • This study examines the acoustic characteristics of Korean stops of two dialects, Seoul and Kyungsang, focusing on VOT(Voice Onset Time). 8 speakers of these two dialects were asked to read 590 words which contain the stops of different places of articulation and phonation types. The results showed that overall the VOTs of Kyungsang dialect were shorter than those of Seoul dialect. This was more prominent in lenis stops than in fortis or aspirated stops. It was also shown that there were significant VOT overlapping differences between the two dialects.

  • PDF

Acoustic characteristics of Stops in Seoul and Daegu dialects (서울 방언과 대구 방언 파열음의 음향 특징)

  • Jo, Min-Ha;Shin, Ji-Young
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.139-142
    • /
    • 2004
  • This study examines the acoustic characteristics of Korean stops of two dialect, Seoul and Daegu, 20 speakers of these two dialects were asked to read 15 words containing the stops of different places of articulation and phonation types at initial. The stops in the two dialects show mainly two acoustic differences. Firstly, There was a difference in distinctive features for phonetic types in the two dialects. Secondly, lenis revel fortis`s characters in Daegu dialect.

  • PDF

On inclinational values and phonological representations of sentence-final rising tones of interrogatives in Seoul dialect (서울말 의문문에 나타나는 문미 상승억양의 기울기 값과 음운표시)

  • Kim Seon Cheol
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.548-553
    • /
    • 1996
  • The sentence-final rising tones of interrogatives in Seoul dialect are contrastive at lesat in two groups. One is H-type which ends in a lower pitch, the other HH-type ending in a higher one. But they are different in terms of inclinational value, also. And the latter is more adequate criterion than the former which would be available in an ideal condition.

  • PDF