• Title/Summary/Keyword: Dialects

Search Result 79, Processing Time 0.024 seconds

Speech Rate Variation in Synchronous Speech (동시발화에 나타나는 발화 속도 변이 분석)

  • Kim, Miran;Nam, Hosung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.4
    • /
    • pp.19-27
    • /
    • 2012
  • When two speakers read a text together, the produced speech has been shown to reduce a high degree of variability (e.g., pause duration and placement, and speech rate). This paper provides a quantitative analysis of speech rate variation exhibited in synchronous speech by examining the global and local patterns in two dialects of Mandarin Chinese (Taiwan and Shanghai). We analyzed the speech data in terms of mean speech rate and the reference of "Just Noticeable difference (JND)" within a subject and across subjects. Our findings show that speakers show lower and less variable speech rates when they read a text synchronously than when they read alone. This global pattern is observed consistently across speakers and dialects maintaining the unique local variation patterns of speech rate for each dialect. We conclude that paired speakers lower their speech rates and decrease the variability in order to ensure the synchrony of their speech.

A Study of Deletion of the Cairene Arabic Glottal Stop /'/ (카이로 아랍어에서의 성문폐쇄음의 탈락에 대한 고찰)

  • Lee Gyu-Cheol
    • MALSORI
    • /
    • no.31_32
    • /
    • pp.83-95
    • /
    • 1996
  • The purpose of this paper is to study deletion of the Arabic glottal stop /'/ in Literary Arabic(LA) and Cairene Arabic(CA). Arabic has a diglossia structure - Literary and Colloquial Arabic. The former is the standard and written language, while the latter is the oral language used in dialects of many areas. Of the various Colloquial Arabic dialects the Cairene Arabic is the most influential and powerful dialect, hence it was chosen as the subject study. In this paper the followings are found: (1) The deletion of the Arabic glottal stop /'/ in CA is found much more frequently than that in LA (2) In a word the deletion is found more frequently in the middle position than in the initial or final position, in which /'/ is sometimes converted weak consonants /w/ or /y/.

  • PDF

A Study on Dialect Expression in Korean-Based Speech Recognition (한국어 기반 음성 인식에서 사투리 표현에 관한 연구)

  • Lee, Sin-hyup
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.333-335
    • /
    • 2022
  • The development of speech recognition processing technology has been applied and used in various video and streaming services along with STT and TTS technologies. However, there are high barriers to clear written expression due to the use of dialects and overlapping of stop words, exclamations, and similar words for voice recognition of actual conversation content. In this study, for ambiguous dialects in speech recognition, we propose a speech recognition technology that applies dialect key word dictionary processing method by category and dialect prosody as speech recognition network model properties.

  • PDF

Performance Comparison of Korean Dialect Classification Models Based on Acoustic Features

  • Kim, Young Kook;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.10
    • /
    • pp.37-43
    • /
    • 2021
  • Using the acoustic features of speech, important social and linguistic information about the speaker can be obtained, and one of the key features is the dialect. A speaker's use of a dialect is a major barrier to interaction with a computer. Dialects can be distinguished at various levels such as phonemes, syllables, words, phrases, and sentences, but it is difficult to distinguish dialects by identifying them one by one. Therefore, in this paper, we propose a lightweight Korean dialect classification model using only MFCC among the features of speech data. We study the optimal method to utilize MFCC features through Korean conversational voice data, and compare the classification performance of five Korean dialects in Gyeonggi/Seoul, Gangwon, Chungcheong, Jeolla, and Gyeongsang in eight machine learning and deep learning classification models. The performance of most classification models was improved by normalizing the MFCC, and the accuracy was improved by 1.07% and F1-score by 2.04% compared to the best performance of the classification model before normalizing the MFCC.

A Comparative Study of the Speech Signal Parameters for the Consonants of Pyongyang and Seoul Dialects - Focused on "ㅅ/ㅆ" (평양 지역어와 서울 지역어의 자음에 대한 음성신호 파라미터들의 비교 연구 - "ㅅ/ ㅆ"을 중심으로)

  • So, Shin-Ae;Lee, Kang-Hee;You, Kwang-Bock;Lim, Ha-Young
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.6
    • /
    • pp.927-937
    • /
    • 2018
  • In this paper the comparative study of the consonants of Pyongyang and Seoul dialects of Korean is performed from the perspective of the signal processing which can be regarded as the basis of engineering applications. Until today, the most of speech signal studies were primarily focused on the vowels which are playing important role in the language evolution. In any language, however, the number of consonants is greater than the number of vowels. Therefore, the research of consonants is also important. In this paper, with the vowel study of the Pyongyang dialect, which was conducted by phonological research and experimental phonetic methods, the consonant studies are processed based on an engineering operation. The alveolar consonant, which has demonstrated many differences in the phonetic value between Pyongyang and Seoul dialects, was used as the experimental data. The major parameters of the speech signal analysis - formant frequency, pitch, spectrogram - are measured. The phonetic values between the two dialects were compared with respect to /시/ and /씨/ of Korean language. This study can be used as the basis for the voice recognition and the voice synthesis in the future.

A trend of Open rounded back /a:/ distribution in American English (미국영어에서의 후설원순저모음의 사용분포와 확산추이)

  • Park, Choong-Yon
    • Lingua Humanitatis
    • /
    • v.8
    • /
    • pp.313-329
    • /
    • 2006
  • In some American English dialects, the vowels /a:/ or /c:/ have been replaced with /a:/, as in caught /ka:t/ dog /da:g/ that were pronounced in /kc:t/, and /dc:g/ previously. General American does not have /a:/ in its vowel system. But in East American and Western Pensylvania, cot and caught are homophones, /ka:t/, and similarly with other pairs such as collar vs. caller, stock vs. stalk, don vs. down, knotty vs. naughty. The use of /a/ or /c/ is quite unstable, and is a well-known diagnostic for distinguishing the northern speech area of the United States from the midland and southern area. For an increasing number of Americans, however, entirely lack the opposition between /a/ and /c/, merging to /a/, referring to 'both' of the vowels of lot. This paper investigates the use of /b:/ in American dialects and its relationship with /a:/, /c:/ in AE, and with short /a/ in RP. Examining the isoglosses of the use of /a:/ in various databases of the phonological atlas of North America, this paper discusses the use, position, and trend of merger to /a:/ from the vowels of /a:/ and /c:/ in Current American English.

  • PDF

Developing a Korean Standard Speech DB (한국인 표준 음성 DB 구축)

  • Shin, Jiyoung;Jang, Hyejin;Kang, Younmin;Kim, Kyung-Wha
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.139-150
    • /
    • 2015
  • The data accumulated in this database will be used to develop a speaker identification system. This may also be applied towards, but not limited to, fields of phonetic studies, sociolinguistics, and language pathology. We plan to supplement the large-scale speech corpus next year, in terms of research methodology and content, to better answer the needs of diverse fields. The purpose of this study is to develop a speech corpus for standard Korean speech. For the samples to viably represent the state of spoken Korean, demographic factors were considered to modulate a balanced spread of age, gender, and dialects. Nine separate regional dialects were categorized, and five age groups were established from individuals in their 20s to 60s. A speech-sample collection protocol was developed for the purpose of this study where each speaker performs five tasks: two reading tasks, two semi-spontaneous speech tasks, and one spontaneous speech task. This particular configuration of sample data collection accommodates gathering of rich and well-balanced speech-samples across various speech types, and is expected to improve the utility of the speech corpus developed in this study. Samples from 639 individuals were collected using the protocol. Speech samples were collected also from other sources, for a combined total of samples from 1,012 individuals.

Experimental Phonetic Study of Kyungsang and Cholla Dialect Using Power Spectrum and Laryngeal Fiberscope (파워스펙트럼 및 후두내시경을 이용한 방언 음성(方言 音聲)의 실험적 연구(實驗的 硏究): 경상방언 및 전라방언을 중심으로)

  • Kim, Hyun-Gi;Lee, Eung-Young;Hong, Ki-Hwan
    • Speech Sciences
    • /
    • v.9 no.2
    • /
    • pp.25-47
    • /
    • 2002
  • Human language activity in the information society has been developing the communication system between humans and machines. The aim of this study was to analyze dialectal speech in Korea. One hundred Kyungsang and one hundred Cholla informants participated in this study. A CSL and Flexible laryngeal fiberscope were used for analysis of the acoustic and glottal gestures of all the vowels and consonants. Test words were made on the picture cards and letter cards which contained each vowel and each consonant, respectively. The dialogue between the examiner and the informants was recorded in a question and answer manner. The acoustic results of two dialects were as follows: Kyungsang and Cholla informants showed neutralization between /e/ and /$\varepsilon$. However, the apertures of Kyungsang vowels /i, w, u, o/ were higher than those of Cholla vowels. The /wi/ and /$\varepsilon$/ of Kyungsang Diphthong vowels were shown as simple vowels /i/ and /$\varepsilon$/ in Cholla dialect. The VOT of Cholla dilaect was longer than that of Kyungsang dialect. The fricative frequence of Kyurlgsang dialect was about 1000Hz higher than that of Cholla dialect. The glottal widths on fiberscopic images showed that the consonant durations of Kyungsang and Cholla dialects were correlated all together with the acoustic duration on the spectrogram.

  • PDF

Developing a Korean standard speech DB (II) (한국인 표준 음성 DB 구축(II))

  • Shin, Jiyoung;Kim, KyungWha
    • Phonetics and Speech Sciences
    • /
    • v.9 no.2
    • /
    • pp.9-22
    • /
    • 2017
  • The purpose of this paper is to report the whole process of developing Korean Standard Speech Database (KSS DB). This project is supported by SPO (Supreme Prosecutors' Office) research grant for three years from 2014 to 2016. KSS DB is designed to provide speech data for acoustic-phonetic and phonological studies and speaker recognition system. For the samples to represent the spoken Korean, sociolinguistic factors, such as region (9 regional dialects), age (5 age groups over 20) and gender (male and female) were considered. The goal of the project is to collect over 3,000 male and female speakers of nine regional dialects and five age groups employing direct and indirect methods. Speech samples of 3,191 speakers (2,829 speakers and 362 speakers using direct and indirect methods, respectively) are collected and databased. KSS DB designs to collect read and spontaneous speech samples from each speaker carrying out 5 speech tasks: three (pseudo-)spontaneous speech tasks (producing prolonged simple vowels, 28 blanked sentences and spontaneous talk) and two read speech tasks (reading 55 phonetically and phonologically rich sentences and reading three short passages). KSS DB includes a 16-bit, 44.1kHz speech waveform file and a orthographic file for each speech task.

Research on the Bottom Boundary Line on the Southeast Area of the Chungcheongdo Dialect in Yeongdong (영동지역어내의 충청방언 남동부 하한선 연구)

  • Seong, Hee-Jae
    • Lingua Humanitatis
    • /
    • v.8
    • /
    • pp.265-289
    • /
    • 2006
  • The geographical characteristics of Yeongdong(永同) the southernmost part of the Chungcheongbukdo province, has attracted attention among the academic circle as one of the dialectal contact regions since it adjoins the Gyeongsang and Jeolla dialects. Unlike the local language in Mooju (Jellado dialect) adjacent to the Southwest part, the local language in Yeongdong is quite different from that of Kimcheon (Gyeongsang dialect). More specifically, it is noteworthy that the boundary line of the Gyeongsang dialect is found in this region, which is different from the administrative division. In other words, the local language in Yeongdong is divided into the Chungcheong dialect and the Gyeongsang dialect, and furthermore each dialect region still has the characteristics of the other region's dialect. For example, the phonological structure of Yeongdong Chungcheongdo dialect has very unique characteristics of the fudged dialect, which is seemingly influenced by the Gyeongsang dialect. The present study is to define the bottom boundary line of the southeast area of the Chungcheong dialect by identifying the boundary line between the Gyeongsang dialect and the Chungcheong dialect, and to clarify its specific sound system generated by the contact of these two dialects. For this, the author collected and analyzed data of the local language around Yeongdong and adjacent areas. It was found that Cheongwha-ri, Deokjin-ri, and Sanjeo-ri at Yeongsan-myeon, and Mugeunjeom, Sangga-ri, and Jungga-ri at Yeongdong-eup, among the regions that belongs to Chungcheong dialect within the local language of Yeongdong, show the characteristics of the Gyeongsang dialect. Accordingly, the western areas of these villages become the southeast boundary line of the Chungcheong dialect. Also, the unique phonological characteristics of the Yeongdong Chungcheong dialect is affected by the Gyeongsang dialect, among which "rhythms, y deletion, nasal phoneme deletion, and w deletion" appeared. It is thought to be the unique fudged dialectal phenomenon that appeared only in this region. The research result is expected to be of some help in finding out various aspects of dialectal contacts as well as clarifying the phonological features of the local language in Yeongdong, and thereby contributing to exact divisioning of the Chungcheong dialect.

  • PDF