Search | Korea Science

Acoustic Analysis and Melodization of Korean Intonation for Language Rehabilitation (언어재활을 위한 한국어의 음향적 분석과 선율화)

Choi, Jin Hee;Park Jeong Mi
- Journal of Music and Human Behavior
- /
- v.21 no.1
- /
- pp.49-68
- /
- 2024
This study aims to acoustically analyze Korean language characteristics and convert these findings into musical elements, providing foundational data for evidence-based music-language rehabilitation. We collected voice data from thirty men and thirty women aged 19-25, each providing six-syllable prosodic units composed of two accentual phrases, including both declarative and interrogative sentences. Analyzing this data with Praat, we extracted syllabic acoustic properties and conducted statistical analyses based on acoustic properties, sentence type, gender, and particle presence. Significant differences were found in syllable frequency and duration based on accentual phrases and prosodic units (p < .001), with interrogative showing higher frequencies and declaratives longer durations (p < .001). Female frequencies were significantly higher than males' (p < .001), with longer durations observed (p < .001). Particle syllables also showed significantly stronger intensities (p < .001). Finally, we presented melodies converted from these acoustic properties into musical scores based on pitch, duration, and accent. The insights from this analysis of six-syllable Korean sentences will guide further research on developing a system for melodizing large-scale Korean speech data, expected to be crucial in music-based language rehabilitation.
https://doi.org/10.21187/jmhb.2024.21.1.049 인용 PDF

Statistical Survey of Vocabulary in Korean Textbook for Elementary School 6th-Grade (초등학교 6학년 국어교과서의 어휘 통계조사)

Kim, Jong-Young;Kim, Cheol-Su
- The Journal of the Korea Contents Association
- /
- v.12 no.5
- /
- pp.515-524
- /
- 2012
This paper studied the statistics such as the total number of syllables, the kinds of syllables, the frequency of syllables, the number of eojeols(word phrases unique in Korean language), the kinds of eojeols, average length of eojeols, the frequency of eojeols and the parts of speech in four different Korean textbooks for 6th-grade students(6-1 Korean Reading, 6-1 Korean Speaking Listening Writing, 6-2 Korean Reading and 6-2 Korean Speaking Listening Writing). The results of the statistical survey are as follows: the number of Hangul syllables was 194,683; the kinds of syllables were 1,290; the average frequency of syllables was 150.9; the number of eojeol was 70,185; the kinds of eojeol were 22,647; the average frequency of eojeol was 3.1; the average length of eojeols was 2.8 syllables, the longest one consist of 10 syllables. In parts of speech, nouns are used more in the Korean Reading textbook, and verbs are used more in Korean Speaking Listening Writing.
https://doi.org/10.5392/JKCA.2012.12.05.515 인용 PDF KSCI

A Study on the Spectrum Variation of Korean Speech (한국어 음성의 스펙트럼 변화에 관한 연구)

Lee Sou-Kil;Song Jeong-Young
- Journal of Internet Computing and Services
- /
- v.6 no.6
- /
- pp.179-186
- /
- 2005
We can extract spectrum of the voices and analyze those, after employing features of frequency that voices have. In the spectrum of the voices monophthongs are thought to be stable, but when a consonant(s) meet a vowel(s) in a syllable or a word, there is a lot of changes. This becomes the biggest obstacle to phoneme speech recognition. In this study, using Mel Cepstrum and Mel Band that count Frequency Band and auditory information, we analyze the spectrums that each and every consonant and vowel has and the changes in the voices reftects auditory features and make it a system. Finally we are going to present the basis that can segment the voices by an unit of phoneme.
PDF

Comparison of Voice Characteristics Before and After High-Caffeine Intake (고카페인 섭취 전·후 음성 특성 비교)

Lee, Areum;Kim, Eunyun;Yoo, Hyunji;Choi, Yaelin
- Phonetics and Speech Sciences
- /
- v.7 no.4
- /
- pp.59-65
- /
- 2015
This study was conducted to identify the differences in voice characteristic variables before and after taking a certain amount of high-caffeine. Linear PCM-M10 Recorder (SONY) was used for the recorder and basic frequency of the voice (Fo), frequency fluctuation rate (jitter), amplitude fluctuation rate (shimmer) and Signal-to-Noise Ratio (SNR) were measured using TF-32(University of Wisconsin-Madison, USA). First, prolonged phonation analysis results of /ah/ by male subjects showed the shimmer values after taking high-caffeine increased statistically significantly(p<.05) compared with before the intake and SNR values significantly decreased. (p<.05). On the other hand, female subjects didn't show any statistically significant differences in all variables. Second, male subjects showed statistically significant increased shimmer values after the intake compared with before the intake at /ah/ of syllable 'na' and /ah/ in 'ra' in 'autumn' paragraph (p<.05), and jitter values significantly increased at /ah/ in 'ah' (p<.05). However, female subjects didn't show any statistically significant differences in all variables. Results of this study showed that high-caffeine intake more affects male subjects than female subjects. In male subjects, shimmer and SNR changed at vowel prolonged phonation, /ah/, and study results showed that shimmer and SNR in 'Autumn' paragraph /na/, /ra/ and jitter in /ah/ could be identified as the variables to show the voice change.
https://doi.org/10.13064/KSSS.2015.7.4.059 인용 PDF KSCI

Phonological processes of consonants from orthographic to pronounced words in the Buckeye Corpus

Yang, Byunggon
- Phonetics and Speech Sciences
- /
- v.11 no.4
- /
- pp.55-62
- /
- 2019
This paper investigates the phonological processes of consonants in pronounced words in the Buckeye Corpus and compares the frequency distribution of these processes to provide a clearer understanding of conversational English for linguists and teachers. Both orthographic and pronounced words were extracted from the transcribed label scripts of the Buckeye Corpus. Next, the phonological processes of consonants in the orthographic and pronounced labels were tabulated separately by onsets and codas, and a frequency distribution by consonant process types was examined. The results showed that the majority of the onset clusters were pronounced as the same sounds in the Buckeye Corpus. The participants in the corpus were presumed to speak semiformally. In addition, the onsets have fewer deletions than the codas, which might be related to the information weight of the syllable components. Moreover, there is a significant association and strong positive correlation between the phonological processes of the onsets and codas in men and women. This paper concludes that an analysis of phonological processes in spontaneous speech corpora can contribute to a practical understanding of spoken English. Further studies comparing the current phonological process data with those of other languages would be desirable to establish universal patterns in phonological processes.
https://doi.org/10.13064/KSSS.2019.11.4.055 인용 PDF KSCI

The Production and Perception of the Korean Stops by English Learners (영어권 화자의 국어 폐쇄음 발화와 지각)

Kim, Kee-Ho;Park, Yoon-Jin;Chun, Yun-Sil
- Speech Sciences
- /
- v.13 no.4
- /
- pp.51-67
- /
- 2006
This study examined the acoustic properties of initial stops in Korean, produced by Korean native speakers and English Korean learners. The productions of Korean native speakers were compared with those of beginners and advanced learners of Korean. Fundamental frequency(F0) and Voice Onset Time(VOT) were measured in condition of one or two syllable words, containing word-initial lenis, fortis, and aspirated stops. English Korean Learners showed that they produced stops with relatively shorter VOT and lower F0, compared with those of Korean native speakers. In case of the manner of articulation, English Korean learners have production difficulties in order of lenis stops, aspirated stops, and fortis stops. In regard to the place of articulation, English Korean learners showed production troubles in order of labial stops, velar stops, and alveolar stops. In the experiment of perception, it is hard for English Korean learners to distinguish stops of lenis and aspirated. Therefore, the results of production experiment were almost consistent with those of the perception experiment. Finally, according to both groups of proficiency, the results demonstrated that the advanced learners produce or perceive Korean stops easier than the beginners.
PDF

Phonological processes of vowels from orthographic to pronounced words in the Buckeye Corpus by sex and age groups

Yang, Byunggon
- Phonetics and Speech Sciences
- /
- v.10 no.2
- /
- pp.25-31
- /
- 2018
This paper investigated the phonological processes of monophthongs and diphthongs in the pronounced words present in the Buckeye Corpus and compared the frequency distribution of these processes by sex and age groups to provide a clearer understanding of spoken English to linguists and phoneticians. Both orthographic and pronounced words were extracted from the transcribed label scripts of the Buckeye Corpus using R. Next, the phonological processes of monophthongs and diphthongs in the orthographic and pronounced labels were tabulated using R scripts, and a frequency distribution by vowel process types, as well as sex and age groups, was created. The results revealed that 95% of the orthographic words contained the same number of syllables, whereas 5% had different numbers of vowels, thereby proving that speakers tend to preserve vowels in spontaneous speech. In addition, deletion processes were preferred in natural speech. Most vowel deletions occurred with an unstressed syllable. Chi-square tests were performed to calculate dependence in the distribution of phonological process types for male and female groups and young and old groups. The results showed a very strong correlation. This finding indicates that vowel processes occurred in approximately the same pattern in natural and spontaneous speech data regardless of sex and age, as well as whether or not the vowel processes were identical. Based on these results, the author concludes that an analysis of phonological processes in spontaneous speech corpora can greatly enhance practical understanding of spoken English.
https://doi.org/10.13064/KSSS.2018.10.2.025 인용 PDF KSCI

A Study on Objective Quality Assessment for Synthesized speech by Rule (규칙합성음의 객관적 품질평가에 관한 연구)

홍진우;김순협
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.10
- /
- pp.42-49
- /
- 1993
In this paper, we evaluate the quality of synthesized speech by rule using the LPC CD as a objective measure, and then compare the test result with the subjective one. Speech used for the test consists of 108 words which are selected by word construction method using Korean attribute and frequency distribution, synthesized by demi-syllable rule. By evaluating the quality of synthesized speech by reule objectively, we have tried to resolve the problems such as lots of evaluation time, expansion of test scale, and variables of analysis result arised by subjective measure. We have, also, proved the validity of the objective test using the LPC CD, by comparing intelligibility which is the index for the subjective quality evaluation of synthesized speech by rule with MOS. From this results, we can provide a guide for quality assessment that would be useful in the R&D of synthesis method and the commercial products using synthesized speech.
PDF

Syllable-Level Smoothing of Model Parameters for HMM-Based Mixed-Lingual Text-to-Speech (HMM 기반 혼용 언어 음성합성을 위한 모델 파라메터의 음절 경계에서의 평활화 기법)

Yang, Jong-Yeol;Kim, Hong-Kook
- Phonetics and Speech Sciences
- /
- v.2 no.1
- /
- pp.87-95
- /
- 2010
In this paper, we address issues associated with mixed-lingual text-to-speech based on context-dependent HMMs, where there are multiple sets of HMMs corresponding to each individual language. In particular, we propose smoothing techniques of synthesis parameters at the boundaries between different languages to obtain more natural quality of speech. In other words, mel-frequency cepstral coefficients (MFCCs) at the language boundaries are smoothed by applying several linear and nonlinear approximation techniques. It is shown from an informal listening test that synthesized speech smoothed by a modified version of linear least square approximation (MLLSA) and a quadratic interpolation (QI) method is preferred than that without using any smoothing technique.
PDF

Exclusion of Non-similar Candidates using Positional Accuracy based on Levenstein Distance from N-best Recognition Results of Isolated Word Recognition (레벤스타인 거리에 기초한 위치 정확도를 이용한 고립 단어 인식 결과의 비유사 후보 단어 제외)

Yun, Young-Sun;Kang, Jeom-Ja
- Phonetics and Speech Sciences
- /
- v.1 no.3
- /
- pp.109-115
- /
- 2009
Many isolated word recognition systems may generate non-similar words for recognition candidates because they use only acoustic information. In this paper, we investigate several techniques which can exclude non-similar words from N-best candidate words by applying Levenstein distance measure. At first, word distance method based on phone and syllable distances are considered. These methods use just Levenstein distance on phones or double Levenstein distance algorithm on syllables of candidates. Next, word similarity approaches are presented that they use characters' position information of word candidates. Each character's position is labeled to inserted, deleted, and correct position after alignment between source and target string. The word similarities are obtained from characters' positional probabilities which mean the frequency ratio of the same characters' observations on the position. From experimental results, we can find that the proposed methods are effective for removing non-similar words without loss of system performance from the N-best recognition candidates of the systems.
PDF

Search Result 91, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)