Search | Korea Science

On the Control of Energy Flow between the Connection Parts of Syllables for the Korean Multi-Syllabic Speech Synthesis in the Time Domain Using Mono-syllables as a Synthesis Unit (단음절 합성단위음을 사용한 시간영역에서의 한국어 다음절어 규칙합성을 위한 음절간 접속구간에서의 에너지 흐름 제어에 관한 연구)

강찬희;김윤석
- The Journal of Korean Institute of Communications and Information Sciences / v.24 no.9B / pp.1767-1774 / 1999
This paper synthesizes Korean multi-syllabic speech in the time domain using mono-syllables as the synthesis unit. Specifically, it controls the shape of the speech energy flow across the connection parts between syllables when mono-syllables are concatenated. To this end, the energy flow is controlled with prosody parameters extracted from speech waveforms in the time domain, and experimental results are presented in which the energy flow was controlled using concatenation rules induced from the shapes of Korean syllables at their connection parts. In the experiments, the discontinuities in energy flow produced at the connection parts by concatenating mono-syllables in the time domain were removed, and the quality and naturalness of the synthesized speech improved.
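The kind of energy control described in this abstract can be illustrated with a minimal sketch. It is not the authors' concatenation rules, which the abstract does not spell out. It only assumes that the RMS level of the last frame of one syllable and the first frame of the next should meet at a common value, and it applies a linear gain ramp on each side of the joint. The function names, the frame length, and the choice of the average RMS as the meeting level are all assumptions made for illustration.

```python
import math


def frame_rms(frame):
    """Root-mean-square level of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def join_syllables(left, right, frame_len):
    """Concatenate two syllable waveforms (lists of floats).

    Assumption: the edge frames are rescaled with a linear gain ramp so
    that both sides reach the same RMS-based level at the joint. Both
    inputs must hold at least `frame_len` samples.
    """
    rms_left = frame_rms(left[-frame_len:])
    rms_right = frame_rms(right[:frame_len])
    target = (rms_left + rms_right) / 2.0  # assumed meeting level

    gain_left = target / rms_left if rms_left > 0 else 1.0
    gain_right = target / rms_right if rms_right > 0 else 1.0

    out = list(left[:-frame_len])
    # Ramp the tail of the left syllable from gain 1.0 to gain_left.
    for n, x in enumerate(left[-frame_len:]):
        t = (n + 1) / frame_len
        out.append(x * (1.0 + t * (gain_left - 1.0)))
    # Ramp the head of the right syllable from gain_right back toward 1.0.
    for n, x in enumerate(right[:frame_len]):
        t = n / frame_len
        out.append(x * (gain_right + t * (1.0 - gain_right)))
    out.extend(right[frame_len:])
    return out


if __name__ == "__main__":
    loud = [1.0, -1.0] * 4   # toy "syllable" with high energy
    quiet = [0.2, -0.2] * 4  # toy "syllable" with low energy
    print(join_syllables(loud, quiet, frame_len=4))
```

With the toy inputs, the sample magnitudes on both sides of the joint meet at the same value instead of jumping from the loud level to the quiet one, which is the discontinuity the abstract says was removed.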

Rhythm in Chinese Music and Korean Music

전인평
- Proceedings of the KSPS conference / 1996.10a / pp.372-377 / 1996
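In Korea, rhythm is also called jangdan (장단). As the word implies, jangdan is for the most part an additive rhythm type built from long and short notes. This contrasts with the divisive rhythm of Western music. The rhythm of Korean music forms jangdan by grouping long and short notes. Thus a 3-beat pattern is 2+1, a 5-beat pattern is 3+2, an 8-beat pattern is 5+3, a 10-beat pattern is 6+4, and a 16-beat pattern is 11+5. The rhythm of Chinese music is generally syllabic, with one note per character. Although there are various verse forms, such as four-character lines, five-character quatrains, and seven-character quatrains, most are sung in four beats. Korean music is therefore rhythmically more complex than Chinese music. This is because the Korean language is rhythmically more complex than Chinese.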
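The groupings listed in this abstract can be written out as a small sketch. The code below assumes, purely for illustration, that each group begins with a marked beat. The names `JANGDAN_GROUPINGS` and `additive_pattern` are hypothetical and do not come from the paper.

```python
# Beat count -> long + short grouping, as listed in the abstract.
JANGDAN_GROUPINGS = {3: (2, 1), 5: (3, 2), 8: (5, 3), 10: (6, 4), 16: (11, 5)}


def additive_pattern(groups):
    """Return a beat grid with 1 at the start of each group and 0 elsewhere.

    Marking the first beat of every group is an assumption for display.
    """
    grid = []
    for length in groups:
        grid.extend([1] + [0] * (length - 1))
    return grid


if __name__ == "__main__":
    for beats, groups in JANGDAN_GROUPINGS.items():
        print(beats, groups, additive_pattern(groups))
```

Each grouping sums to its beat count, but the two parts are unequal in length, which is what distinguishes an additive pattern from an equal subdivision of the same span.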

A Study of an Author Notation System for Korean Libraries

Lee, Jae-Cheol
- KLA journal / v.11 no.12 / pp.13-15 / 1970
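This text is a summary of the paper the author presented at the "First Academic Conference of the Korean Library Science Society" (한국도서관학회), held at the Ewha Womans University Library on November 7, 1970. The paper originally consists of three independent papers, titled "A Study of Korean Author Notation Based on the Structural Theory of Tables", "On the Problem of Using Hangul as Notation in Author Notation: A Consideration Based on the Requirements of Notation and the Principle of Syllabic Notation", and "Principles of Composition and Rules of Use of the East-West Author Table (2nd, revised edition)", which were summarized as I, II, and III for presentation at the society's conference. As for publication in print, I is to appear in the Journal of the Korean Library Science Society, vol. 1, II in Yonsei University's "Inmun Kwahak" (Humanities), vol. 24 (December 1970), and III in this journal.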

Coarticulation and vowel reduction in the neutral tone of Beijing Mandarin

Lin Maocan
- Proceedings of the KSPS conference / 1996.10a / pp.207-207 / 1996
The neutral tone is one of the most important distinguishing features of Beijing Mandarin, but there are two completely different views on its linguistic function: a special tone (Xu, 1980) versus weak stress (Chao, 1968). In this paper, the acoustic manifestation of the neutral tone is explored to show that it is closely related to weak stress. 122 disyllabic words in which the second syllable carries the neutral tone, including 22 stress pairs, were uttered by a native male speaker of the Beijing dialect and analysed with a Kay Digital Sonagraph 5500-1. The results of the acoustic analysis are as follows: 1) The first two formants of the medial and the syllabic vowel move towards those of the central vowel by a greater magnitude in a syllable with the neutral tone than in a syllable with any of the four normal tones. Also, the vowel ending and the nasal codas /n/ and /ŋ/ in a syllable with the neutral tone tend to be deleted. 2) In syllables with the neutral tone, there is strong carryover coarticulation between the medial and syllabic vowel and the preceding unvoiced consonant. In general, the vowel is moved towards the position of the central vowel by a greater magnitude after a coronal consonant than after a labial or velar consonant. 3) In a syllable with the neutral tone, when and only when it precedes a syllable with tone-4, the high vowel following [f], [ts'], [s], [ts'], [s], [tc'] or [c] tends to be voiceless. 4) The acoustic results for the 22 stress pairs show that the duration of a syllable with the neutral tone is on average reduced to 55% of that of a syllable with one of the four normal tones, and the duration of the final in a syllable with the neutral tone is on average reduced to 45% of that of the final in a syllable with one of the four normal tones (Lin & Yan 1980). 5) The F0 contour of the neutral tone is highly dependent on the preceding normal tone (Lin & Yan 1993).
For a number of languages it has been found that the vowel space is reduced as the level of stress placed upon the vowel is reduced (Nord 1986). We therefore conclude that the syllable with the neutral tone is related to weak stress (Lin & Yan 1990). The neutral tone is not a special tone, because its F0 contour depends on the preceding normal tone.

An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus (벅아이 코퍼스에서의 젊은 성인 남성의 모음 포먼트 분석)

Yoon, Kyu-Chul;Noh, Hye-Uk
- Phonetics and Speech Sciences / v.4 no.2 / pp.41-49 / 2012
The purpose of this paper is to extract the vowel formants of the ten young male speakers from the Buckeye Corpus of Conversational Speech [1] and to analyze them in comparison to earlier works in terms of various phonetic factors that are expected to affect the realization of the formant distribution. The first two formant frequency values were automatically extracted with a Praat script along with such factors as the place of articulation, the content versus function word information, the syllabic stress information, the location in a word, the location in an utterance, the speech rate of the three consecutive words, and the word frequency in the corpus. The results indicated that the formant patterns from the corpus were very different from those of earlier works although the overall pattern was similar, and that the factors were strongly responsible for the realization of the two formants.
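A per-factor comparison of this kind can be sketched as a simple grouping of vowel tokens. This is not the authors' Praat script. The token fields (`vowel`, `stress`, `f1`, `f2`) are hypothetical, and the Hz values in the usage example are made-up placeholders, not measurements from the Buckeye Corpus.

```python
from collections import defaultdict
from statistics import mean


def mean_formants(tokens, factor):
    """Average F1 and F2 for every (vowel, factor value) combination.

    `tokens` is a list of dicts. `factor` names the dict key to group by,
    for example a stress label (field names are assumptions).
    """
    groups = defaultdict(list)
    for tok in tokens:
        groups[(tok["vowel"], tok[factor])].append((tok["f1"], tok["f2"]))
    return {
        key: (mean(f1 for f1, _ in vals), mean(f2 for _, f2 in vals))
        for key, vals in groups.items()
    }


if __name__ == "__main__":
    # Placeholder values for illustration only.
    tokens = [
        {"vowel": "iy", "stress": "stressed", "f1": 300, "f2": 2200},
        {"vowel": "iy", "stress": "stressed", "f1": 320, "f2": 2100},
        {"vowel": "iy", "stress": "unstressed", "f1": 350, "f2": 2000},
    ]
    for key, (f1, f2) in mean_formants(tokens, "stress").items():
        print(key, f1, f2)
```

Passing a different field name as `factor` would give the same comparison for any of the other factors the abstract lists, provided the tokens carry that field.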
https://doi.org/10.13064/KSSS.2012.4.2.041

Utilization of Syllabic Nuclei Location in Korean Speech Segmentation into Phonemic Units (음절핵의 위치정보를 이용한 우리말의 음소경계 추출)

신옥근
- The Journal of the Acoustical Society of Korea / v.19 no.5 / pp.13-19 / 2000
The blind segmentation method, which segments input speech data into recognition units without any prior knowledge, plays an important role in continuous speech recognition systems and corpus generation. As no prior knowledge is required, this method is rather simple to implement, but in general it performs worse than knowledge-based segmentation methods. In this paper, we introduce a method to improve the performance of blind segmentation of Korean continuous speech by postprocessing the segment boundaries obtained from the blind segmentation. In the preprocessing stage, the candidate boundaries are extracted by a clustering technique based on the GLR (generalized likelihood ratio) distance measure. In the postprocessing stage, the final phoneme boundaries are selected from the candidates by utilizing simple a priori knowledge of the syllabic structure of Korean, namely that the maximum number of phonemes between any two consecutive nuclei is limited. The experimental result was rather promising: the proposed method yields a 25% reduction in the insertion error rate compared with that of the blind segmentation alone.
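The postprocessing idea in this abstract can be sketched as a pruning step. The abstract does not give the exact selection rule, so the version below is an assumption: each candidate boundary carries a score (for instance its GLR distance), and between two consecutive syllabic nuclei only the highest-scoring candidates are kept. If at most `max_phonemes` phonemes can lie between two nuclei, at most `max_phonemes + 1` boundaries can lie there. All names are hypothetical.

```python
def select_boundaries(candidates, nuclei, max_phonemes):
    """Prune candidate boundaries using syllabic nucleus locations.

    candidates   -- list of (time, score) pairs from the blind segmentation
    nuclei       -- sorted list of nucleus times
    max_phonemes -- assumed upper bound on phonemes between two nuclei
    """
    limit = max_phonemes + 1  # boundaries that can fit between two nuclei
    kept = []
    for left, right in zip(nuclei, nuclei[1:]):
        inside = [c for c in candidates if left < c[0] < right]
        # Keep only the strongest candidates in this inter-nucleus span.
        kept.extend(sorted(inside, key=lambda c: c[1], reverse=True)[:limit])
    # Candidates before the first or after the last nucleus pass through.
    # Candidates that coincide with an interior nucleus are discarded.
    kept.extend(
        c for c in candidates
        if not nuclei or c[0] <= nuclei[0] or c[0] >= nuclei[-1]
    )
    return sorted(kept)


if __name__ == "__main__":
    candidates = [(0.05, 0.9), (0.12, 0.8), (0.15, 0.2),
                  (0.18, 0.7), (0.22, 0.6), (0.30, 0.5)]
    print(select_boundaries(candidates, nuclei=[0.10, 0.25], max_phonemes=2))
```

In the toy run, four candidates fall between the two nuclei but only three can be real boundaries, so the weakest one is dropped. Removing such surplus candidates is how a constraint of this kind can lower the insertion error rate.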

An Optimality-Based Analysis of Relative Positioning of Wh-related Prepositions in English

Han-gyoo, Khym
- International Journal of Advanced Culture Technology / v.10 no.4 / pp.576-582 / 2022
In this paper, we discuss the relative positioning of Wh-related English prepositions in a Wh-interrogative construction within Optimality Theory [1-2]. By employing two key constraints, *Prep-STR and Align, which were developed for the positioning of Wh-related prepositions in Romance languages such as French and Italian [3] and for the positioning of Wh-related prepositions in middle English prose from 1500 to 1900 [4-6], and by slightly modifying the constraint hierarchy *Prep-STR >> Align into *Prep-STR <<>> Align, Choi argues that his new theory can properly explain the unique behavior of English Wh-related prepositions, which can take two 'optional' operations, pied-piping and stranding, to find legitimate landing sites in a Wh-interrogative construction [7]. However, this new analysis reveals the following critical problems: (1) Unlike the 'light' English Wh-related prepositions, which can take two optional operations to reach legitimate landing sites in a Wh-interrogative construction, 'heavy' Wh-related English prepositions are not allowed these two options: they take only the one option of pied-piping. Thus, (2) his argumentation based on the existing constraints and the modified constraint hierarchy is neither general enough nor adequate to explain the relative positioning of all English Wh-related prepositions. To accommodate this exceptional syntactic property of the 'heavy' prepositions within Optimality Theory, we suggest a new constraint, *HPrep-STR, ranked at the highest position of the constraint hierarchy, to prevent a 'heavy' or multi-syllabic Wh-related English preposition from standing alone at the end of a sentence. The new final hierarchy of constraints that we suggest to explain the exceptional positioning of 'heavy' Wh-related prepositions together with the other 'light' Wh-related prepositions in the English Wh-interrogative construction is as follows: *HPrep-STR >> Align <<>> *Prep-STR.
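How the proposed hierarchy selects outputs can be sketched with a small Optimality Theory evaluator. Two things below are assumptions rather than statements from the paper: the tie `<<>>` is read as free ranking, so a candidate is optimal if it wins under at least one total ranking, and the violation marks are illustrative (stranding violates *Prep-STR, pied-piping violates Align, and stranding a heavy preposition also violates *HPrep-STR).

```python
from itertools import permutations, product

# *HPrep-STR >> Align <<>> *Prep-STR, written as strata (ties share a stratum).
HIERARCHY = [("*HPrep-STR",), ("Align", "*Prep-STR")]


def total_rankings(strata):
    """Expand a hierarchy with tied strata into every total ranking."""
    for combo in product(*(permutations(s) for s in strata)):
        yield [constraint for stratum in combo for constraint in stratum]


def optimal_candidates(candidates, strata):
    """Return the candidates that win under at least one total ranking.

    candidates -- {name: {constraint: number of violations}}
    """
    winners = set()
    for ranking in total_rankings(strata):
        def profile(name):
            # Violations listed from highest- to lowest-ranked constraint.
            return tuple(candidates[name].get(c, 0) for c in ranking)

        best = min(profile(name) for name in candidates)
        winners.update(name for name in candidates if profile(name) == best)
    return winners


# Assumed violation marks, for illustration only.
LIGHT = {"pied-piping": {"Align": 1}, "stranding": {"*Prep-STR": 1}}
HEAVY = {"pied-piping": {"Align": 1},
         "stranding": {"*Prep-STR": 1, "*HPrep-STR": 1}}

if __name__ == "__main__":
    print(sorted(optimal_candidates(LIGHT, HIERARCHY)))
    print(sorted(optimal_candidates(HEAVY, HIERARCHY)))
```

Under these assumptions a light preposition admits both pied-piping and stranding, while a heavy preposition admits only pied-piping, because the top-ranked *HPrep-STR rules out stranding before the tied constraints are consulted.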
https://doi.org/10.17703/IJACT.2022.10.4.576

A Study of Segmental and Syllabic Intervals of Canonical Babbling and Early Speech

Chen, Xiaoxiang;Xiao, Yunnan
- Cross-Cultural Studies / v.28 / pp.115-139 / 2012
The interval, or duration, of segments, syllables, words and phrases is an important acoustic feature that influences the naturalness of speech. A number of cross-sectional studies of the acoustic characteristics of children's speech development have found that the intervals of segments, syllables, words and phrases tend to change with age. One hypothesis assumed that decreases in intervals would be greater when children were younger and smaller when they were older (Thelen, 1991); it has been supported by a number of cross-sectional studies (Tingley & Allen, 1975; Kent & Forner, 1980; Chermak & Schneiderman, 1986). The other hypothesis predicted that decreases in intervals would be smaller when children were younger and greater when they were older (Smith, Kenney & Hussain, 1996). Researchers have thus arrived at conflicting postulations and inconsistent results about the trends in the intervals of segments, syllables, words and phrases, leaving the issue unresolved. Most acoustic investigations of children's speech production have used cross-sectional designs, which involve studying several groups of children; so far there are only a few longitudinal studies. The issue needs more longitudinal investigation; moreover, acoustic measures of the intervals of child speech are hardly available. All former studies focus on the word stages and exclude the babbling stages, especially the canonical babbling stage, but we need to find out when concrete changes in intervals begin to occur and what causes them. Therefore, we conducted an acoustic study of the interval characteristics of segments and words in canonical babble (CB) and early speech in an infant aged 0;9 to 2;4 acquiring Mandarin Chinese. The current research addresses the following two questions:
1. Are decreases in intervals greater when children are younger and smaller when they are older, or vice versa? 2. Do the acoustic features of intervals in child speech drift in the direction of the language the child is exposed to? The female infant, whose L1 was Southern Mandarin and who lived in Changsha, was audio- and video-taped at her home by the investigators for about one hour, almost weekly, from age 0;9 to 2;4 under natural observation. The recordings were digitized, and parts of the digitized material were labeled. All repetitions were excluded. The utterances were extracted from 44 sessions ranging from 30 minutes to one hour and were divided into segments as well as syllable-sized units. The age stages are 0;9-1;0, 1;1-1;5, 1;6-2;0 and 2;1-2;4. The subject was a normally developing monolingual child of well-educated parents. Segments and syllables from the 44 sessions, which span the transition from babble to speech, were transcribed in narrow IPA and coded for analysis. Babble was coded from age 0;9 to 1;0 and words from 1;0 to 2;4; the data were checked by two professionally trained persons who majored in phonetics. The present investigation is a longitudinal analysis of some temporal characteristics of the child's speech during these four age periods. The answer to Research Question 1 is that our results agree with neither hypothesis, that is, neither with greater decreases at younger ages (Thelen, 1991) nor with greater decreases at older ages (Smith, Kenney & Hussain, 1996).
On the whole, segmental and syllabic duration tends to decrease with age, but the changes are not drastic or abrupt. For example, /a/ after /k/ in Table 1 shows a greater decrease during 1;1-1;5, while /a/ after /p/, /t/ and /w/ shows a greater decrease during 2;1-2;4. Likewise, /ka/ shows a greater decrease during 1;1-1;5, while /ta/ and /na/ show a greater decrease during 2;1-2;4. Across the age periods, the intervals fluctuate considerably. The answer to Research Question 2 is yes. The babbling stage is a period in which the acoustic features of children's intervals of segments, syllables, words and phrases shift in the direction of the language to be learned; babbling and the emergence of children's speech are greatly influenced by the ambient language. The phonetic changes in duration continue until as late as 10-12 years of age before reaching adult-like levels. With increasing exposure to the ambient language, the variation becomes smaller and smaller until children attain adult-like competence. An analysis with SPSS 15.0 shows that the decrease in segmental and syllabic intervals across the four age periods is not statistically significant (p>0.05). This means that the change in segmental and syllabic intervals is continuous, and it reveals that the process of child speech development is gradual and cumulative.

Named Entity Recognition using CNN for Korean syllabic character. (음절 기반의 CNN를 이용한 개체명 인식)

Park, Hye-woong;Song, Young-Sook
- Annual Conference on Human and Language Technology / 2017.10a / pp.330-332 / 2017
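Named entity recognition (NER) is the task of assigning fixed tag values to named entities such as persons (PS), organizations (OG), locations (LC), dates (DT), and times (TI), thereby making that information visible. The features relevant to Korean NER have not yet been sufficiently identified, which is one factor slowing progress in natural language processing. Because Korean forms words on a syllable basis and has relatively free word order, we propose a "syllable-based Convolutional Neural Network (CNN)" architecture that can capture these characteristics well, and it achieved a performance of 66.80%. This method avoids the drop in NER performance caused by errors in stages that precede NER, such as morphological analysis, and makes it possible to omit the post-processing otherwise needed to remove particles, endings, and the like.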
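What syllable-level input looks like can be shown with a short data-preparation sketch. It does not reproduce the paper's CNN. It only assumes that each Hangul syllable is one precomposed character and that entity spans are encoded with a BIO scheme, which the abstract does not specify. The labels PS and LC come from the abstract; the sentence, the offsets, and the function name are hypothetical.

```python
def syllable_bio_tags(sentence, entities):
    """Pair every character of `sentence` with a BIO tag.

    entities -- list of (start, end, label) character offsets, end exclusive.
    Spaces are kept as characters and tagged like any other position.
    """
    tags = ["O"] * len(sentence)
    for start, end, label in entities:
        tags[start] = "B-" + label
        for i in range(start + 1, end):
            tags[i] = "I-" + label
    return list(zip(sentence, tags))


if __name__ == "__main__":
    # Hypothetical example: a person name and a place name with particles.
    sentence = "김철수는 서울에 산다"
    entities = [(0, 3, "PS"), (5, 7, "LC")]
    for syllable, tag in syllable_bio_tags(sentence, entities):
        print(repr(syllable), tag)
```

Because the tags attach to syllables, the particles 는 and 에 simply receive the tag O, so no morphological analysis or particle stripping is needed before tagging, which is the advantage the abstract claims for the syllable-based approach.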

Korean Compound Noun Decomposition Only Using Syllabic Information (음절 정보만 이용한 한국어 복합 명사 분해)

Park, Seong-Bae;Zhang, Byoung-Tak
- Annual Conference on Human and Language Technology / 2003.10d / pp.33-39 / 2003
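In Korean, compound nouns are formed very freely; that is, independent nouns can be written one after another without spaces. However, systems that process compound nouns, such as machine translation and information retrieval, need to decompose compound nouns back into their single nouns for accurate analysis. This paper presents the GECORAM (GEneralized Combination of Rule-based learning And Memory-based learning) algorithm for Korean compound noun decomposition. The advantage of rule learning algorithms is that people can easily understand the learned results, but their drawback is lower performance than other supervised learning algorithms. To address this, the paper presents a method that combines a rule learning algorithm with memory-based learning. In the experiments, the GECORAM algorithm showed higher accuracy than rule-based learning or memory-based learning used alone.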
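The general idea of combining rules with memory-based learning can be sketched loosely as follows. This is not GECORAM itself, whose combination scheme the abstract does not describe. The sketch assumes that each gap between two syllables is classified as a split point or not, that an explicit rule is consulted first, and that the nearest stored example decides when no rule applies. The rules, the stored examples, and all names are toy assumptions.

```python
def gap_features(word, i):
    """Syllables immediately left and right of the gap before word[i]."""
    return (word[i - 1], word[i])


def decide_split(features, rules, memory):
    """Use a matching rule if one exists, else the most similar memory item.

    rules  -- {features: bool}
    memory -- non-empty list of (features, bool) stored examples
    """
    if features in rules:
        return rules[features]

    def overlap(stored):
        # Number of feature positions shared with the stored example.
        return sum(a == b for a, b in zip(stored[0], features))

    return max(memory, key=overlap)[1]


def decompose(word, rules, memory):
    """Split a compound noun at every gap classified as a boundary."""
    parts, start = [], 0
    for i in range(1, len(word)):
        if decide_split(gap_features(word, i), rules, memory):
            parts.append(word[start:i])
            start = i
    parts.append(word[start:])
    return parts


# Toy knowledge, for illustration only.
RULES = {("보", "검"): True}
MEMORY = [(("정", "보"), False), (("검", "색"), False), (("색", "시"), True),
          (("시", "스"), False), (("스", "템"), False)]

if __name__ == "__main__":
    print(decompose("정보검색시스템", RULES, MEMORY))  # → ['정보', '검색', '시스템']
```

Only syllable identities are used as features here, in line with the "syllabic information only" of the title. A real system would learn both the rules and the stored examples from annotated data.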

