• Title/Summary/Keyword: vowel system

A VOWEL TRAJECTORY DISPLAY FOR SPEECH TRAINING

  • Kido, Ken'iti;Tanahashi, Kenji;Ohuchi, Yasuhiro
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.971-976
    • /
    • 1994
  • A speech display system is developed for the evaluation and training of speech utterance. The speech is analyzed by linear prediction every 5 ms, and the frequencies of the lowest two local spectral peaks, P1 and P2, are extracted. The vowel trajectory is displayed using those frequencies on the P1-P2 plane. In most cases, P1 and P2 correspond to the first and second formants, but for indistinct utterances the correspondence between the local spectral peaks and the formants tends to break down. The system is therefore considered useful for evaluating speech quality. Examples of words uttered by normal speakers and by patients with difficulty in utterance are compared with each other to discuss the effectiveness of the system.
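
The peak-picking step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the linear-prediction spectral envelope of one 5 ms frame has already been computed elsewhere, and the function name `lowest_two_peaks` and its arguments are hypothetical.

```python
def lowest_two_peaks(freqs_hz, envelope):
    """Return the frequencies of the two lowest local maxima of a spectral envelope.

    freqs_hz and envelope are equal-length sequences for a single analysis frame.
    Computing the envelope (e.g. by linear prediction) is assumed to happen elsewhere.
    Returns a (P1, P2) tuple; a missing peak is reported as None.
    """
    peaks = []
    for i in range(1, len(envelope) - 1):
        # A local peak rises above its left neighbour and does not fall below its right one.
        if envelope[i] > envelope[i - 1] and envelope[i] >= envelope[i + 1]:
            peaks.append(freqs_hz[i])
            if len(peaks) == 2:
                break
    while len(peaks) < 2:
        peaks.append(None)
    return tuple(peaks)


# Hypothetical envelope sampled every 250 Hz.
freqs = [0, 250, 500, 750, 1000, 1250, 1500, 1750]
env = [1.0, 3.0, 2.0, 1.0, 4.0, 2.0, 5.0, 1.0]
p1, p2 = lowest_two_peaks(freqs, env)
```

Plotting the sequence of (P1, P2) pairs, one per frame, would give a trajectory on the P1-P2 plane of the kind the abstract describes.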

A Study on the Acoustic Characteristics of the American Adults Using Phonetic System for Sasang Constitution (한국성인(韓國成人)의 사상체질음성분석기(絲狀體質音聲分析機)를 이용한 체질별(體質別) 음향특성(音響特性) 연구(硏究))

  • Shin, Mi-Ran;Kim, Dal-Rae;Yoo, Jun-Sang
    • Journal of Sasang Constitutional Medicine
    • /
    • v.19 no.3
    • /
    • pp.75-88
    • /
    • 2007
  • 1. Objectives: The purpose of this study was to objectively diagnose American males' and females' production of the two vowels /a, i/ by Sasang Constitution. 2. Methods: The constitutional characteristics of American adults' voices were analyzed with PSSC-2004. A total of 134 cases of the vowels /a, i/, each with a duration of 2.5 to 3 seconds, were input into PSSC-2004 and analyzed into 40 factors. 3. Results and Conclusions: 1) APQ: In the male group's production of vowel /a/, the Soyangin's APQ(1), APQ(3) and APQ(4) were significantly high compared with those of Taeumin and Soeumin. 2) Shimmer: In the male group's production of vowel /a/, Soeumin's Octave1 Shimmer was significantly low compared with that of Taeumin and Soeumin. In the male group's production of vowel /i/, Soeumin's D-Shimmer was significantly low compared with that of Taeumin and Soeumin. In the female group's production of vowel /a/, the Soyangin's C-Shimmer was significantly high compared with that of Taeumin and Soeumin. 3) Octave: In the male group's production of vowel /a/, the Soyangin's Octave3, Octave4, Octave5, Octave6 and Octave1 Ratio were significantly high compared with those of Taeumin and Soeumin. In the male group's production of vowels /a, i/, the Soyangin's Octave4 was significantly high compared with that of Taeumin and Soeumin. 4) Energy: In the male group's production of vowel /a/, the Soyangin's Time Domain Total Sum /Time Domain Count, Freq Domain Total Sum /cnt(0), 0k-4k Total Sum, Dev., A(A#, C, E, D#, E, F#) tot E, and A(C,, D#, F#) Dev. were significantly high compared with those of Taeumin and Soeumin. In the male group's production of vowel /i/, the Soyangin's Time Domain Total Sum /Time Domain Count, Freq Domain Total Sum /cnt(0) and 0k-4k Total Sum, Dev. were significantly high compared with those of Taeumin and Soeumin. 5) Peak: In the male group's production of vowels /a/ and /i/, the Soyangin's Peak1 Ratio was significantly low compared with that of Taeumin and Soeumin. In the male group's production of vowels /a/ and /i/, the Soyangin's Peak10 Ratio, Time Domain Peak Total/Total Energy Sum, Time Domain Peak Dev. and Total/Total Dev. Sum were significantly high compared with those of Taeumin and Soeumin. 6) The acoustic analysis of American and Korean speakers needs to be extended to other countries for the diagnosis of the Sasang Constitution by voice characteristics.

Historic Status and Grammatical Characteristics of Korean language in the Early 20th Century (한국어사에서 20세기 초 한국어의 위상과 문법 특징)

  • Hong, Jongseon
    • Korean Linguistics
    • /
    • v.71
    • /
    • pp.1-22
    • /
    • 2016
  • The early 20th century was a period when Korea was confronted with the surging waves of modernization and made a variety of internal reactions. The Korean language, not immune to the upheaval, also experienced new changes and gradually gained the characteristics of today's Korean. Although scholars have not yet fully agreed upon the periodization of Korean, the Gabo Reform (1894) is usually considered to be the beginning of modern Korean. Thus, the early 20th century was also the beginning of modern Korean. Phonological, lexical, and grammatical characteristics of modern-day Korean began to appear during this period. Phonologically, the 10-vowel system was established, glottal sounds and aspirated sounds increased, and vowel harmony declined. Phenomena such as vowel raising, front-vowelization, monophthongization, and the word-initial rule appeared. Meanwhile, mixed hangul-Chinese writing became common practice, hangul-only writing started to appear in narrative writing, and elements of spoken language began to be reflected in written language. All of these pointed to the unification of written and spoken language. Under the influence of modernization, a great number of new words appeared. In particular, Japanese and other foreign words flooded in. Grammatically, the '-eos-(-엇-), -neun-(-는-), -ges-(-겟-)' trichotomy system of tenses was established, and the hearer-oriented honorific system formed a binary system of 'hasoseo(하소서), hasibsio(하십시오), hao(하오), hage(하게), haera(해라)' and 'hae (해), haeyo(해요)'. In word formation and sentence construction, the use of '-gi(-기)' became more frequent than '-eum(-음)', while the use of '~geot(~것)' also increased significantly. In negative, causative and passive expressions, the use of the long form, which has fewer restrictions than the short form, became more frequent. A tendency towards simplicity appeared. In the same vein, long and complex sentences with several clauses tended to be avoided. Instead, short simple sentences became more favored. Korean linguistics scholars should pay closer attention to the modernization period, which includes the early 20th century. In order to fully understand today's Korean language, more thorough research on this immediately preceding period is necessary.

A Study on Data Sharing Codes Definition of Hangul in CAI Application Programs (CAI 응용프로그램 작성시 자료공유를 위한 한글 코드 체계 정의에 관한 연구)

  • Kho, Dae-Gon
    • Journal of The Korean Association of Information Education
    • /
    • v.2 no.1
    • /
    • pp.138-161
    • /
    • 1998
  • This research aims to establish a systematic approach to designing a standard Hangul code system for educational purposes in the development of CAI courseware using Korean, English, and Chinese characters, which requires data exchange and database construction. In this paper, the types of Korean alphabetic code systems already in use, their representational environments, and their consonant/vowel order are studied and analyzed. The paper presents the requirements that a Hangul code system for educational purposes must satisfy. First, it should be able to represent all contemporary as well as ancient Korean letters. Second, character elements should be separable. Third, the consonant/vowel order should be determined so that data can be retrieved and exchanged easily. Lastly, the code should maintain compatibility with other national codes and provide uniqueness of user-defined character codes.
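
The requirement that character elements be separable can be illustrated with the Unicode Hangul Syllables block, where each precomposed syllable is an arithmetic combination of a leading consonant, a vowel, and an optional trailing consonant. This is a sketch under that assumption only: the abstract does not specify the code system the paper proposes, and this block covers modern syllables only. The function names are hypothetical.

```python
S_BASE = 0xAC00   # first precomposed syllable, U+AC00
L_BASE = 0x1100   # first leading-consonant jamo
V_BASE = 0x1161   # first vowel jamo
T_BASE = 0x11A7   # one before the first trailing-consonant jamo
L_COUNT, V_COUNT, T_COUNT = 19, 21, 28


def decompose(syllable):
    """Split one precomposed Hangul syllable into (lead, vowel, tail) indices."""
    index = ord(syllable) - S_BASE
    if not 0 <= index < L_COUNT * V_COUNT * T_COUNT:
        raise ValueError("not a precomposed Hangul syllable")
    lead, rest = divmod(index, V_COUNT * T_COUNT)
    vowel, tail = divmod(rest, T_COUNT)
    return lead, vowel, tail   # tail == 0 means no trailing consonant


def compose(lead, vowel, tail=0):
    """Inverse of decompose: build the precomposed syllable from its indices."""
    return chr(S_BASE + (lead * V_COUNT + vowel) * T_COUNT + tail)


def to_jamo(syllable):
    """Return the syllable as a string of conjoining jamo code points."""
    lead, vowel, tail = decompose(syllable)
    jamo = chr(L_BASE + lead) + chr(V_BASE + vowel)
    if tail:
        jamo += chr(T_BASE + tail)
    return jamo
```

Because the indices follow a fixed consonant/vowel order, sorting by `(lead, vowel, tail)` gives the same order as sorting the precomposed code points, which relates to the abstract's third requirement.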

Consonant-Vowel Classification Based Segmentation Technique for Handwritten Off-Line Hangul (자소 클래스 인식에 의한 off-line 필기체 한글 문자 분할)

  • Hwang, Sun-Ja;Kim, Mun-Hyeon
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.4
    • /
    • pp.1002-1013
    • /
    • 1996
  • The segmentation of characters is an important step in the automatic recognition of handwritten text. This paper proposes a segmentation method for off-line handwritten Hangul. The suggested approach is based on the structural characteristics of Hangul. The first step extracts the local features, connected components, and strokes from the input word. In the second step, the class of each stroke is identified. The third, segmenting step specifies the WRC (White Run Column) before a consonant or horizontal vowel. If a segment is longer than a threshold, the system estimates segmenting columns using the consonant-vowel information and column features, and then finds a cornered boundary along the strokes within the estimated segmenting columns.
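
One plausible reading of the White Run Column step is a search for runs of image columns that contain no ink. The sketch below assumes that reading and a binary image given as a list of rows (1 = ink, 0 = white); the abstract does not define WRC precisely, so the function name and the `min_width` parameter are hypothetical.

```python
def white_run_columns(image, min_width=1):
    """Return (start, end) column ranges, inclusive, that contain no ink at all.

    image is a list of equal-length rows of 0/1 values, where 1 marks an ink pixel.
    Runs narrower than min_width columns are ignored.
    """
    if not image:
        return []
    width = len(image[0])
    blank = [all(row[x] == 0 for row in image) for x in range(width)]

    runs = []
    start = None
    for x, is_blank in enumerate(blank):
        if is_blank and start is None:
            start = x                      # a white run begins
        elif not is_blank and start is not None:
            if x - start >= min_width:
                runs.append((start, x - 1))
            start = None                   # the run ended at the previous column
    if start is not None and width - start >= min_width:
        runs.append((start, width - 1))    # run touching the right edge
    return runs


sample = [
    [1, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0],
]
gaps = white_run_columns(sample)
```

Each returned range is a candidate cut position between characters; touching or overlapping strokes leave no white run, which is where the abstract's estimated segmenting columns would come in.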

Speech Recognition of the Korean Vowel 'ㅡ' based on Neural Network Learning of Bulk Indicators (벌크 지표의 신경망 학습에 기반한 한국어 모음 'ㅡ'의 음성 인식)

  • Lee, Jae Won
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.11
    • /
    • pp.617-624
    • /
    • 2017
  • Speech recognition is now one of the most widely used technologies in HCI. Many applications where speech recognition may be used (such as home automation, automatic speech translation, and car navigation) are now under active development. In addition, the demand for speech recognition systems in mobile environments is rapidly increasing. This paper is intended to present a method for instant recognition of the Korean vowel 'ㅡ', as a part of a Korean speech recognition system. The proposed method uses bulk indicators (which are calculated in the time domain) instead of the frequency domain and consequently, the computational cost for the recognition can be reduced. The bulk indicators representing predominant sequence patterns of the vowel 'ㅡ' are learned by neural networks and final recognition decisions are made by those trained neural networks. The results of the experiment show that the proposed method can achieve 88.7% recognition accuracy, and recognition speed of 0.74 msec per syllable.

Intrinsic Fundamental Frequency(Fo) of Vowels in the Esophageal Speech (식도음성의 고유기저주파수 발현 현상)

  • 홍기환;김성완;김현기
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.9 no.2
    • /
    • pp.142-146
    • /
    • 1998
  • Background : It has been established that the fundamental frequency (Fo) of vowels varies systematically as a function of vowel height. Specifically, high vowels have a higher Fo than low vowels. Two major hypotheses dominate contemporary attempts to explain the mechanisms underlying intrinsic variation in vowel Fo: the source-tract coupling hypothesis and the tongue-pull hypothesis. Objectives : Total laryngectomy surgery necessitates removal of all structures between the hyoid bone and the tracheal rings. Therefore, the assumption that no direct interconnection exists between the tongue and the pharyngoesophageal segment that would mediate systematic variation in vowel Fo appears quite reasonable. If the tongue-pull hypothesis is correct, systematic differences in Fo between high and low vowels produced by esophageal speakers would not be expected. We analyzed the Fo in the vowels of esophageal voice. Materials and method : The subjects were 11 laryngectomee patients with fluent esophageal voice. The five essential vowels were recorded and analyzed with a computer speech analysis system (Computerized Speech Lab). The Fo was measured from the acoustic waveform, both automatically and manually, and by narrow-band spectral analysis. Results : The results of this study reveal that intrinsic variation in vowel Fo is clearly evident in esophageal speech. In automatic analysis of the acoustic waveform, the signals were too irregular for the Fo to be measured precisely, so the data from that analysis are not reliable. However, Fo measured from the manually analyzed acoustic waveform or by narrow-band spectral analysis gave acceptable results. These results were interpreted to support neither the source-tract coupling nor the tongue-pull hypothesis and led us to offer an alternative explanation to account for intrinsic variation of Fo.

A Study on the Hangul Recognition Using Hough Transform and Subgraph Pattern (Hough Transform과 부분 그래프 패턴을 이용한 한글 인식에 관한 연구)

  • 구하성;박길철
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.3 no.1
    • /
    • pp.185-196
    • /
    • 1999
  • In this dissertation, a new off-line recognition system is proposed using a subgraph pattern and a neural network. After thinning is applied to the input characters, balancing with a noise elimination function on location is performed. Then, as the first step of the recognition procedure, circular elements are extracted and recognized. From the subblock HT, spatial feature points such as endpoints, flex points, and bridge points are extracted, and a subgraph pattern is formed by observing the relations among them. A region where a vowel can exist is allocated and a candidate point of the vowel is extracted. Then, using the subgraph pattern dictionary, the vowel is recognized. The same method is applied to extract horizontal vowels, and the vowel is recognized through a simple structural analysis. To verify the subgraph-based recognition, experiments were done with the most frequently used Myngjo and Gothic fonts for printed characters and with handwritten characters. For the Gothic font, the character recognition rate was 98.9%. For Myngjo font characters, the recognition rate was 98.2%. For handwritten characters, the recognition rate was 92.5%. The total recognition rate was 94.8% with mixed handwritten and printed characters for multi-font recognition.
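
For readers unfamiliar with the Hough transform (HT) this abstract relies on, the sketch below shows the basic line-voting idea on a set of pixel coordinates. It is a plain whole-image accumulator under the usual rho = x·cos(theta) + y·sin(theta) parameterization, not the subblock variant or the feature-point extraction the paper describes; the function name and parameters are hypothetical.

```python
import math
from collections import Counter


def hough_lines(points, theta_step_deg=1):
    """Vote for straight lines through the given (x, y) points.

    Returns a Counter keyed by (theta in degrees, rounded rho) whose values are
    vote counts. A line is represented by rho = x*cos(theta) + y*sin(theta).
    """
    accumulator = Counter()
    for x, y in points:
        for theta_deg in range(0, 180, theta_step_deg):
            theta = math.radians(theta_deg)
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            accumulator[(theta_deg, rho)] += 1
    return accumulator


# Twenty pixels of a horizontal stroke at y = 5.
stroke = [(x, 5) for x in range(20)]
votes = hough_lines(stroke)
```

A horizontal stroke collects all of its votes in the bin theta = 90 degrees, rho = 5. Neighbouring angles can tie with it because rho is rounded, so a practical detector would usually add peak smoothing or non-maximum suppression.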

A Research on the Spoken Language in Korean Voices from Berlin: Focusing on Phonological and Morphological Features (20세기 초 베를린 한인 음원의 음운과 형태)

  • Cha, Jaeeun;Hong, Jongseon
    • Korean Linguistics
    • /
    • v.72
    • /
    • pp.257-282
    • /
    • 2016
  • The aim of this paper is to research phonological and morphological features in Korean Voices from Berlin. The Korean Voices from Berlin were recorded in Berlin in 1917 by 5 Korean prisoners of World War I. Some of them came from North Hamgyeong Province and the others from Pyeongan Province, so these data show a North Korean regional dialect. The data consist of three kinds of material: counting numbers, reciting scriptures, and singing folksongs. The results of this research are as follows. 1) The consonant system of the Korean voices is similar to that of standard Korean. The 19 consonants are classified according to 5 manners of articulation and 5 points of articulation. 2) The liquid /l/ has three allophones: [ɾ] appears in onset position, [l] in word-medial coda position or when preceded by [l], and [ɹ] in word-final coda position. 3) The vowel system of the Korean voices is similar to that of early 20th-century Korean. It has 8 monophthongs, /a, ʌ, o, u, ɯ, i, e, ɛ/. 4) The numbers 1 to 10 in the Korean voices are similar to Middle Korean numerals. 5) The genitive particle '/ɯi/의' is pronounced [i], [ɯ], or [ɛ]; [ɯ] appears especially in Sino-Korean words. 6) The /l/-deletion in conjugations is similar to that of Middle Korean: /l/ is always deleted when a [+cor] consonant follows.

Automatic Vowel Onset Point Detection Based on Auditory Frequency Response (청각 주파수 응답에 기반한 자동 모음 개시 지점 탐지)

  • Zang, Xian;Kim, Hag-Tae;Chong, Kil-To
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.1
    • /
    • pp.333-342
    • /
    • 2012
  • This paper presents a vowel onset point (VOP) detection method based on the human auditory system. The method maps the "perceptual" frequency scale, i.e. the Mel scale, onto the linear acoustic frequency scale, and then establishes a bank of triangular Mel-weighted filters to simulate the band-pass filtering of the human ear. This nonlinear critical-band filter bank greatly reduces the data dimensionality and eliminates the effect of harmonics, making the formants more prominent in the nonlinearly spaced Mel spectrum. The sum of the Mel spectrum peak energies is extracted as the feature for each frame, and the instant at which the energy amplitude starts rising sharply is detected as the VOP by convolving with a Gabor window. For a single-word database containing 12 vowels articulated with different kinds of consonants, the experimental results showed a good average detection rate of 72.73%, higher than that of other vowel detection methods based on short-time energy and zero-crossing rate.
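
The triangular Mel-weighted filter bank mentioned in this abstract can be sketched as below. This is an illustration under common textbook conventions, not the authors' code: the Mel formula 2595·log10(1 + f/700), the bin mapping, and all function names and parameters are assumptions, and the peak-energy feature and Gabor-window convolution are not covered.

```python
import math


def hz_to_mel(hz):
    """Convert frequency in Hz to Mel (a common 2595*log10 variant)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filter_bank(n_filters, n_fft, sample_rate):
    """Build n_filters triangular filters over the n_fft // 2 + 1 FFT bins."""
    n_bins = n_fft // 2 + 1
    top_mel = hz_to_mel(sample_rate / 2.0)
    # Filter edges are equally spaced on the Mel scale, then mapped back to FFT bins.
    mel_points = [top_mel * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mel_points]

    bank = []
    for j in range(1, n_filters + 1):
        left, center, right = bins[j - 1], bins[j], bins[j + 1]
        weights = [0.0] * n_bins
        for k in range(left, center):      # rising slope
            weights[k] = (k - left) / (center - left)
        for k in range(center, right):     # falling slope, equal to 1.0 at the center bin
            weights[k] = (right - k) / (right - center)
        bank.append(weights)
    return bank


def mel_spectrum(power_spectrum, bank):
    """Apply the filter bank to one frame's power spectrum."""
    return [sum(w * p for w, p in zip(weights, power_spectrum)) for weights in bank]


bank = mel_filter_bank(n_filters=20, n_fft=512, sample_rate=16000)
```

With 20 filters the 257 FFT bins of each frame reduce to 20 values, which is the kind of dimensionality reduction the abstract refers to.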