• Title/Summary/Keyword: Sentence Frequency

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean (한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구)

  • Kwon, Soon-Il;Park, Ji-Hyung;Park, Neung-Soo
    • The KIPS Transactions: Part B / v.15B no.6 / pp.595-602 / 2008
  • Identifying the focused word of each sentence helps in recognizing and understanding spoken Korean. To find a method for spotting focused words in speech signals, we analyzed the average and variance of the fundamental frequency (F0) and the average energy of the focused word and of the other words in a sentence, using speech data from 100 spoken sentences. The results showed that focused words have either a higher relative average F0 or a higher relative variance of F0 than other words. These findings contribute to characterizing the prosody of spoken Korean and to keyword extraction based on natural language processing.
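
The finding above (a focused word shows a higher relative average F0 or a higher relative F0 variance) can be illustrated with a minimal sketch. The abstract does not give the paper's normalization or decision rule, so everything here is an assumption: each word's statistic is divided by the sentence-wide average of that statistic, the word with the largest ratio is picked, and the F0 values are made up for illustration.

```python
from statistics import mean, pvariance


def relative_f0_features(words):
    """Return (relative mean F0, relative F0 variance) for each word.

    `words` is a list of (label, f0_values) pairs, where f0_values are
    F0 samples in Hz for that word. Normalizing by the sentence-wide
    average is an assumption, not the paper's stated method.
    """
    means = [mean(f0) for _, f0 in words]
    variances = [pvariance(f0) for _, f0 in words]
    mean_ref = mean(means)
    var_ref = mean(variances)  # assumed non-zero for real speech data
    return [(m / mean_ref, v / var_ref) for m, v in zip(means, variances)]


def spot_focused_word(words):
    """Return the index of the word whose larger relative feature is highest."""
    feats = relative_f0_features(words)
    return max(range(len(feats)), key=lambda i: max(feats[i]))


# Hypothetical three-word sentence; the F0 values are invented.
sentence = [
    ("word_a", [180, 182, 181]),
    ("word_b", [210, 250, 230]),
    ("word_c", [175, 178, 176]),
]
focused = spot_focused_word(sentence)
print(sentence[focused][0])
```

Taking the larger of the two ratios mirrors the "either ... or" wording of the result. A real system would also need F0 extraction and word alignment, which this sketch leaves out.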

Acoustic Analysis of the Differences of Fricatives and Affricates between Normal Children and Cleft Palate Children (구개파열 아동과 정상 아동의 마찰음과 파찰음의 음향음성학적 특성 비교)

  • You, Young-Sin;Jang, Seung-Jin;Bak, Seung-Jae;Choi, Yae-Lin
    • The Journal of the Korea Contents Association / v.10 no.5 / pp.285-295 / 2010
  • The cut-off frequency is the frequency at which noise energy is generated, that is, the point where the preceding vowel ends. This study examines, for the fricatives and affricates of children with cleft palate and of normal children, the cut-off frequencies, how the cut-off frequencies change with the following vowel, and the correlation between cut-off frequencies and nasalance scores. The subjects were 12 children residing in Seoul and the Gyeonggi area: six children diagnosed with cleft palate whose chronological age was over six years, and six normal children, also over six years, matched to the first group in chronological age and sex. Each subject was presented with fricatives and affricates in meaningless-syllable and sentence environments (50 environments). In both the meaningless-syllable and sentence environments, children with cleft palate had lower cut-off frequencies than normal children. The correlation between cut-off frequency and nasalance score was not statistically significant for normal children in either environment, but it was statistically significant for children with cleft palate in the sentence environment.

Sentence Similarity Analysis using Ontology Based on Cosine Similarity (코사인 유사도를 기반의 온톨로지를 이용한 문장유사도 분석)

  • Hwang, Chi-gon;Yoon, Chang-Pyo;Yun, Dai Yeol
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.05a / pp.441-443 / 2021
  • Sentence or text similarity is a measure of the degree of similarity between two sentences. Techniques for measuring text similarity include Jaccard similarity, cosine similarity, Euclidean similarity, and Manhattan similarity. Cosine similarity is currently the most widely used, but because it relies only on the occurrence or frequency of words in a sentence, it does not adequately capture semantic relationships. We therefore aim to improve the efficiency of sentence similarity analysis by using an ontology to define relations between words and by including semantic similarity when extracting the words common to two sentences.
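
The limitation described above can be seen in a short sketch of two of the named measures, cosine and Jaccard similarity, computed over plain word counts. The whitespace tokenizer and the example sentences are assumptions chosen for illustration, and the ontology-based extension the abstract proposes is not implemented here.

```python
import math
from collections import Counter


def tokenize(sentence):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return sentence.lower().split()


def cosine_similarity(a, b):
    """Cosine of the angle between the word-frequency vectors of a and b."""
    ca, cb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard_similarity(a, b):
    """Size of the shared vocabulary divided by the size of the joint vocabulary."""
    sa, sb = set(tokenize(a)), set(tokenize(b))
    if not sa and not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


print(cosine_similarity("the cat sat", "the cat ran"))
print(jaccard_similarity("the cat sat", "the cat ran"))
# Synonyms share no surface tokens, so a purely frequency-based score is zero.
print(cosine_similarity("a car", "an automobile"))
```

The last call shows why word-count measures miss semantic relations: "car" and "automobile" contribute nothing to the dot product, which is the gap that word relations from an ontology are meant to fill.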

The Production and Perception of Focus in English Yes-No Questions (영어 가부 의문문 초점 발화와 지각)

  • Jeon, Yoon-Shil;Oh, Sei-Poong;Kim, Kee-Ho
    • Speech Sciences / v.11 no.3 / pp.111-128 / 2004
  • In English, a focused word carrying new information receives a pitch accent. This paper examines how native English speakers and Korean speakers produce and perceive focus in English yes-no questions. The production experiments show that native speakers realize the appropriate intonation of yes-no questions, in which a focused word has a low pitch accent followed by a high phrasal accent and a high boundary tone, whereas Korean speakers usually give a high tone to a focused word. Similarly, the perception experiments show that native English speakers judge a word with a low tone to be focused, while Korean speakers have difficulty recognizing a focused word realized with a low tone. Korean speakers also tend to perceive low tones on sentence-initial and sentence-final focused words better than those on sentence-medial focused words, and they often perceive a word with a relatively high fundamental frequency or a sharp rise in fundamental frequency as focused. These results show that Korean speakers have trouble producing and perceiving the appropriate tonal pattern of a focused yes-no question, which can cause confusion in conversation with native speakers.

A study on speech analysis of person with presbycusis (노인성 난청인의 음성특성에 관한 연구)

  • Lee, S.M.;Song, C.G.;Woo, H.C.;Lee, Y.M.;Kim, W.K.
    • Proceedings of the KOSOMBE Conference / v.1997 no.11 / pp.67-70 / 1997
  • In this paper, we evaluated the speech characteristics of hearing-impaired persons (HIP) who acquired their hearing loss after youth. It is commonly observed that severe hearing impairment degrades not only speech perception but also vocalization, so there is a need for sensitive and quantitative measures for assessing the speech of HIP, to serve both diagnostic and prognostic purposes. Seven HIP and 12 normal-hearing persons (NHP) were studied with a pure-tone test and a speaking test using a word/sentence table consisting of a vowel (a:), one- and two-syllable items, and a sentence. We analyzed the formant frequency, pitch, sound intensity, and speech duration of HIP and NHP speech. The results show that in HIP speech the formant frequency was shifted, first-formant prominence was reduced, the dynamic range of sound intensity was decreased, and speech duration was prolonged. In future work, we expect the correlation between hearing and the speech characteristics of HIP to be clarified through analysis of more acoustic parameters and more precise selection of the HIP group.

Complex Sentence Development of Korean-Chinese Bilingual Children (한국어-중국어 이중 언어 아동의 한국어 발달 : 복문발달을 중심으로)

  • Lee, Kwee-Ok;Lee, Hae-Ryoun
    • Korean Journal of Child Studies / v.29 no.5 / pp.1-12 / 2008
  • This study investigated the development of complex sentences in the early utterances of Korean-Chinese children. The subjects were 47 Korean-Chinese children living in China (20 two-year-olds, 15 three-year-olds, and 12 four-year-olds). Each child's spontaneous speech during interaction with his or her caregiver was videotaped for about 30 minutes and analyzed for Korean complex sentences using Kim's (2000) categories and Korean Computerized Language Analysis 2.0 (2000). Results showed that older children had a higher Mean Length of Utterance and a greater number and frequency of word types than younger children. The language development of the bilingual children was delayed compared with that of monolingual children, but the developmental sequence was similar in the two groups.

Error Analysis: What Problems do Learners Face in the Production of the English Passive Voice?

  • Jung, Woo-Hyun
    • English Language & Literature Teaching / v.12 no.2 / pp.19-40 / 2006
  • This paper deals with a part-specific analysis of grammatical errors in the production of the English passive in writing. The purpose of the study is dual: to explore common error types in forming the passive; and to provide plausible sources of the errors, with special attention to the role of the native language. To this end, this study obtained a large amount of data from Korean EFL university students using an essay writing task. The results show that in forming the passive sentence, errors were made in various ways and that the most common problem was the formation of the be-auxiliary, in particular, the proper use of tense and S-V agreement. Another important finding was that the global errors found in this study were not necessarily those with the greatest frequency. Also corroborated was the general claim that many factors work together to account for errors. In many cases, interlingual and intralingual factors were shown to interact with each other to explain the passive errors made by Korean students. On the basis of the results, suggestions are made for effective and well-formed use of the passive sentence.

A Term Importance-based Approach to Identifying Core Citations in Computational Linguistics Articles

  • Kang, In-Su
    • Journal of the Korea Society of Computer and Information / v.22 no.9 / pp.17-24 / 2017
  • Core citation recognition is the task of identifying the influential articles among the prior articles that a scholarly article cites. Previous approaches have employed citing-text occurrence information, textual similarities between the citing and cited articles, etc. This study proposes a term-based approach to core citation recognition, which exploits the importance of individual terms appearing in in-text citations to calculate an influence strength for each cited article. Term importance is computed from several kinds of frequency information: term frequency (tf) in the in-text citation, tf in the citing article, inverse sentence frequency in the citing article, and inverse document frequency in a collection of articles. Experiments using a previous test set consisting of computational linguistics articles show that the term-based approach performs comparably with the previous approaches. The proposed technique could easily be extended by employing other term units such as n-grams and phrases, or by using new term-importance formulae.
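
To make the frequency signals above concrete, here is a minimal sketch combining two of them: term frequency in the in-text citation and inverse sentence frequency in the citing article. The abstract does not give the paper's actual term-importance formula, so the product tf × log(N / sf) and the simple sum over terms are assumptions borrowed from the familiar tf-idf pattern. The function names and the toy article are hypothetical.

```python
import math
from collections import Counter


def inverse_sentence_frequency(sentences):
    """Map each term to log(N / sf), where sf counts the sentences containing it."""
    n = len(sentences)
    sf = Counter()
    for tokens in sentences:
        sf.update(set(tokens))  # count each term once per sentence
    return {term: math.log(n / count) for term, count in sf.items()}


def influence_strength(citing_sentences, all_sentences):
    """Sum of tf * isf over the terms of the sentences citing one article.

    An assumed scoring rule for illustration, not the paper's formula.
    """
    isf = inverse_sentence_frequency(all_sentences)
    tf = Counter(t for tokens in citing_sentences for t in tokens)
    return sum(count * isf.get(term, 0.0) for term, count in tf.items())


# Hypothetical citing article, already tokenized into four sentences.
article = [
    ["we", "extend", "parser", "of", "A"],
    ["parser", "is", "fast"],
    ["we", "use", "data", "of", "B"],
    ["results", "are", "good"],
]
score_a = influence_strength([article[0]], article)
score_b = influence_strength([article[2]], article)
print(score_a, score_b)
```

Terms that occur in many sentences of the citing article receive low weight, so a citation described with article-specific vocabulary scores higher under this assumed rule. The other two signals, tf in the citing article and inverse document frequency, would enter as further factors.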

Acoustic Characteristics of Korean Alveolar Sibilant 's', 's'' according to Phonetic Contexts of Children with Cerebral Palsy (뇌성마비 아동의 음성 환경에 따른 치경마찰음 'ㅅ', 'ㅆ'의 음향학적 특성)

  • Kim, Sookhee;Kim, Hyungi
    • Phonetics and Speech Sciences / v.5 no.2 / pp.3-10 / 2013
  • The purpose of this study is to analyze the acoustic characteristics of the Korean alveolar sibilants produced by children with cerebral palsy. Thirteen children with spastic cerebral palsy, aged 6 to 10 years, were selected by an articulation test and compared with a control group of thirty children. Meaningless monosyllables (CV), disyllables (VCV, /asa/), and frame sentences including the target CV syllables were measured. C was drawn from /s, s'/ and V from the set /a, i, u, ɛ, o, ɯ, ʌ/. Multi-Speech was used for data recording and analysis. The frication duration of the lenis and glottalized alveolar sibilants of the children with cerebral palsy was significantly shorter than that of the control group in CV, VCV, and frame sentences. The duration of the vowel following the lenis and glottalized alveolar sibilants was significantly longer for the children with cerebral palsy than for the control group in CV, VCV, and frame sentences. The frequency and intensity of the frication intervals were significantly lower for the children with cerebral palsy than for the control group in CV, VCV, and frame sentences. When the sibilants of the cerebral palsy group were compared by phonation type, frication duration differed significantly between phonation types in CV and VCV and between phonetic contexts. The glottalized sibilant was longer than the lenis sibilant in all phonetic contexts. The duration of the following vowel differed significantly between phonation types in VCV and between phonetic contexts (p<.05). The vowel following the glottalized sibilant was longer than the vowel following the lenis sibilant in all phonetic contexts. Frequency differed significantly between phonation types in CV, and intensity differed significantly between phonation types in CV and VCV. The children with spastic cerebral palsy had difficulty articulating the alveolar sibilants because of poor control of the laryngeal, respiratory, and articulatory movements that require fine motor coordination. This study quantitatively analyzes the acoustic parameters of the alveolar sibilants in various phonetic contexts, and the results are expected to provide fundamental data for articulation intervention for children with cerebral palsy.

Performance of Pseudomorpheme-Based Speech Recognition Units Obtained by Unsupervised Segmentation and Merging (비교사 분할 및 병합으로 구한 의사형태소 음성인식 단위의 성능)

  • Bang, Jeong-Uk;Kwon, Oh-Wook
    • Phonetics and Speech Sciences / v.6 no.3 / pp.155-164 / 2014
  • This paper proposes a new method for determining the recognition units for large vocabulary continuous speech recognition (LVCSR) in Korean by applying unsupervised segmentation and merging. In the proposed method, a text sentence is segmented into morphemes, and position information is added to the morphemes. Submorpheme units are then obtained by splitting the morpheme units so as to maximize posterior probability terms, which are computed from the morpheme frequency distribution, the morpheme length distribution, and the morpheme frequency-of-frequency distribution. Finally, the recognition units are obtained by sequentially merging the submorpheme pair with the highest frequency. Computer experiments are conducted using a Korean LVCSR system with a 100k-word vocabulary and a trigram language model trained on a 300-million-eojeol (word phrase) corpus. The proposed method is shown to reduce the out-of-vocabulary rate to 1.8% and to reduce the syllable error rate by a relative 14.0%.
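
The final merging step described above (repeatedly joining the most frequent adjacent submorpheme pair) can be sketched as a greedy loop. This covers only that step: the posterior-probability splitting and the position tags are omitted, and the stopping rule (a fixed number of merges), the function names, and the romanized toy units are all assumptions for illustration.

```python
from collections import Counter


def most_frequent_pair(corpus):
    """Return the most frequent adjacent unit pair, or None if there is none."""
    pairs = Counter()
    for units in corpus:
        pairs.update(zip(units, units[1:]))
    return max(pairs, key=pairs.get) if pairs else None


def merge_pair(units, pair):
    """Replace each non-overlapping occurrence of `pair` with the joined unit."""
    merged, i = [], 0
    while i < len(units):
        if i + 1 < len(units) and (units[i], units[i + 1]) == pair:
            merged.append(units[i] + units[i + 1])
            i += 2
        else:
            merged.append(units[i])
            i += 1
    return merged


def merge_units(corpus, num_merges):
    """Apply up to `num_merges` greedy merges to every sequence in the corpus."""
    corpus = [list(units) for units in corpus]
    for _ in range(num_merges):
        pair = most_frequent_pair(corpus)
        if pair is None:
            break
        corpus = [merge_pair(units, pair) for units in corpus]
    return corpus


# Hypothetical submorpheme sequences (romanized for readability).
toy_corpus = [["ha", "k", "kyo"], ["ha", "k", "saeng"], ["ha", "k"]]
print(merge_units(toy_corpus, 1))
```

Here the pair ("ha", "k") occurs three times and is merged first. In a real system the number of merges would presumably be tuned against vocabulary size and out-of-vocabulary rate, which the abstract reports but does not specify.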