• 제목/요약/키워드: Frequency of Words

검색결과 885건 처리시간 0.165초

Effects of Word Frequency on a Lenition Process: Evidence from Stop Voicing and /h/ Reduction in Korean

  • Choi, Tae-Hwan;Lim, Nam-Sil;Han, Jeong-Im
    • 음성과학
    • /
    • 제13권3호
    • /
    • pp.35-48
    • /
    • 2006
  • The present study examined whether words with higher frequency have more exposure to the lenition process such as intervocalic stop voicing or /h/ reduction in the production of the Korean speakers. Experiment 1 and Experiment 2 tested if word-internal intervocalic voicing and /h/ reduction occur more often in the words with higher frequency than less frequent words respectively. Results showed that the rate of voicing was not significantly different between the high frequency group and the low frequency group; rather both high and low frequency words were shown to be fully voiced in this prosodic position. However, intervocalic /h/s were deleted more in high frequency words than in low frequency words. Low frequency words showed that other phonetic variants such as [h] and [w] were found more often than in high frequency group. Thus the results of the present study are indefinitive as to the relationship between the word frequency and lenition with the data at hand.

  • PDF

한글 두 글자 단어와 비단어의 명명에 글자 빈도, 글자 유형과 위치가 미치는 영향 (The influence of syllable frequency, syllable type and its position on naming two-syllable Korean words and pseudo-words)

  • 신명석;박창호
    • 인지과학
    • /
    • 제35권2호
    • /
    • pp.97-112
    • /
    • 2024
  • 한글에서 글자는 단어를 구성하는 주요한 단위이다. 본 연구는 한글 글자의 속성(글자의 빈도, 모음의 유형, 받침의 유무, 글자의 위치)이 단어와 비단어의 명명에 영향을 미치는지를 알아보고자 하였다. 선형혼합 효과분석 결과 단어는 첫째 글자의 빈도가 증가할수록 명명시간이 감소하였으며, 첫째 글자에 받침이 있을 때 명명시간이 감소하였다. 또한, 단어가 횡모음 유형일 때보다 종모음 유형일 때 명명 정확률이 더 높았으며 첫째 글자에 받침이 없을 때보다 있을 때 명명시간이 감소하였다. 비단어는 첫째 글자의 빈도와 둘째 글자의 빈도가 각각 증가할수록 명명시간이 감소하였으며 명명 정확률이 더 높았다. 비단어도 횡모음 유형일 때보다 종모음 유형일 때 명명 정확률이 더 높았다. 본 연구에서 둘째 글자의 빈도 효과는 단어와 비단어에 따라 달랐지만, 첫째 글자의 빈도효과와 모음 유형의 효과는 일관되게 나타났다. 시각단어재인에 대해 연구 결과의 함의를 논의하였다.

텍스트 마이닝 처리로 품질경영학회지 연구동향 분석 (Analysis of Research Trends in Journal of Korean Society for Quality Management by Text Mining Processing)

  • 이상복
    • 품질경영학회지
    • /
    • 제47권3호
    • /
    • pp.597-613
    • /
    • 2019
  • Purpose: The purpose of this study is to analyze the trend of quality research by analyzing the entire JKSQM(Journal of the Korean Society for Quality Management). Methods: This study is to analyze the frequency of words used in the abstract of the all JKSQM by applying the text mining processing. We use wordcrowd among text mining techniques. Results: 22 words of high frequency were presented in the abstract of the paper published in the JKSQM for 42 years. The frequency of words was shown on a 10 year basis, and the four important words were plotted on a change graph for each Vol. Frequent words of each Vol. are added in the appendix. Conclusion: The main research results are as follows. First, there has been no significant change in research trends over the last 40 years. Second, the early SQC words have been widely used, and since 1990, many words such as service-oriented words have been used, indicating a change in the times. Third, the use of the words of the 4th industrial revolution since 2010 is weak. In the above analysis, the trend of quality research in Korea is within the quality category and can be considered conservative. Now, it is expected that everything will be changed in the period of the 4th Industrial Revolution, and it is time to study the direction of quality in Korea.

영어 동형이의어의 강세실현에 관한 연구 (A Study on the Stress Realization of English Homographic Words)

  • 김옥영;구희산
    • 말소리와 음성과학
    • /
    • 제2권2호
    • /
    • pp.51-60
    • /
    • 2010
  • This study is to examine how Korean speakers realize English stress on the homographic words. Experiments were performed by Korean speakers three times, before stress instruction, immediately after instruction, and six weeks after instruction. First, duration, fundamental frequency, and intensity of the vowel in a stressed syllable of three homographic words produced by Korean speakers were compared with those of native speakers of English. The result shows that when the words were used as nouns, before instruction Korean speakers had shorter duration and lower fundamental frequency in the stressed vowel than the native speakers, which indicates that Korean speakers did not assign the primary stress on the first syllable of the nouns. After instruction, the values of duration and fundamental frequency were increased and the differences between two groups were decreased. Next, the values of these stress features measured three times were analyzed in order to find out how they changed through instruction. The analysis shows that after instruction the values of three features were increased compared to the ones before instruction, and that the biggest change was in duration of the vowel and the next was fundamental frequency. Six weeks after instruction, the values of duration and intensity were decreased than those immediately after instruction. This means that instruction is helpful for Korean speakers to assign the stress for the English homographic words, and that instruction and practice are needed repeatedly.

  • PDF

저빈도어를 고려한 개념학습 기반 의미 중의성 해소 (Word Sense Disambiguation based on Concept Learning with a focus on the Lowest Frequency Words)

  • 김동성;최재웅
    • 한국언어정보학회지:언어와정보
    • /
    • 제10권1호
    • /
    • pp.21-46
    • /
    • 2006
  • This study proposes a Word Sense Disambiguation (WSD) algorithm, based on concept learning with special emphasis on statistically meaningful lowest frequency words. Previous works on WSD typically make use of frequency of collocation and its probability. Such probability based WSD approaches tend to ignore the lowest frequency words which could be meaningful in the context. In this paper, we show an algorithm to extract and make use of the meaningful lowest frequency words in WSD. Learning method is adopted from the Find-Specific algorithm of Mitchell (1997), according to which the search proceeds from the specific predefined hypothetical spaces to the general ones. In our model, this algorithm is used to find contexts with the most specific classifiers and then moves to the more general ones. We build up small seed data and apply those data to the relatively large test data. Following the algorithm in Yarowsky (1995), the classified test data are exhaustively included in the seed data, thus expanding the seed data. However, this might result in lots of noise in the seed data. Thus we introduce the 'maximum a posterior hypothesis' based on the Bayes' assumption to validate the noise status of the new seed data. We use the Naive Bayes Classifier and prove that the application of Find-Specific algorithm enhances the correctness of WSD.

  • PDF

의학 사상의 유사성은 계량 분석 될 수 있는가 - 『동의보감』과 『의학입문』, 『경악전서』를 중심으로 - (Can Similarities in Medical thought be Quantified? - Focusing on Donguibogam, Uihagibmun and Gyeongagjeonseo -)

  • 오준호
    • 대한한의학원전학회지
    • /
    • 제31권2호
    • /
    • pp.71-82
    • /
    • 2018
  • Objectives : The purpose of this study is to compare the similarities among Donguibogam(DO), Uihagibmun(UI), and Gyeongagjeonseo(GY) in order to examine whether the medical thoughts embedded in the texts can be compared in a quantitative way. Methods : Under an empirical assumption that medical thoughts can be reduced to the frequency of major key words within the text, we selected the fourteen words of the four categories that are commonly used to describe physiology and pathology in Korean medicine as key words. And the frequency of these key words was measured and compared with each other in the three important medical texts in Korea. Results : As a result of quantitative analysis based on ${\chi}^2$ statistic, the key words in the books were distributed most heterogeneously in DO and distributed most homogeneously in UI. In comparison of the similarity analyzed by the same method, DO and UI were significantly more similar than those of DO and UI. The results of the word frequency pattern and the similarities of the book contents(CBDF) show that DO is influenced by UI, and the differences between standardized residuals and homogeneity tells us that internal context of both books are constructed differently. Conclusions : These results support the results of traditional research by experts. With the above, we were able to confirm that medical thoughts can be reduced to the frequency of major key words within the text, and compared through the frequency of such key words.

우리말 어휘빈도 정보와 분절음 탈락의 관련성에 대한 연구 (Correlation between the frequency of word and the deletion of segment)

  • 차재은
    • 대한음성학회지:말소리
    • /
    • 제47호
    • /
    • pp.1-13
    • /
    • 2003
  • The purpose of this paper is to research the correlation between frequency and the deletion of /w, (equation omitted)/ in Korean. For this purpose, I select 11 words from the frequency data, then, analyze the speech of 20 speakers of standard Korean. As a result, I can find that there is correlation between the frequency and the deletion rate of segment. The rate of deletion is higher in high frequency words, while the rate of realization is higher in low frequency words. Although there is correlation between the frequency and the deletion rate of segment, the feature of segment, prosodic environments are more important in segment deletion.

  • PDF

과학적 소양 관련 논문에서 서술자의 종류와 빈도 특성 연구 (A Study of the Kinds and Frequency Characteristics of Descriptors in the Articles Related to Scientific Literacy)

  • 이명제
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제29권4호
    • /
    • pp.401-413
    • /
    • 2010
  • This study analyzed the kinds and frequencies of descriptors in 154 articles in ERIC data base on the 4th day of January in 2010. The titles of the articles includes the words, 'scientific literacy'. As each descriptor is constituted of two words and over, in this study the first word in the descriptor was defined as 'restrictive word' and the rest word(s) as 'target word(s)'. The results are as follows. First, the descriptors which show high frequencies of target words are the traditionally important themes of scientific literacy education. Target words which show relatively high frequency are 'education', 'literacy', 'instruction' and 'countries'. Low frequency word is 'curriculum', which has various restrictive words and represents wide differentiation. Second, among the descriptors which show low frequencies of target words, relatively high frequency descriptors are '(and)society', 'change', 'secondary education', 'concepts', and 'biology', which have been given more attention in scientific literacy research than the rest descriptors. Third, the number of the descriptors that shows largely distributed pattern A, which happens over 15 years continuously, is over the half of all analyzed descriptors, which shows that they have been the major objectives in researches about scientific literacy. Most descriptors of pattern A shows normal distribution of frequency or the trends of increasing frequency as the time is nearer. Fourth, The descriptors are divided into four groups according to the time span. Each research trends are as follows. In later 80s, the research which emphasizes the importance of the sociality and technology in all level school science curriculum. In later 90s the research for educational change of inquiry-centered science curriculum which considers technological literacy in social contexts. In earlier 2000s the research that scientists and science teachers develop science curricula mostly related to scientific principles and thinking in chemistry and biology especially. In later 2000s case studies which relates teaching methods and science process activities to students' attitudes, scientific concepts and curricula.

  • PDF

단어빈도와 단어규칙성 효과에 기초한 합성음 평가 (The Text-to-Speech System Assessment Based on Word Frequency and Word Regularity Effects)

  • 남기춘;최원일;김충명;최양규;김종진
    • 대한음성학회지:말소리
    • /
    • 제53호
    • /
    • pp.61-74
    • /
    • 2005
  • In the present study, the intelligibility of the synthesized speech sounds was evaluated by using the psycholinguistic and fMRI techniques. In order to see the difference in recognizing words between the natural and synthesized speech sounds, word regularity and word frequency were varied. The results of Experiment1 and Experiment2 showed that the intelligibility difference of the synthesized speech comes from word regularity. In the case of the synthesized speech, the regular words were recognized slower than the irregular words, and there was smaller activation of the auditory areas in brain for the regular words than for the irregular words.

  • PDF

단어 빈도와 음절 이웃 크기가 한국어 명사의 음성 분절에 미치는 영향 (The Effect of Word Frequency and Neighborhood Density on Spoken Word Segmentation in Korean)

  • 송진영;남기춘;구민모
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.3-20
    • /
    • 2012
  • The purpose of this study was to investigate whether a segmentation unit for a Korean noun is a 'syllable' and whether the process of segmenting spoken words occurs at the lexical level. A syllable monitoring task was administered which required participants to detect an auditorily presented target from visually presented words. In Experiment 1, syllable neighborhood density of high frequency words which can be segmented into both CV-CVC and CVC-VC were controlled. The syllable effect and the neighborhood density effect were significant, and the syllable effect emerged differently depending on the syllable neighborhood density. Similar results were obtained in Experiment 2 where low frequency words were used. The significance of word frequency effect on syllable effect was also examined. The results of Experiments 1 and 2 indicated that the segmentation unit for a Korean noun is indeed a 'syllable', and this process can occur at the lexical level.