• 제목/요약/키워드: high frequency words

검색결과 273건 처리시간 0.023초

Effects of Word Frequency on a Lenition Process: Evidence from Stop Voicing and /h/ Reduction in Korean

  • Choi, Tae-Hwan;Lim, Nam-Sil;Han, Jeong-Im
    • 음성과학
    • /
    • 제13권3호
    • /
    • pp.35-48
    • /
    • 2006
  • The present study examined whether words with higher frequency have more exposure to the lenition process such as intervocalic stop voicing or /h/ reduction in the production of the Korean speakers. Experiment 1 and Experiment 2 tested if word-internal intervocalic voicing and /h/ reduction occur more often in the words with higher frequency than less frequent words respectively. Results showed that the rate of voicing was not significantly different between the high frequency group and the low frequency group; rather both high and low frequency words were shown to be fully voiced in this prosodic position. However, intervocalic /h/s were deleted more in high frequency words than in low frequency words. Low frequency words showed that other phonetic variants such as [h] and [w] were found more often than in high frequency group. Thus the results of the present study are indefinitive as to the relationship between the word frequency and lenition with the data at hand.

  • PDF

텍스트 마이닝 처리로 품질경영학회지 연구동향 분석 (Analysis of Research Trends in Journal of Korean Society for Quality Management by Text Mining Processing)

  • 이상복
    • 품질경영학회지
    • /
    • 제47권3호
    • /
    • pp.597-613
    • /
    • 2019
  • Purpose: The purpose of this study is to analyze the trend of quality research by analyzing the entire JKSQM(Journal of the Korean Society for Quality Management). Methods: This study is to analyze the frequency of words used in the abstract of the all JKSQM by applying the text mining processing. We use wordcrowd among text mining techniques. Results: 22 words of high frequency were presented in the abstract of the paper published in the JKSQM for 42 years. The frequency of words was shown on a 10 year basis, and the four important words were plotted on a change graph for each Vol. Frequent words of each Vol. are added in the appendix. Conclusion: The main research results are as follows. First, there has been no significant change in research trends over the last 40 years. Second, the early SQC words have been widely used, and since 1990, many words such as service-oriented words have been used, indicating a change in the times. Third, the use of the words of the 4th industrial revolution since 2010 is weak. In the above analysis, the trend of quality research in Korea is within the quality category and can be considered conservative. Now, it is expected that everything will be changed in the period of the 4th Industrial Revolution, and it is time to study the direction of quality in Korea.

English vowel production conditioned by probabilistic accessibility of words: A comparison between L1 and L2 speakers

  • Jonny Jungyun Kim;Mijung Lee
    • 말소리와 음성과학
    • /
    • 제15권1호
    • /
    • pp.1-7
    • /
    • 2023
  • This study investigated the influences of probabilistic accessibility of the word being produced - as determined by its usage frequency and neighborhood density - on native and high-proficiency L2 speakers' realization of six English monophthong vowels. The native group hyperarticulated the vowels over an expanded acoustic space when the vowel occurred in words with low frequency and high density, supporting the claim that vowel forms are modified in accordance with the probabilistic accessibility of words. However, temporal expansion occurred in words with greater accessibility (i.e., with high frequency and low density) as an effect of low phonotactic probability in low-density words, particularly in attended speech. This suggests that temporal modification in the opposite direction may be part of the phonetic characteristics that are enhanced in communicatively driven focus realization. Conversely, none of these spectral and temporal patterns were found in the L2 group, thereby indicating that even the high-proficiency L2 speakers may not have developed experience-based sensitivity to the modulation of sub-categorical phonetic details indexed with word-level probabilistic information. The results are discussed with respect to how phonological representations are shaped in a word-specific manner for the sake of communicatively driven lexical intelligibility, and what factors may contribute to the lack of native-like sensitivity in L2 speech.

과학적 소양 관련 논문에서 서술자의 종류와 빈도 특성 연구 (A Study of the Kinds and Frequency Characteristics of Descriptors in the Articles Related to Scientific Literacy)

  • 이명제
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제29권4호
    • /
    • pp.401-413
    • /
    • 2010
  • This study analyzed the kinds and frequencies of descriptors in 154 articles in ERIC data base on the 4th day of January in 2010. The titles of the articles includes the words, 'scientific literacy'. As each descriptor is constituted of two words and over, in this study the first word in the descriptor was defined as 'restrictive word' and the rest word(s) as 'target word(s)'. The results are as follows. First, the descriptors which show high frequencies of target words are the traditionally important themes of scientific literacy education. Target words which show relatively high frequency are 'education', 'literacy', 'instruction' and 'countries'. Low frequency word is 'curriculum', which has various restrictive words and represents wide differentiation. Second, among the descriptors which show low frequencies of target words, relatively high frequency descriptors are '(and)society', 'change', 'secondary education', 'concepts', and 'biology', which have been given more attention in scientific literacy research than the rest descriptors. Third, the number of the descriptors that shows largely distributed pattern A, which happens over 15 years continuously, is over the half of all analyzed descriptors, which shows that they have been the major objectives in researches about scientific literacy. Most descriptors of pattern A shows normal distribution of frequency or the trends of increasing frequency as the time is nearer. Fourth, The descriptors are divided into four groups according to the time span. Each research trends are as follows. In later 80s, the research which emphasizes the importance of the sociality and technology in all level school science curriculum. In later 90s the research for educational change of inquiry-centered science curriculum which considers technological literacy in social contexts. In earlier 2000s the research that scientists and science teachers develop science curricula mostly related to scientific principles and thinking in chemistry and biology especially. In later 2000s case studies which relates teaching methods and science process activities to students' attitudes, scientific concepts and curricula.

  • PDF

명명 과제에서 음절 토큰 및 타입 빈도 효과 (The Syllable Type and Token Frequency Effect in Naming Task)

  • 권유안
    • 인지과학
    • /
    • 제25권2호
    • /
    • pp.91-107
    • /
    • 2014
  • 음절 빈도 효과란 고빈도 음절로 시작되는 단어가 저빈도 음절로 시작되는 단어에 비해 어휘 판단 속도가 느리며 어휘 판단 오류율도 증가하는 효과를 의미한다. 이 효과를 유발하는 원인은 전체 단어 수준에서 활성화된 음절 이웃 단어의 방해로 알려져 있으며 이 방해의 크기는 표적 단어가 얼마나 많은 음절 이웃 단어를 또는 얼마나 강력한 음절 이웃 단어를 가지고 있는지에 의해 결정된다. 그러나 음절 빈도의 정의가 음절 타입 빈도와 토큰 빈도로 구분됨에도 불구하고 이를 구분하지 않고 많은 연구들이 수행되어 왔다. 최근 Conrad, Carreiras, & Jacobs(2008)에 따르면 음절 토큰 빈도는 전체 단어 처리 수준을 반영하는 변인이며 음절 타입 빈도는 하위 어휘 처리 수준의 음절 처리 수준을 반영하는 변인일 수 있다고 주장하였다. 이에 본 연구는 이들의 주장이 맞다면 음절 타입 빈도는 단어 명명 속도를 촉진 시킬 것이며 반대로 음절 토큰 빈도는 명명 시간과 관련 없을 것이라고 예측하였다. 왜냐하면 표기 심도가 얕고 음절의 경계가 명확한 언어에서 명명 과제는 전체 단어수준을 덜 참고하기 때문이었다. 실험 1결과에서 음절 토큰 빈도를 통제한 상태에서 고빈도 타입음절의 단어 명명 시간은 유의미하게 짧았다. 실험 2에서 음절 타입 빈도를 통제한 상태에서 음절토큰 빈도의 증가는 명명 시간을 역시 단축시켰다. 이에 본 연구는 음절 토큰 빈도가 하위 어휘 처리와 무관하다는 Conrad, Carreiras, & Jacobs(2008)의 주장을 반박하였다.

A Study on the Diachronic Evolution of Ancient Chinese Vocabulary Based on a Large-Scale Rough Annotated Corpus

  • Yuan, Yiguo;Li, Bin
    • 아시아태평양코퍼스연구
    • /
    • 제2권2호
    • /
    • pp.31-41
    • /
    • 2021
  • This paper makes a quantitative analysis of the diachronic evolution of ancient Chinese vocabulary by constructing and counting a large-scale rough annotated corpus. The texts from Si Ku Quan Shu (a collection of Chinese ancient books) are automatically segmented to obtain ancient Chinese vocabulary with time information, which is used to the statistics on word frequency, standardized type/token ratio and proportion of monosyllabic words and dissyllabic words. Through data analysis, this study has the following four findings. Firstly, the high-frequency words in ancient Chinese are stable to a certain extent. Secondly, there is no obvious dissyllabic trend in ancient Chinese vocabulary. Moreover, the Northern and Southern Dynasties (420-589 AD) and Yuan Dynasty (1271-1368 AD) are probably the two periods with the most abundant vocabulary in ancient Chinese. Finally, the unique words with high frequency in each dynasty are mainly official titles with real power. These findings break away from qualitative methods used in traditional researches on Chinese language history and instead uses quantitative methods to draw macroscopic conclusions from large-scale corpus.

빅 데이터를 활용한 레트로 패션과 뉴트로 패션에 대한 인식 비교 (Comparative Analysis in Perception of Retro Fashion and New-tro Fashion Using Big Data)

  • 백경자;김정미
    • 한국의상디자인학회지
    • /
    • 제25권1호
    • /
    • pp.83-96
    • /
    • 2023
  • The purpose of this study is to compare and analyze the perception of retro fashion and new-tro fashion using big data. TEXTOM allowed the collection of big data on the words 'retro fashion' and 'new-tro fashion', which was refined afterwards. As for the data collection period, Jan. 1, 2019 to Nov. 30, 2022 was set. A top 50 list of words were extracted from this data based on appearance frequency. The extracted words were processed through Network centrality analysis and CONCOR analysis using Ucinet 6. The results are as follows. 1) In retro fashion, the appearance frequency of 'style' was the highest, followed by 'sensibility', 'color', 'trend', 'fashion', and 'brand'. These words came up with high TF-IDF values. Network centrality analysis discovered that 'color', 'style', 'trend', 'sensibility', and 'design' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; trends, styles, looks, and photos. 2) In new-tro fashion, the appearance frequency of 'retro' was the highest, followed by 'trend', 'generation', 'style', 'brand', and 'fashion'. These words also came up with high TF-IDF values. Network centrality analysis found that 'retro', 'trend', 'generation', and 'brand' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; style, brand, clothing, and trend. 3) New-tro fashion is included in retro fashion in that it reproduces the styles of the past. However, it is taken completely differently from generation to generation. Unlike the older generations, millennials actively accept newly created clothes and brands based on the past styles. They perceive it as a fashion that reveals their own unique tastes and tastes.

우리말 어휘빈도 정보와 분절음 탈락의 관련성에 대한 연구 (Correlation between the frequency of word and the deletion of segment)

  • 차재은
    • 대한음성학회지:말소리
    • /
    • 제47호
    • /
    • pp.1-13
    • /
    • 2003
  • The purpose of this paper is to research the correlation between frequency and the deletion of /w, (equation omitted)/ in Korean. For this purpose, I select 11 words from the frequency data, then, analyze the speech of 20 speakers of standard Korean. As a result, I can find that there is correlation between the frequency and the deletion rate of segment. The rate of deletion is higher in high frequency words, while the rate of realization is higher in low frequency words. Although there is correlation between the frequency and the deletion rate of segment, the feature of segment, prosodic environments are more important in segment deletion.

  • PDF

단어 빈도와 음절 이웃 크기가 한국어 명사의 음성 분절에 미치는 영향 (The Effect of Word Frequency and Neighborhood Density on Spoken Word Segmentation in Korean)

  • 송진영;남기춘;구민모
    • 말소리와 음성과학
    • /
    • 제4권2호
    • /
    • pp.3-20
    • /
    • 2012
  • The purpose of this study was to investigate whether a segmentation unit for a Korean noun is a 'syllable' and whether the process of segmenting spoken words occurs at the lexical level. A syllable monitoring task was administered which required participants to detect an auditorily presented target from visually presented words. In Experiment 1, syllable neighborhood density of high frequency words which can be segmented into both CV-CVC and CVC-VC were controlled. The syllable effect and the neighborhood density effect were significant, and the syllable effect emerged differently depending on the syllable neighborhood density. Similar results were obtained in Experiment 2 where low frequency words were used. The significance of word frequency effect on syllable effect was also examined. The results of Experiments 1 and 2 indicated that the segmentation unit for a Korean noun is indeed a 'syllable', and this process can occur at the lexical level.

텍스트 마이닝을 통한 상급종합병원의 미션, 비전, 핵심가치 분석 연구 (Analysis of Mission, Vision and Core values in Korean Tertiary General Hospitals Through Text Mining)

  • 이지훈
    • 한국병원경영학회지
    • /
    • 제28권2호
    • /
    • pp.32-43
    • /
    • 2023
  • Purposes: This research is conducted to identify main features and trends of mission, vision and core values in Korean tertiary general hospitals by using text-mining. Methodology: For the study, 45 mission, 112 vision and 190 core values are collected from 45 tertiary general hospitals' homepages in 2022 and use word frequency analysis and Leyword co-occurrence analysis. Findings: In the tertiary general hospitals' mission, there are high frequency words such as 'health', 'humanity', 'medical treatment', 'education', 'research', 'happiness', 'love', 'best', 'spirit', and mission mainly includes the content of contributing humanity's health and happiness with these words. In case of vision, high frequency words are 'hospital', 'medical treatment', 'research', 'lead', 'trust', 'centered', 'patient', 'best', 'future'. By using these words in vision, it represents the definition and characteristics of vision such as ideal organizations in the future, goals and targets. As a result of the Leyword co-occurrence analysis, vision includes the content of 'high-tech medical treatment', 'special care for patients', 'leading education and research', 'the highest trust with customer', 'creative talents training'. -astly, the high frequency word-pairs in core values are 'social distribution', 'innovation pursuit', 'cooperation and harmony', and it defines standards of behavior for organizations. Practical Implication: To correct the problems of vision, mission and core values from findings, firstly, it needs for Korean tertiary general hospitals to use the words that can explain organization's identity and differentiate others in their mission. Secondly, considering strengthening the role of hospitals in their community and the importance of members in organizations, it is necessary to establish vision with considering community and members to activate vision effectively. Thirdly, because there are no specific guidelines of establishing mission, vision and core values for healthcare organizations, this research concepts and results could be utilized when other organizations establish mission, vision and core values.

  • PDF