• Title/Summary/Keyword: high frequency words

Search Result 273, Processing Time 0.028 seconds

Effects of Word Frequency on a Lenition Process: Evidence from Stop Voicing and /h/ Reduction in Korean

  • Choi, Tae-Hwan;Lim, Nam-Sil;Han, Jeong-Im
    • Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.35-48
    • /
    • 2006
  • The present study examined whether words with higher frequency have more exposure to the lenition process such as intervocalic stop voicing or /h/ reduction in the production of the Korean speakers. Experiment 1 and Experiment 2 tested if word-internal intervocalic voicing and /h/ reduction occur more often in the words with higher frequency than less frequent words respectively. Results showed that the rate of voicing was not significantly different between the high frequency group and the low frequency group; rather both high and low frequency words were shown to be fully voiced in this prosodic position. However, intervocalic /h/s were deleted more in high frequency words than in low frequency words. Low frequency words showed that other phonetic variants such as [h] and [w] were found more often than in high frequency group. Thus the results of the present study are indefinitive as to the relationship between the word frequency and lenition with the data at hand.

  • PDF

Analysis of Research Trends in Journal of Korean Society for Quality Management by Text Mining Processing (텍스트 마이닝 처리로 품질경영학회지 연구동향 분석)

  • Ree, Sangbok
    • Journal of Korean Society for Quality Management
    • /
    • v.47 no.3
    • /
    • pp.597-613
    • /
    • 2019
  • Purpose: The purpose of this study is to analyze the trend of quality research by analyzing the entire JKSQM(Journal of the Korean Society for Quality Management). Methods: This study is to analyze the frequency of words used in the abstract of the all JKSQM by applying the text mining processing. We use wordcrowd among text mining techniques. Results: 22 words of high frequency were presented in the abstract of the paper published in the JKSQM for 42 years. The frequency of words was shown on a 10 year basis, and the four important words were plotted on a change graph for each Vol. Frequent words of each Vol. are added in the appendix. Conclusion: The main research results are as follows. First, there has been no significant change in research trends over the last 40 years. Second, the early SQC words have been widely used, and since 1990, many words such as service-oriented words have been used, indicating a change in the times. Third, the use of the words of the 4th industrial revolution since 2010 is weak. In the above analysis, the trend of quality research in Korea is within the quality category and can be considered conservative. Now, it is expected that everything will be changed in the period of the 4th Industrial Revolution, and it is time to study the direction of quality in Korea.

English vowel production conditioned by probabilistic accessibility of words: A comparison between L1 and L2 speakers

  • Jonny Jungyun Kim;Mijung Lee
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.1-7
    • /
    • 2023
  • This study investigated the influences of probabilistic accessibility of the word being produced - as determined by its usage frequency and neighborhood density - on native and high-proficiency L2 speakers' realization of six English monophthong vowels. The native group hyperarticulated the vowels over an expanded acoustic space when the vowel occurred in words with low frequency and high density, supporting the claim that vowel forms are modified in accordance with the probabilistic accessibility of words. However, temporal expansion occurred in words with greater accessibility (i.e., with high frequency and low density) as an effect of low phonotactic probability in low-density words, particularly in attended speech. This suggests that temporal modification in the opposite direction may be part of the phonetic characteristics that are enhanced in communicatively driven focus realization. Conversely, none of these spectral and temporal patterns were found in the L2 group, thereby indicating that even the high-proficiency L2 speakers may not have developed experience-based sensitivity to the modulation of sub-categorical phonetic details indexed with word-level probabilistic information. The results are discussed with respect to how phonological representations are shaped in a word-specific manner for the sake of communicatively driven lexical intelligibility, and what factors may contribute to the lack of native-like sensitivity in L2 speech.

A Study of the Kinds and Frequency Characteristics of Descriptors in the Articles Related to Scientific Literacy (과학적 소양 관련 논문에서 서술자의 종류와 빈도 특성 연구)

  • Lee, Myeong-Je
    • Journal of Korean Elementary Science Education
    • /
    • v.29 no.4
    • /
    • pp.401-413
    • /
    • 2010
  • This study analyzed the kinds and frequencies of descriptors in 154 articles in ERIC data base on the 4th day of January in 2010. The titles of the articles includes the words, 'scientific literacy'. As each descriptor is constituted of two words and over, in this study the first word in the descriptor was defined as 'restrictive word' and the rest word(s) as 'target word(s)'. The results are as follows. First, the descriptors which show high frequencies of target words are the traditionally important themes of scientific literacy education. Target words which show relatively high frequency are 'education', 'literacy', 'instruction' and 'countries'. Low frequency word is 'curriculum', which has various restrictive words and represents wide differentiation. Second, among the descriptors which show low frequencies of target words, relatively high frequency descriptors are '(and)society', 'change', 'secondary education', 'concepts', and 'biology', which have been given more attention in scientific literacy research than the rest descriptors. Third, the number of the descriptors that shows largely distributed pattern A, which happens over 15 years continuously, is over the half of all analyzed descriptors, which shows that they have been the major objectives in researches about scientific literacy. Most descriptors of pattern A shows normal distribution of frequency or the trends of increasing frequency as the time is nearer. Fourth, The descriptors are divided into four groups according to the time span. Each research trends are as follows. In later 80s, the research which emphasizes the importance of the sociality and technology in all level school science curriculum. In later 90s the research for educational change of inquiry-centered science curriculum which considers technological literacy in social contexts. In earlier 2000s the research that scientists and science teachers develop science curricula mostly related to scientific principles and thinking in chemistry and biology especially. In later 2000s case studies which relates teaching methods and science process activities to students' attitudes, scientific concepts and curricula.

  • PDF

The Syllable Type and Token Frequency Effect in Naming Task (명명 과제에서 음절 토큰 및 타입 빈도 효과)

  • Kwon, Youan
    • Korean Journal of Cognitive Science
    • /
    • v.25 no.2
    • /
    • pp.91-107
    • /
    • 2014
  • The syllable frequency effect is defined as the inhibitory effect that words starting with high frequency syllable generate a longer lexical decision latency and a larger error rate than words starting with low frequency syllable do. Researchers agree that the reason of the inhibitory effect is the interference from syllable neighbors sharing a target's first syllable at the lexical level and the degree of the interference effect correlates with the number of syllable neighbors or stronger syllable neighbors which have a higher word frequency. However, although the syllable frequency can be classified as the syllable type and token frequency, previous studies in visual word recognition have used the syllable frequency without the classification. Recently Conrad, Carreiras, & Jacobs (2008) demonstrated that the syllable type frequency might reflect a sub-lexical processing level including matching from letters to syllables and the syllable token frequency might reflect competitions between a target and higher frequency words of syllable neighbors in the whole word lexical processing level. Therefore, the present study investigated their proposals using word naming tasks. Generally word naming tasks are more sensitive to sub-lexical processing. Thus, the present study expected a facilitative effect of high syllable type frequency and a null effect of high syllable token frequency. In Experiment 1, words starting with high syllable type frequency generated a faster naming latency than words starting with low syllable type frequency with holding syllable token frequency of them. In Experiment 2, high syllable token frequency also created a shorter naming time than low syllable token frequency with holding their syllable type frequency. For that reason, we rejected the propose of Conrad et al. and suggested that both type and token syllable frequency could relate to the sub-lexical processing.

A Study on the Diachronic Evolution of Ancient Chinese Vocabulary Based on a Large-Scale Rough Annotated Corpus

  • Yuan, Yiguo;Li, Bin
    • Asia Pacific Journal of Corpus Research
    • /
    • v.2 no.2
    • /
    • pp.31-41
    • /
    • 2021
  • This paper makes a quantitative analysis of the diachronic evolution of ancient Chinese vocabulary by constructing and counting a large-scale rough annotated corpus. The texts from Si Ku Quan Shu (a collection of Chinese ancient books) are automatically segmented to obtain ancient Chinese vocabulary with time information, which is used to the statistics on word frequency, standardized type/token ratio and proportion of monosyllabic words and dissyllabic words. Through data analysis, this study has the following four findings. Firstly, the high-frequency words in ancient Chinese are stable to a certain extent. Secondly, there is no obvious dissyllabic trend in ancient Chinese vocabulary. Moreover, the Northern and Southern Dynasties (420-589 AD) and Yuan Dynasty (1271-1368 AD) are probably the two periods with the most abundant vocabulary in ancient Chinese. Finally, the unique words with high frequency in each dynasty are mainly official titles with real power. These findings break away from qualitative methods used in traditional researches on Chinese language history and instead uses quantitative methods to draw macroscopic conclusions from large-scale corpus.

Comparative Analysis in Perception of Retro Fashion and New-tro Fashion Using Big Data (빅 데이터를 활용한 레트로 패션과 뉴트로 패션에 대한 인식 비교)

  • Kyung Ja Paek;Jeong-Mee Kim
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.25 no.1
    • /
    • pp.83-96
    • /
    • 2023
  • The purpose of this study is to compare and analyze the perception of retro fashion and new-tro fashion using big data. TEXTOM allowed the collection of big data on the words 'retro fashion' and 'new-tro fashion', which was refined afterwards. As for the data collection period, Jan. 1, 2019 to Nov. 30, 2022 was set. A top 50 list of words were extracted from this data based on appearance frequency. The extracted words were processed through Network centrality analysis and CONCOR analysis using Ucinet 6. The results are as follows. 1) In retro fashion, the appearance frequency of 'style' was the highest, followed by 'sensibility', 'color', 'trend', 'fashion', and 'brand'. These words came up with high TF-IDF values. Network centrality analysis discovered that 'color', 'style', 'trend', 'sensibility', and 'design' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; trends, styles, looks, and photos. 2) In new-tro fashion, the appearance frequency of 'retro' was the highest, followed by 'trend', 'generation', 'style', 'brand', and 'fashion'. These words also came up with high TF-IDF values. Network centrality analysis found that 'retro', 'trend', 'generation', and 'brand' had high level of connectivity with other words. CONCOR analysis showed a total of four significant groups; style, brand, clothing, and trend. 3) New-tro fashion is included in retro fashion in that it reproduces the styles of the past. However, it is taken completely differently from generation to generation. Unlike the older generations, millennials actively accept newly created clothes and brands based on the past styles. They perceive it as a fashion that reveals their own unique tastes and tastes.

Correlation between the frequency of word and the deletion of segment (우리말 어휘빈도 정보와 분절음 탈락의 관련성에 대한 연구)

  • Cha Jae-Eun
    • MALSORI
    • /
    • no.47
    • /
    • pp.1-13
    • /
    • 2003
  • The purpose of this paper is to research the correlation between frequency and the deletion of /w, (equation omitted)/ in Korean. For this purpose, I select 11 words from the frequency data, then, analyze the speech of 20 speakers of standard Korean. As a result, I can find that there is correlation between the frequency and the deletion rate of segment. The rate of deletion is higher in high frequency words, while the rate of realization is higher in low frequency words. Although there is correlation between the frequency and the deletion rate of segment, the feature of segment, prosodic environments are more important in segment deletion.

  • PDF

The Effect of Word Frequency and Neighborhood Density on Spoken Word Segmentation in Korean (단어 빈도와 음절 이웃 크기가 한국어 명사의 음성 분절에 미치는 영향)

  • Song, Jin-Young;Nam, Ki-Chun;Koo, Min-Mo
    • Phonetics and Speech Sciences
    • /
    • v.4 no.2
    • /
    • pp.3-20
    • /
    • 2012
  • The purpose of this study was to investigate whether a segmentation unit for a Korean noun is a 'syllable' and whether the process of segmenting spoken words occurs at the lexical level. A syllable monitoring task was administered which required participants to detect an auditorily presented target from visually presented words. In Experiment 1, syllable neighborhood density of high frequency words which can be segmented into both CV-CVC and CVC-VC were controlled. The syllable effect and the neighborhood density effect were significant, and the syllable effect emerged differently depending on the syllable neighborhood density. Similar results were obtained in Experiment 2 where low frequency words were used. The significance of word frequency effect on syllable effect was also examined. The results of Experiments 1 and 2 indicated that the segmentation unit for a Korean noun is indeed a 'syllable', and this process can occur at the lexical level.

Analysis of Mission, Vision and Core values in Korean Tertiary General Hospitals Through Text Mining (텍스트 마이닝을 통한 상급종합병원의 미션, 비전, 핵심가치 분석 연구)

  • Ji-Hoon Lee
    • Korea Journal of Hospital Management
    • /
    • v.28 no.2
    • /
    • pp.32-43
    • /
    • 2023
  • Purposes: This research is conducted to identify main features and trends of mission, vision and core values in Korean tertiary general hospitals by using text-mining. Methodology: For the study, 45 mission, 112 vision and 190 core values are collected from 45 tertiary general hospitals' homepages in 2022 and use word frequency analysis and Leyword co-occurrence analysis. Findings: In the tertiary general hospitals' mission, there are high frequency words such as 'health', 'humanity', 'medical treatment', 'education', 'research', 'happiness', 'love', 'best', 'spirit', and mission mainly includes the content of contributing humanity's health and happiness with these words. In case of vision, high frequency words are 'hospital', 'medical treatment', 'research', 'lead', 'trust', 'centered', 'patient', 'best', 'future'. By using these words in vision, it represents the definition and characteristics of vision such as ideal organizations in the future, goals and targets. As a result of the Leyword co-occurrence analysis, vision includes the content of 'high-tech medical treatment', 'special care for patients', 'leading education and research', 'the highest trust with customer', 'creative talents training'. -astly, the high frequency word-pairs in core values are 'social distribution', 'innovation pursuit', 'cooperation and harmony', and it defines standards of behavior for organizations. Practical Implication: To correct the problems of vision, mission and core values from findings, firstly, it needs for Korean tertiary general hospitals to use the words that can explain organization's identity and differentiate others in their mission. Secondly, considering strengthening the role of hospitals in their community and the importance of members in organizations, it is necessary to establish vision with considering community and members to activate vision effectively. Thirdly, because there are no specific guidelines of establishing mission, vision and core values for healthcare organizations, this research concepts and results could be utilized when other organizations establish mission, vision and core values.

  • PDF