• Title/Summary/Keyword: words

Search Result 9,087, Processing Time 0.048 seconds

Word Frequency-Based Big Data Analysis for the Annals of the Joseon Dynasty (조선왕조실록 분석을 위한 단어 빈도수 기반 빅 데이터 분석)

  • Bong, Young-Il;Lee, Choong-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.707-709
    • /
    • 2022
  • Annals of the Joseon Dynasty is a librarian that compiled the history of the Joseon Dynasty for 472 years, from Taejo to Cheoljong. The Annals of the Joseon Dynasty, National Treasure No. 151, are important documented heritages, but they are difficult to analyze due to their vast content. Therefore, rather than analyzing all the contents of the Annals of the Joseon Dynasty, it is necessary to extract and analyze important words. In this paper, we propose a method of extracting words from the main body of the Annals of the Joseon Dynasty through web crawling and analyzing the translated texts of the Annals of the Joseon Dynasty based on the data sorted according to the frequency of words. In this study, only the part of King Sejong of the Annals of the Joseon Dynasty was extracted and the importance was analyzed according to the frequency of words.

  • PDF

English vowel production conditioned by probabilistic accessibility of words: A comparison between L1 and L2 speakers

  • Jonny Jungyun Kim;Mijung Lee
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.1-7
    • /
    • 2023
  • This study investigated the influences of probabilistic accessibility of the word being produced - as determined by its usage frequency and neighborhood density - on native and high-proficiency L2 speakers' realization of six English monophthong vowels. The native group hyperarticulated the vowels over an expanded acoustic space when the vowel occurred in words with low frequency and high density, supporting the claim that vowel forms are modified in accordance with the probabilistic accessibility of words. However, temporal expansion occurred in words with greater accessibility (i.e., with high frequency and low density) as an effect of low phonotactic probability in low-density words, particularly in attended speech. This suggests that temporal modification in the opposite direction may be part of the phonetic characteristics that are enhanced in communicatively driven focus realization. Conversely, none of these spectral and temporal patterns were found in the L2 group, thereby indicating that even the high-proficiency L2 speakers may not have developed experience-based sensitivity to the modulation of sub-categorical phonetic details indexed with word-level probabilistic information. The results are discussed with respect to how phonological representations are shaped in a word-specific manner for the sake of communicatively driven lexical intelligibility, and what factors may contribute to the lack of native-like sensitivity in L2 speech.

Four-Year-Old Children's Counting Skills and Their Mothers' Use of Number Words: The Mediating Role of Children's Number Word Use (4세 유아의 수세기 기술과 어머니의 수 단어 사용: 유아 수 단어 사용의 매개효과)

  • Jihyeon Park;Youjeong Park;Yujin Lee;Sunjung Baik;Sukyoung Choe
    • Korean Journal of Childcare and Education
    • /
    • v.19 no.6
    • /
    • pp.79-95
    • /
    • 2023
  • Objective: This study examines the relationships among four-year-olds' counting skills, their use of number words, and their mothers' use of number words during mother-child free play. Specifically, we assess whether children's use of number words mediates the relationship between their counting skills and their mothers' use of number words during play. Methods: Forty-two 4-year-old children and their mothers were asked to play freely with a given set of toys at their home for 10 minutes. Children also completed a counting skill test. Frequencies of number word use were calculated for mothers and children from transcriptions of the free play. Results: Children's counting skills, the frequency of their number word use, and their mothers' frequency of number word use were positively correlated with each other. Additionally, the frequency of children's number-word use completely mediated the relationship between their counting skills and their mothers' frequency of number-word use. Conclusion/Implications: The results suggest that children's use of number language may play a crucial role in the provision of number-related language input by parents, based on their children's math skills. Practical implications of the findings are discussed.

Text Mining of Successful Casebook of Agricultural Settlement in Graduates of Korea National College of Agriculture and Fisheries - Frequency Analysis and Word Cloud of Key Words - (한국농수산대학 졸업생 영농정착 성공 사례집의 Text Mining - 주요단어의 빈도 분석 및 word cloud -)

  • Joo, J.S.;Kim, J.S.;Park, S.Y.;Song, C.Y.
    • Journal of Practical Agriculture & Fisheries Research
    • /
    • v.20 no.2
    • /
    • pp.57-72
    • /
    • 2018
  • In order to extract meaningful information from the excellent farming settlement cases of young farmers published by KNCAF, we studied the key words with text mining and created a word cloud for visualization. First, in the text mining results for the entire sample, the words 'CEO', 'corporate executive', 'think', 'self', 'start', 'mind', and 'effort' are the words with high frequency among the top 50 core words. Their ability to think, judge and push ahead with themselves is a result of showing that they have ability of to be managers or managers. And it is a expression of how they manages to achieve their dream without giving up their dream. The high frequency of words such as "father" and "parent" is due to the high ratio of parents' cooperation and succession. Also 'KNCAF', 'university', 'graduation' and 'study' are the results of their high educational awareness, and 'organic farming' and 'eco-friendly' are the result of the interest in eco-friendly agriculture. In addition, words related to the 6th industry such as 'sales' and 'experience' represent their efforts to revitalize farming and fishing villages. Meanwhile, 'internet', 'blog', 'online', 'SNS', 'ICT', 'composite' and 'smart' were not included in the top 50. However, the fact that these words were extracted without omission shows that young farmers are increasingly interested in the scientificization and high-tech of agriculture and fisheries Next, as a result of grouping the top 50 key words by crop, the words 'facilities' in livestock, vegetables and aquatic crops, the words 'equipment' and 'machine' in food crops were extracted as main words. 'Eco-friendly' and 'organic' appeared in vegetable crops and food crops, and 'organic' appeared in fruit crops. The 'worm' of eco-friendly farming method appeared in the food crops, and the 'certification', which means excellent agricultural and marine products, appeared only in the fishery crops. 'Production', which is related to '6th industry', appeared in all crops, 'processing' and 'distribution' appeared in the fruit crops, and 'experience' appeared in the vegetable crops, food crops and fruit crops. To visualize the extracted words by text mining, we created a word cloud with the entire samples and each crop sample. As a result, we were able to judge the meaning of excellent practices, which are unstructured text, by character size.

Speech Verification using Similar Word Information in Isolated Word Recognition (고립단어 인식에 유사단어 정보를 이용한 단어의 검증)

  • 백창흠;이기정홍재근
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1255-1258
    • /
    • 1998
  • Hidden Markov Model (HMM) is the most widely used method in speech recognition. In general, HMM parameters are trained to have maximum likelihood (ML) for training data. This method doesn't take account of discrimination to other words. To complement this problem, this paper proposes a word verification method by re-recognition of the recognized word and its similar word using the discriminative function between two words. The similar word is selected by calculating the probability of other words to each HMM. The recognizer haveing discrimination to each word is realized using the weighting to each state and the weighting is calculated by genetic algorithm.

  • PDF

A Study on the Duration of Korean medial fortis by Japanese Speakers (일본인 학습자의 국어 어중 경음 지속 시간 연구)

  • Noh, Seok-Eun
    • Proceedings of the KSPS conference
    • /
    • 2005.04a
    • /
    • pp.67-70
    • /
    • 2005
  • The purpose of this paper is the comparison of the Korean medial fortis duration between Korean native speaker and Japanese native speaker who study Korean language. For this purpose, I selected words with medial fortis from the SITEC DB. The Korean medial fortis of Japanese tends to have longer closure/friction duration than Korean native speakers in 3 syllables words. There are no distinct differences in 2 syllables words. This might be owing to the different timing unit of Korean and Japanese.

  • PDF

Prosodic Conditions for Epenthetic Nasals

  • Kim, Soo-Jung
    • Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.123-148
    • /
    • 2000
  • This paper investigates prosodic conditions for the epenthetic /n/ in Korean. It has been claimed that an epenthetic /n/ appears across prosodic words (Han 1994, Lee 1996). However, using acoustic data as well as aerodynamic data, I argue that the epenthetic /n/ does not always surface across all prosodic words, but that its appearance is prosodically restricted. I further demonstrate that it appears only across prosodic words within an accentual phrase. This finding provides empirical support for the intonation-based model of Korean prosodic structure studies.

  • PDF

A STUDY ON THE RECOGNITION OF SPOKEN KOREAN LOCAL-NAMES USING SPATIO TEMPORAL

  • Song, Do-Sun;Kim, Suk-Dong;Lee, Haing-Sei
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.1003-1008
    • /
    • 1994
  • This paper is about an experiment of speaker-independent automation Korean spoken words recognition using Multi-Layered Perceptron and Error Back-propagation algorithm. The words were not segmented into syllables or phonemes, and some feature components extracted from the words in equal gap were applied to the neural network. This paper tried to find out the optimum conditions through various experiment which are comparison between total and pre-classified training.

  • PDF

Classification of Keywords of the papers from the Journal of Korean Academy of Nursing Administration(2002-2006) (간호행정학회지 게재논문 주요어 분석(2002년${\sim}$2006년))

  • Seomun, Gyeong-Ae;Kim, In-A;Koh, Myung-Suk
    • Journal of Korean Academy of Nursing Administration
    • /
    • v.13 no.1
    • /
    • pp.118-122
    • /
    • 2007
  • Purpose: This study was to understand the major subjects of the recent nursing research in Nursing administration from keywords. Method: Keywords of journals were extracted and the frequency of the appearance of each key words was sorted by a descending order. Results: A total of 327 key words were used. The most frequently used key words were 'Job satisfaction', 'Organizational commitment', 'Leadership'. Out of them, organizational culture, nursing performance, nursing classification, patient satisfaction, and ethics appeared most frequently in descending order. Conclusion: From the above it can be noted that many nursing administration concepts were handled in the papers. But there were not enough papers on the characteristics of the Nursing administration. It is suggested that in depth research be made on 'Nursing error', 'Nursing informatics', 'Web based learning'.

  • PDF

Classification of Documents using Automatic Indexing (자동 색인을 이용한 문서의 분류)

  • 신진섭;장수진
    • Journal of the Korea Society of Computer and Information
    • /
    • v.4 no.1
    • /
    • pp.21-27
    • /
    • 1999
  • In this paper. we propose a new method for automatic classification of documents using the degree of similarity between words. First, we seek relevance terms using automatic indexing. Second, we found frequency in use words in documents and the degree of relevance between the words using probability model. Continuously, we extracted the set of words which is connected the relevance closely and created the profiles characterizing each classification And, with the profile we finally classified them. We experimented on classifying two groups of documents. Some documents were about Genetic Algorithm. The others were about Neural Network. The results of the experiments indicated that automatic classification with word accordance of degree enable us to manage the retrieved documents structurally.

  • PDF