• Title/Summary/Keyword: 핵심 단어 시각화

Search Result 16, Processing Time 0.019 seconds

English Bible Text Visualization Using Word Clouds and Dynamic Graphics Technology (단어 구름과 동적 그래픽스 기법을 이용한 영어성경 텍스트 시각화)

  • Jang, Dae-Heung
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.3
    • /
    • pp.373-386
    • /
    • 2014
  • A word cloud is a visualization of word frequency in a given text. The importance of each word is shown in font size or color. This plot is useful for quickly perceiving the most prominent words and for locating a word alphabetically to determine its relative prominence. With dynamic graphics, we can find the changing pattern of prominent words and their frequencies according to the changing selection of chapters in a given text. We can define the word frequency matrix. In this matrix, rows are chapters in text and columns are ranks corresponding to word frequency about the words in the text. We can draw the word frequency matrix plot with this matrix. Dynamic graphic can indicate the changing pattern of the word frequency matrix according to the changing selection of the range of ranks of words. We execute an English Bible text visualization using word clouds and dynamic graphics technology.

Analysis of Vocabulary Relations by Dimensional Reduction for Word Vectors Visualization (차원감소 단어벡터 시각화를 통한 어휘별 관계 분석)

  • Ko, Kwang-Ho;Paik, Juryon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.13-16
    • /
    • 2022
  • LSTM과 같은 딥러닝 기법을 이용해 언어모델을 얻는 과정에서 일종의 부산물로 학습 대상인 말뭉치를 구성하는 어휘의 단어벡터를 얻을 수 있다. 단어벡터의 차원을 2차원으로 감소시킨 후 이를 평면에 도시하면 대상 문장/문서의 핵심 어휘 사이의 상대적인 거리와 각도 등을 직관적으로 확인할 수 있다. 본 연구에서는 기형도의 시(詩)을 중심으로 특정 작품을 선정한 후 시를 구성하는 핵심 어휘들의 차원 감소된 단어벡터를 2D 평면에 도시하여, 단어벡터를 얻기 위한 텍스트 전처리 방식에 따라 그 거리/각도가 달라지는 양상을 분석해 보았다. 어휘 사이의 거리에 의해 군집/분류의 결과가 달라질 수 있고, 각도에 의해 유사도/유추 연산의 결과가 달라질 수 있으므로, 평면상에서 핵심 어휘들의 상대적인 거리/각도의 직관적 확인을 통해 군집/분류작업과 유사도 추천/유추 등의 작업 결과의 양상 변화를 확인할 수 있었다. 이상의 결과를 통해, 영화 추천/리뷰나 문학작품과 같이 단어 하나하나의 배치에 따라 그 분위기와 정동이 달라지는 분야의 경우 텍스트 전처리에 따른 거리/각도 변화를 미리 직관적으로 확인한다면 분류/유사도 추천과 같은 작업을 좀 더 정밀하게 수행할 수 있을 것으로 판단된다.

  • PDF

A Study on the Recognition of Population Problems of Male and Female Students using Text-mining: To Drive the Implications of Population Education (텍스트마이닝기법을 활용한 남녀 학생의 인구문제에 관한 인식 분석: 인구교육의 시사점 도출을 위하여)

  • Wang, Seok-Soon;Shim, Joon-Young
    • Journal of Korean Home Economics Education Association
    • /
    • v.31 no.3
    • /
    • pp.73-90
    • /
    • 2019
  • The purpose of this study was to explore the differences in perceptions of male and female students about population problems and to draw up implications for population education. Using text mining, the report about population problem, which had written by students in population education class, were analysed. After extracting key words, semantic networks were visualized. The results were as follows. First, the high frequency words were the same for each gender. Second, key words based on frequency did not differ depending on gender. And the key words extracted by the correlation analysis and bigram were different. That is, in the semantic network of girls' words, the network of "life"-"marriage"-"birth"-"pregnancy" appeared independently, distinguishing it from male students who showed separate objective links to population problems. Therefore, it drew suggestions that male and female students should be viewed as heterogeneous groups with different cognitive structures on population problems and that the content and methods of population education should be approached differently depending on gender.

A Study on Enhancing Emotional Engagement in Learning Situation - Based on Development Case of English Learning Serious Game 'Word Collectrian' (학습 장면에서 감정 개입을 촉진하기 위한 기능성 게임의 활용 - 단어 시각화 기반의 영어 학습용 기능성 게임 '워드 콜렉트리안' 제작 사례를 바탕으로)

  • Lee, Haksu;Doh, Young Yim
    • Journal of Korea Game Society
    • /
    • v.12 no.6
    • /
    • pp.95-106
    • /
    • 2012
  • Emotion is very important feature in educational situation. Because it has high influence to memory, educational achievement, motivation. This study tried to find out possibility of serious game as emotional engagement tool in educational situation. We did our pilot experiment to elementary school students who are english as second language. In this L2 learning situation, we did our basic experiment with English language learning serious game called 'Word Collectrian". Word Collectrian has some features for emotional engagement. It has interaction for dynamic word visualization, providing context video for word usage, putting visualized word on learner's virtual home. According to experimental result, word Collectrian has possibility for educational achievement and emotional engagement effect.

Binary Visual Word Generation Techniques for A Fast Image Search (고속 이미지 검색을 위한 2진 시각 단어 생성 기법)

  • Lee, Suwon
    • Journal of KIISE
    • /
    • v.44 no.12
    • /
    • pp.1313-1318
    • /
    • 2017
  • Aggregating local features in a single vector is a fundamental problem in an image search. In this process, the image search process can be speeded up if binary features which are extracted almost two order of magnitude faster than gradient-based features are utilized. However, in order to utilize the binary features in an image search, it is necessary to study the techniques for clustering binary features to generate binary visual words. This investigation is necessary because traditional clustering techniques for gradient-based features are not compatible with binary features. To this end, this paper studies the techniques for clustering binary features for the purpose of generating binary visual words. Through experiments, we analyze the trade-off between the accuracy and computational efficiency of an image search using binary features, and we then compare the proposed techniques. This research is expected to be applied to mobile applications, real-time applications, and web scale applications that require a fast image search.

A Proposal for Improving the Measurement and Management of Unit Water Content in In-Situ Concrete (현장 타설 콘크리트의 단위수량 측정 및 관리 개선 방안 제시)

  • Yun, Ja-yeon;Jang, Hyo-Jun;Lee, Taegyu;Choi, Hyeonggil
    • Journal of the Korea Institute of Building Construction
    • /
    • v.24 no.3
    • /
    • pp.319-329
    • /
    • 2024
  • This study examined domestic and international regulations concerning concrete unit weight, along with an evaluation of unit weight in concrete poured on construction sites. Fluctuations in unit weight were observed to correlate with concrete quality issues such as material separation, bleeding, and latency. A word cloud analysis, centered on the concept of concrete quality, further highlighted the significant influence of unit weight. Comparative analysis between Korea and Japan revealed few substantial differences in unit weight management and measurement techniques. However, calculation of concrete unit weight at delivery, using the unit volume mass method, indicated considerable variability among random on-site samples. Notably, the unit weight often exceeded the recommended standard. These findings emphasize the necessity for strict adherence to unit weight standards by all stakeholders involved in concrete production and construction, including ready-mix concrete (REMICON) producers, construction firms, and inspectors. To ensure consistent quality of cast concrete on-site, the establishment of a more comprehensive and practical system is recommended, incorporating measures such as on-site inspections.

A Study on the Use of Stopword Corpus for Cleansing Unstructured Text Data (비정형 텍스트 데이터 정제를 위한 불용어 코퍼스의 활용에 관한 연구)

  • Lee, Won-Jo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.891-897
    • /
    • 2022
  • In big data analysis, raw text data mostly exists in various unstructured data forms, so it becomes a structured data form that can be analyzed only after undergoing heuristic pre-processing and computer post-processing cleansing. Therefore, in this study, unnecessary elements are purified through pre-processing of the collected raw data in order to apply the wordcloud of R program, which is one of the text data analysis techniques, and stopwords are removed in the post-processing process. Then, a case study of wordcloud analysis was conducted, which calculates the frequency of occurrence of words and expresses words with high frequency as key issues. In this study, to improve the problems of the "nested stopword source code" method, which is the existing stopword processing method, using the word cloud technique of R, we propose the use of "general stopword corpus" and "user-defined stopword corpus" and conduct case analysis. The advantages and disadvantages of the proposed "unstructured data cleansing process model" are comparatively verified and presented, and the practical application of word cloud visualization analysis using the "proposed external corpus cleansing technique" is presented.

Development of big data based Skin Care Information System SCIS for skin condition diagnosis and management

  • Kim, Hyung-Hoon;Cho, Jeong-Ran
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.3
    • /
    • pp.137-147
    • /
    • 2022
  • Diagnosis and management of skin condition is a very basic and important function in performing its role for workers in the beauty industry and cosmetics industry. For accurate skin condition diagnosis and management, it is necessary to understand the skin condition and needs of customers. In this paper, we developed SCIS, a big data-based skin care information system that supports skin condition diagnosis and management using social media big data for skin condition diagnosis and management. By using the developed system, it is possible to analyze and extract core information for skin condition diagnosis and management based on text information. The skin care information system SCIS developed in this paper consists of big data collection stage, text preprocessing stage, image preprocessing stage, and text word analysis stage. SCIS collected big data necessary for skin diagnosis and management, and extracted key words and topics from text information through simple frequency analysis, relative frequency analysis, co-occurrence analysis, and correlation analysis of key words. In addition, by analyzing the extracted key words and information and performing various visualization processes such as scatter plot, NetworkX, t-SNE, and clustering, it can be used efficiently in diagnosing and managing skin conditions.

Text Mining of Successful Casebook of Agricultural Settlement in Graduates of Korea National College of Agriculture and Fisheries - Frequency Analysis and Word Cloud of Key Words - (한국농수산대학 졸업생 영농정착 성공 사례집의 Text Mining - 주요단어의 빈도 분석 및 word cloud -)

  • Joo, J.S.;Kim, J.S.;Park, S.Y.;Song, C.Y.
    • Journal of Practical Agriculture & Fisheries Research
    • /
    • v.20 no.2
    • /
    • pp.57-72
    • /
    • 2018
  • In order to extract meaningful information from the excellent farming settlement cases of young farmers published by KNCAF, we studied the key words with text mining and created a word cloud for visualization. First, in the text mining results for the entire sample, the words 'CEO', 'corporate executive', 'think', 'self', 'start', 'mind', and 'effort' are the words with high frequency among the top 50 core words. Their ability to think, judge and push ahead with themselves is a result of showing that they have ability of to be managers or managers. And it is a expression of how they manages to achieve their dream without giving up their dream. The high frequency of words such as "father" and "parent" is due to the high ratio of parents' cooperation and succession. Also 'KNCAF', 'university', 'graduation' and 'study' are the results of their high educational awareness, and 'organic farming' and 'eco-friendly' are the result of the interest in eco-friendly agriculture. In addition, words related to the 6th industry such as 'sales' and 'experience' represent their efforts to revitalize farming and fishing villages. Meanwhile, 'internet', 'blog', 'online', 'SNS', 'ICT', 'composite' and 'smart' were not included in the top 50. However, the fact that these words were extracted without omission shows that young farmers are increasingly interested in the scientificization and high-tech of agriculture and fisheries Next, as a result of grouping the top 50 key words by crop, the words 'facilities' in livestock, vegetables and aquatic crops, the words 'equipment' and 'machine' in food crops were extracted as main words. 'Eco-friendly' and 'organic' appeared in vegetable crops and food crops, and 'organic' appeared in fruit crops. The 'worm' of eco-friendly farming method appeared in the food crops, and the 'certification', which means excellent agricultural and marine products, appeared only in the fishery crops. 'Production', which is related to '6th industry', appeared in all crops, 'processing' and 'distribution' appeared in the fruit crops, and 'experience' appeared in the vegetable crops, food crops and fruit crops. To visualize the extracted words by text mining, we created a word cloud with the entire samples and each crop sample. As a result, we were able to judge the meaning of excellent practices, which are unstructured text, by character size.

Functional MRI of Language: Difference of its Activated Areas and Lateralization according to the Input Modality (언어의 기능적 자기공명영상: 자극방법에 따른 활성화와 편재화의 차이)

  • Ryoo, Jae-Wook;Cho, Jae-Min;Choi, Ho-Chul;Park, Mi-Jung;Choi, Hye-Young;Kim, Ji-Eun;Han, Heon;Kim, Sam-Soo;Jeon, Yong-Hwan;Khang, Hyun-Soo
    • Investigative Magnetic Resonance Imaging
    • /
    • v.15 no.2
    • /
    • pp.130-138
    • /
    • 2011
  • Purpose : To compare fMRIs of visual and auditory word generation tasks, and to evaluate the difference of its activated areas and lateralization according to the mode of stimuli. Materials and Methods : Eight male normal volunteers were included and all were right handed. Functional maps were obtained during auditory and visual word generation tasks in all. Normalized group analysis were performed in each task and the threshold for significance was set at p<0.05. Activated areas in each task were compared visually and statistically. Results : In both tasks, left dominant activations were demonstrated and were more lateralized in visual task. Both frontal lobes (Broca's area, premotor area, and SMA) and left posterior middle temporal gyrus were activated in both tasks. Extensive bilateral temporal activations were noted in auditory task. Both occipital and parietal activations were demonstrated in visual task. Conclusion : Modality independent areas could be interpreted as a core area of language function. Modality specific areas may be associated with processing of stimuli. Visual task induced more lateralized activation and could be a more useful in language study than auditory task.