• Title/Summary/Keyword: 단어빈도

Search Result 542, Processing Time 0.029 seconds

Analyzing Different Contexts for Energy Terms through Text Mining of Online Science News Articles (온라인 과학 기사 텍스트 마이닝을 통해 분석한 에너지 용어 사용의 맥락)

  • Oh, Chi Yeong;Kang, Nam-Hwa
    • Journal of Science Education
    • /
    • v.45 no.3
    • /
    • pp.292-303
    • /
    • 2021
  • This study identifies the terms frequently used together with energy in online science news articles and topics of the news reports to find out how the term energy is used in everyday life and to draw implications for science curriculum and instruction about energy. A total of 2,171 online news articles in science category published by 11 major newspaper companies in Korea for one year from March 1, 2018 were selected by using energy as a search term. As a result of natural language processing, a total of 51,224 sentences consisting of 507,901 words were compiled for analysis. Using the R program, term frequency analysis, semantic network analysis, and structural topic modeling were performed. The results show that the terms with exceptionally high frequencies were technology, research, and development, which reflected the characteristics of news articles that report new findings. On the other hand, terms used more than once per two articles were industry-related terms (industry, product, system, production, market) and terms that were sufficiently expected as energy-related terms such as 'electricity' and 'environment.' Meanwhile, 'sun', 'heat', 'temperature', and 'power generation', which are frequently used in energy-related science classes, also appeared as terms belonging to the highest frequency. From a network analysis, two clusters were found including terms related to industry and technology and terms related to basic science and research. From the analysis of terms paired with energy, it was also found that terms related to the use of energy such as 'energy efficiency,' 'energy saving,' and 'energy consumption' were the most frequently used. Out of 16 topics found, four contexts of energy were drawn including 'high-tech industry,' 'industry,' 'basic science,' and 'environment and health.' The results suggest that the introduction of the concept of energy degradation as a starting point for energy classes can be effective. It also shows the need to introduce high-tech industries or the context of environment and health into energy learning.

Analysis of Safety Education Contents of 『Field of home life』 in Technology·Home Economics Textbook developed by the revised curriculum in 2009 (2009 개정 기술·가정 교과서 『가정생활영역』의 안전교육 내용 분석)

  • Kim, Nam Eun
    • Journal of Korean Home Economics Education Association
    • /
    • v.29 no.2
    • /
    • pp.23-39
    • /
    • 2017
  • The Purpose of this study is to present the basic data for selecting and improving the safety education contents which help practically middle school students through analysis of contents of safety education in 'field of home life' of 2009 revised middle school textbooks. The subjects of analysis are 12 types of middle school textbooks: in total 24 books written by 12 publishers in terms of the revised curriculum in 2009. The analysis criteria is developed by the researcher referring to preceding studies regarding safety education based on the seventh safety education standard presented by the Ministry of Education (2015). With such analysis criteria, all words related to the contents of the safety education of analysis criteria were extracted from each textbook, such as words directly mentioned as 'safety', words mean as 'psychological safety' and 'happy life', words related to 'attention', 'note', 'stability' etc. Under the analytic frame of safety education contents according to a home economics textbook, content analysis method was used for producing the frequency and percent of those words. The textbook analysis shows that the number of pages regarding safety education is 336.3 pages, as 9.8% in total 3,412 pages of 12 types of technology and home economics textbooks. As following the analysis of each textbook volume of the proportion in the contents related to safety education, 224.9 pages are on the first volume and 111.9 pages are on the second volume. As grades increase from year one to year three, the proportion of safety education in home economics textbooks is decreased. The highest number of safety education contents unit is 'Self-management of youth' which includes three parts of safety education. In the case of a unit for emphasizing practice, experience and practical exercise such as 'Life of youth' and 'Practice of eco-living', safety education content in the area of 'life safety' are mostly contained. Safety accidents related to the most student experienced, Household accidents (1.4%) and experiment or practice accidents (0.3%) are presented in a low figure. The contents of universal housing and school violence are duplicated on first and second volume of text. The most presented safety education content in the 12 types of textbooks are proper sexual attitude, dietary problems, family conflict and food choice. The least common contents are dangerous drugs, family welfare, internet addiction and industrial accident compensation insurance. As this study is to analyze 12 textbooks developed in 2009 revision curriculum, it is necessary to compare it with the textbook written by the revised curriculum in 2015 and to clarify the contents system of safety education and to avoid duplication of contents. In addition, it is necessary to develop and distribute a safety education program that can support textbooks.

Automatic Extraction of Opinion Words from Korean Product Reviews Using the k-Structure (k-Structure를 이용한 한국어 상품평 단어 자동 추출 방법)

  • Kang, Han-Hoon;Yoo, Seong-Joon;Han, Dong-Il
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.6
    • /
    • pp.470-479
    • /
    • 2010
  • In relation to the extraction of opinion words, it may be difficult to directly apply most of the methods suggested in existing English studies to the Korean language. Additionally, the manual method suggested by studies in Korea poses a problem with the extraction of opinion words in that it takes a long time. In addition, English thesaurus-based extraction of Korean opinion words leaves a challenge to reconsider the deterioration of precision attributed to the one to one mismatching between Korean and English words. Studies based on Korean phrase analyzers may potentially fail due to the fact that they select opinion words with a low level of frequency. Therefore, this study will suggest the k-Structure (k=5 or 8) method, which may possibly improve the precision while mutually complementing existing studies in Korea, in automatically extracting opinion words from a simple sentence in a given Korean product review. A simple sentence is defined to be composed of at least 3 words, i.e., a sentence including an opinion word in ${\pm}2$ distance from the attribute name (e.g., the 'battery' of a camera) of a evaluated product (e.g., a 'camera'). In the performance experiment, the precision of those opinion words for 8 previously given attribute names were automatically extracted and estimated for 1,868 product reviews collected from major domestic shopping malls, by using k-Structure. The results showed that k=5 led to a recall of 79.0% and a precision of 87.0%; while k=8 led to a recall of 92.35% and a precision of 89.3%. Also, a test was conducted using PMI-IR (Pointwise Mutual Information - Information Retrieval) out of those methods suggested in English studies, which resulted in a recall of 55% and a precision of 57%.

Exploring the Research Trend Changes on Convergence Education of Before and After 2011 in Science Education (2011년 전후의 과학교육분야에서의 융합교육 연구동향의 변화 탐색)

  • Song, Youngwook;Paik, Seoung-Hey
    • Journal of The Korean Association For Science Education
    • /
    • v.40 no.5
    • /
    • pp.531-542
    • /
    • 2020
  • The purpose of this study is to explore the research trend changes of convergence education since 2011 compared to the convergence education research that has been steadily continuing in science education. The trend in convergence education were investigated by comparing the number of publications, research subjects, research content, and topic linkages with previous studies, and using the network analysis method to check recent research trends. In the field of science education, the number of papers related to convergence education has been published more than 8.0% steadily, and it has been increasing since 2012, then decreasing again from 2015 and gradually increasing again from 2017. The subjects of study were high in elementary school students, while those in middle school, high school, and university students were low. While the number of in-service teachers increased, the number of pre-service teachers decreased, and the literature and public increased somewhat. In study content, effectiveness studies decreased, while development studies increased, and theoretical and perception studies appeared similar. In thematic linkage, the intra-science linkage was 23.9%, and the extra-science linkage was 76.1% and engineering/technology and art were high in extra-science linkage. In network analysis, elementary, science, STEAM, and program words have a high frequency of appearance and appear together with other words to lead the network. The educational implications of the research trend of convergence education will be more emphasized in the field of science education in the future, and in order to take root in the education field, research on secondary students should be more actively studied. In addition, it is necessary to move away from research on STEAM-centered program development and effects, and to increase research to establish the philosophical basis and theoretical of convergence education.

Analysis of Research Topics among Library, Archives and Museums using Topic Modeling (토픽 모델링을 활용한 도서관, 기록관, 박물관간의 연구 주제 분석)

  • Kim, Heesop;Kang, Bora
    • Journal of Korean Library and Information Science Society
    • /
    • v.50 no.4
    • /
    • pp.339-358
    • /
    • 2019
  • The purpose of this study is to understand the topics of the research for the establishment of cooperative platform between libraries, archives, and museums that carry out the common task of providing knowledge information in a broad sense. To achieve the purpose of this study, 637 bibliographic information on three institutions were collected from the Web version of Scopus database. Among the collected bibliographic information, 5,218 words were extracted through NetMiner V.4 and analysed topic modeling. The results are as follows: First, as a result of analyzing the frequency of word appearance according to the tf-idf weight 'Preservation' was the most hottest topic. Second, the topic modeling analysis through LDA(Latent Dirichlet Allocation) algorithm resulted in 13 topic areas. Third, as a result of expressing 13 topic areas as a network, repository construction was the central topic, and the research topics such as cooperation among institutions, conservation environment for collections, system and policy discovery, life cycle of collections, exhibition of information resources, and information retrieval were closely related to the central topic. Fourth, the trend of 13 topic areas by year 1998 is limited to the specific subjects such as system and policy discovery, information retrieval, and life cycle of collections, while the subsequent studies have been carried out after that year.

Online Information Sources of Coronavirus Using Webometric Big Data (코로나19 사태와 온라인 정보의 다양성 연구 - 빅데이터를 활용한 글로벌 접근법)

  • Park, Han Woo;Kim, Ji-Eun;Zhu, Yu-Peng
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.11
    • /
    • pp.728-739
    • /
    • 2020
  • Using webometric big data, this study examines the diversity of online information sources about the novel coronavirus causing the COVID-19 pandemic. Specifically, it focuses on some 28 countries where confirmed coronavirus cases occurred in February 2020. In the results, the online visibility of Australia, Canada, and Italy was the highest, based on their producing the most relevant information. There was a statistically significant correlation between the hit counts per country and the frequency of visiting the domains that act as information channels. Interestingly, Japan, China, and Singapore, which had a large number of confirmed cases at that time, were providing web data related to the novel coronavirus. Online sources were classified using an N-tuple helix model. The results showed that government agencies were the largest supplier of coronavirus information in cyberspace. Furthermore, the two-mode network technique revealed that media companies, university hospitals, and public healthcare centers had taken a positive attitude towards online circulation of coronavirus research and epidemic prevention information. However, semantic network analysis showed that health, school, home, and public had high centrality values. This means that people were concerned not only about personal prevention rules caused by the coronavirus outbreak, but also about response plans caused by life inconveniences and operational obstacles.

A Study on Text Mining Methods to Analyze Civil Complaints: Structured Association Analysis (민원 분석을 위한 텍스트 마이닝 기법 연구: 계층적 연관성 분석)

  • Kim, HyunJong;Lee, TaiHun;Ryu, SeungEui;Kim, NaRang
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.23 no.3
    • /
    • pp.13-24
    • /
    • 2018
  • For government and public institutions, civil complaints containing direct requirements of citizens can be utilized as important data in developing policies. However, it is difficult to draw accurate requirements using text mining methods since the nature of the complaint text is unstructured. In this study, a new method is proposed that draws the exact requirements of citizens, improving the previous text mining in analyzing the data of civil complaints. The new text-mining method is based on the principle of Co-Occurrences Structure Map, and it is structured by two-step association analysis, so that it consists of the first-order related word and a second-order related word based on the core subject word. For the analysis, 3,004 cases posted on the electronic bulletin board of Busan City for the year 2016 are used. This study's academic contribution suggests a method deriving the requirements of citizens from the civil affairs data. As a practical contribution, it also enables policy development using civil service data.

An Analysis of News Report Characteristics on Archives & Records Management for the Press in Korea: Based on 1999~2018 News Big Data (뉴스 빅데이터를 이용한 우리나라 언론의 기록관리 분야 보도 특성 분석: 1999~2018 뉴스를 중심으로)

  • Han, Seunghee
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.3
    • /
    • pp.41-75
    • /
    • 2018
  • The purpose of this study is to analyze the characteristics of Korean media on the topic of archives & records management based on time-series analysis. In this study, from January, 1999 to June, 2018, 4,680 news articles on archives & records management topics were extracted from BigKinds. In order to examine the characteristics of the media coverage on the archives & records management topic, this study was analyzed to the difference of the press coverage by period, subject, and type of the media. In addition, this study was conducted word-frequency based content analysis and semantic network analysis to investigate the content characteristics of media on the subject. Based on these results, this study was analyzed to the differences of media coverage by period, subject, and type of media. As a result, the news in the field of records management showed that there was a difference in the amount of news coverage and news contents by period, subject, and type of media. The amount of news coverage began to increase after the Presidential Records Management Act was enacted in 2007, and the largest amount of news was reported in 2013. Daily newspapers and financial newspapers reported the largest amount of news. As a result of analyzing news reports, during the first 10 years after 1999, news topics were formed around the issues arising from the application and diffusion process of the concept of archives & records management. However, since the enactment of the Presidential Records Management Act, archives & records management has become a major factor in political and social issues, and a large amount of political and social news has been reported.

Research Trends of Young Children's Play Using the Semantic Network Analysis (언어네트워크분석을 통한 유아놀이 관련 연구동향 탐색)

  • Kim, Jong-Hoon;Park, Sun-Mi
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.2
    • /
    • pp.296-303
    • /
    • 2020
  • The purpose of this study was to examine the keywords of studies related to young children's play in the selected registered and candidate academic journals and the network of the keywords by conducting a semantic network analysis. The selected journals were published over the past decade in diverse fields of study that included social sciences and life sciences such as education and early childhood education. The findings of the study were as follows: First, there was a great increase in the studies related to early childhood play over the last five years in comparison with the first term(2009-2013). As a result of analyzing how many studies were included in the journals by field, the largest numbers of the studies were published in the field of education, followed by early childhood education, and life sciences. Second, when the network of the keywords was analyzed, the major keywords in the first term were playfulness, role play, young children, creativity, play, and peer play interaction. In the second term(2014-2018), playfulness was also the most frequently exhibited keyword, followed by young children, play, and peer play behavior. Keywords such as teacher-child interaction, language skills, happiness, cognitive ability, early childhood education newly appeared.

Time Series Analysis of Intellectual Structure and Research Trend Changes in the Field of Library and Information Science: 2003 to 2017 (문헌정보학 분야의 지적구조 및 연구 동향 변화에 대한 시계열 분석: 2003년부터 2017년까지)

  • Choi, Hyung Wook;Choi, Ye-Jin;Nam, So-Yeon
    • Journal of the Korean Society for information Management
    • /
    • v.35 no.2
    • /
    • pp.89-114
    • /
    • 2018
  • Research on changes in research trends in academic disciplines is a method that enables observation of not only the detailed research subject and structure of the field but also the state of change in the flow of time. Therefore, in this study, in order to observe the changes of research trend in library and information science field in Korea, co-word analysis was conducted with Korean author keywords from three types of journals which were listed in the Korea Citation Index(KCI) and have top citation impact factor were selected. For the time series analysis, the 15-year research period was accumulated in 5-years units, and divided into 2003~2007, 2003~2012, and 2003~2017. The keywords which limited to the frequency of appearance 10 or more, respectively, were analyzed and visualized. As a result of the analysis, during the period from 2003 to 2007, the intellectual structure composed with 25 keywords and 8 areas was confirmed, and during the period from 2003 to 2012, the structure composed by 3 areas 17 sub-areas with 76 keywords was confirmed. Also, the intellectual structure during the period from 2003 to 2017 was crowded into 6 areas 32 consisting of a total of 132 keywords. As a result of comprehensive period analysis, in the field of library and information science in Korea, over the past 15 years, new keywords have been added for each period, and detailed topics have also been subdivided and gradually segmented and expanded.