• Title/Summary/Keyword: Text Mining for Korean

Search Result 638, Processing Time 0.024 seconds

The Stream of Uncertainty in Scientific Knowledge using Topic Modeling (토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구)

  • Heo, Go Eun
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.1
    • /
    • pp.191-213
    • /
    • 2019
  • The process of obtaining scientific knowledge is conducted through research. Researchers deal with the uncertainty of science and establish certainty of scientific knowledge. In other words, in order to obtain scientific knowledge, uncertainty is an essential step that must be performed. The existing studies were predominantly performed through a hedging study of linguistic approaches and constructed corpus with uncertainty word manually in computational linguistics. They have only been able to identify characteristics of uncertainty in a particular research field based on the simple frequency. Therefore, in this study, we examine pattern of scientific knowledge based on uncertainty word according to the passage of time in biomedical literature where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS and DMR topic modeling which is useful method to identify patterns in disciplines is applied to understand the trend of entity based topic with uncertainty. As time goes by, the development of research has been confirmed that uncertainty in scientific knowledge is moving toward a decreasing pattern.

A Study of Information Literacy Curriculum Using Topic Modeling (토픽모델링을 활용한 정보활용교육 연구주제 분석 및 교육내용 제안)

  • Jihye, Yun;Yoo Kyung, Jeong
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.4
    • /
    • pp.1-21
    • /
    • 2022
  • The aim of this study is to identify the research topics and suggest an information literacy curriculum by analyzing research articles on information literacy. For this purpose, we applied the topic modeling technique to 97 scientific articles and identified the core contents of information literacy education, such as media literacy, information literacy instruction, and the use of information resources. Based on the analysis results, we suggested an information literacy curriculum by considering the Big 6 model, information literacy standards of American Association of School Library, and Association of College and Research Libraries's information literacy competencies. This study is significant in that it considered 'use of information resources' and 'information ethics' to suggest information literacy education.

An Analysis of Domestic Newspaper Articles on 5.18 using the Bigkinds System (빅카인즈를 활용한 5·18 관련 국내 기사 분석 연구)

  • Juhyeon Park;Hyunji Park;Youngbum Gim
    • Journal of the Korean Society for information Management
    • /
    • v.41 no.1
    • /
    • pp.107-132
    • /
    • 2024
  • This study attempted to analyze newspaper articles related to May 18 through frequency analysis and network analysis using news data related to May 18 for about 30 years from 1990 to 2022 at the Korea Press Foundation's Big Kinds. Specifically, quantitative change trends were examined by analyzing the amount of articles by period and region, and the connection structure between major keywords by the regime was explored through network analysis by regime using co-appearance keywords. As a result of the analysis, it was found that 2019 had the largest amount of coverage, which had many social issues in time, and the Jeolla-do region had the largest amount of coverage in the region. And as a result of network analysis, there were differences in words related to May 18 in news data according to the perception and policy of the regime toward May 18. As a result of synthesizing the analysis of May 18 news data, it was confirmed that May 18 was becoming a democratic movement over time regardless of region, but at the same time, the distortion of May 18 was not resolved.

A Study on the Perceptions of SW·AI Education for Elementary and Secondary School Teachers Using Text Mining (텍스트 마이닝을 이용한 초·중등 교사의 SW·AI 교육에 대한 인식 연구)

  • Mihyun Chung;Oakyoung Han;Kapsu Kim;Seungki Shin;Jaehyoun Kim
    • Journal of Internet Computing and Services
    • /
    • v.24 no.6
    • /
    • pp.57-64
    • /
    • 2023
  • This study analyzed the perceptions of elementary and secondary school teachers regarding the importance of SW/AI education in fostering students' fundamental knowledge and the necessity of integrating SW/AI into education. A total of 830 elementary and secondary school teachers were selected as study subjects using the judgment sampling method. The analysis of survey data revealed that elementary and secondary teachers exhibited a strong awareness of the importance and necessity of SW/AI education, irrespective of school characteristics, region, educational experience, or prior involvement in SW and AI education. Nevertheless, the primary reasons for not implementing SW/AI education were identified as excessive workload and a lack of pedagogical expertise. An analysis of opinions on the essential conditions for implementing SW/AI education revealed that workload reduction, budget support, teacher training to enhance teacher competency, content distribution, expansion of subject-linked courses, and dedicated instructional time allocation were the major influencing factors. These findings indicate a significant demand for comprehensive instructional support and teacher capacity-building programs.

Definition and Division in Intelligent Service Facility for Integrating Management (지능화시설의 통합운영관리를 위한 정의 및 구분에 관한 연구)

  • PARK, Jeong-Woo;YIM, Du-Hyun;NAM, Kwang-Woo;KIM, Jin-Young
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.4
    • /
    • pp.52-62
    • /
    • 2016
  • Smart City is urban development for complex problem solving that provides convenience and safety for citizens, and it is a blueprint for future cities. In 2008, the Korean government defined the construction, management, and government support of U-Cities in the legislation, Act on the Construction, Etc. of Ubiquitous Cities (Ubiquitous City Act), which included definitions of terms used in the act. In addition, the Minister of Land, Infrastructure and Transport has established a "ubiquitous city master plan" considering this legislation. The concept of U-Cities is complex, due to the mix of informatization and urban planning. Because of this complexity, the foundation of relevant regulations is inadequate, which is impeding the establishment and implementation of practical plans. Smart City intelligent service facilities are not easy to define and classify, because technology is rapidly changing and includes various devices for gathering and expressing information. The purpose of this study is to complement the legal definition of the intelligent service facility, which is necessary for integrated management and operation. The related laws and regulations on U-City were analyzed using text-mining techniques to identify insufficient legal definitions of intelligent service facilities. Using data gathered from interviews with officials responsible for constructing U-Cities, this study identified problems generated by implementing intelligent service facilities at the field level. This strategy should contribute to improved efficiency management, the foundation for building integrated utilization between departments. Efficiencies include providing a clear concept for establishing five-year renewable plans for U-Cities.

A Classification Model for Illegal Debt Collection Using Rule and Machine Learning Based Methods

  • Kim, Tae-Ho;Lim, Jong-In
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.4
    • /
    • pp.93-103
    • /
    • 2021
  • Despite the efforts of financial authorities in conducting the direct management and supervision of collection agents and bond-collecting guideline, the illegal and unfair collection of debts still exist. To effectively prevent such illegal and unfair debt collection activities, we need a method for strengthening the monitoring of illegal collection activities even with little manpower using technologies such as unstructured data machine learning. In this study, we propose a classification model for illegal debt collection that combine machine learning such as Support Vector Machine (SVM) with a rule-based technique that obtains the collection transcript of loan companies and converts them into text data to identify illegal activities. Moreover, the study also compares how accurate identification was made in accordance with the machine learning algorithm. The study shows that a case of using the combination of the rule-based illegal rules and machine learning for classification has higher accuracy than the classification model of the previous study that applied only machine learning. This study is the first attempt to classify illegalities by combining rule-based illegal detection rules with machine learning. If further research will be conducted to improve the model's completeness, it will greatly contribute in preventing consumer damage from illegal debt collection activities.

Development and Operation of Marine Environmental Portal Service System (해양환경 포탈서비스시스템 구축과 운영)

  • 최현우;권순철
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.05a
    • /
    • pp.338-341
    • /
    • 2003
  • According to a long-term master plan for the implementing of MOMAF's marine environmental informatization, we have developed marine environment portal web site which consists of 7 main-menu and 39 sub-menu including various types of contents (text, image and multimedia) based on RDBMS. This portal site was opened in Oct., 2002 (http://www.meps.info). Also, for the national institutions' distributed DB which is archived and managed respectively the marine chemical data and biological data, the integrated retrieval system was developed. This system is meaningful for the making collaborative use of real data and could be applied for data mining, marine research, marine environmental GIS and making-decisions.

  • PDF

A Study on the Research Trends on Open Innovation using Topic Modeling (토픽 모델링을 이용한 개방형 혁신 연구동향 분석 및 정책 방향 모색)

  • Cho, Sung-Bae;Shin, Shin-Ae;Kang, Dong-Seok
    • Informatization Policy
    • /
    • v.25 no.3
    • /
    • pp.52-74
    • /
    • 2018
  • In February 2018, the Korean government established the "Comprehensive Plans for Government Innovation" in order to realize 'the people-centered government'. The core of the comprehensive plans is participation of the people, which is very similar to open innovation where social issues are solved by ideas and capabilities of the private sector rather than those of the government. Therefore, this study was conducted by extracting open innovation topics through topic modeling based on LDA(Latent Dirichlet Allocation) as English abstract-data from 2003, when the plans for open innovation was first announced, to April 2018. Based on the extracted results, it also conducted a comparative analysis with "Comprehensive Plans for Government Innovation." The study has significant implications in that it derives the relationship between the subjects, analyzes the present policies of Korea on open innovation and suggests directions for development.

Methodology for Search Intent-based Document Recommendation

  • Lee, Donghoon;Kim, Namgyu
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.6
    • /
    • pp.115-127
    • /
    • 2021
  • It is not an easy task for a user to find the correct documents that a user really wanted at once from a vast amount of the search results. For this reason, various methods of recommending documents by taking the user's preferences into consideration based on the user's document browsing history have been proposed. However, the document recommendation methodology based on the document browsing history also has a limitation that only the information the user has viewed is utilized, but the intent of the user searching for the document is not fully utilized. Therefore, we propose a document recommendation method based on the user's search intent that utilizes information on "Why" the user reads the document, instead of the information on "Who" reads the document. In order to confirm the feasibility of the proposed methodology, an experiment was conducted by analyzing 239,438 actual user's search history of one of the most popular e-commerce platform companies in Korea. As a result, our methodology showed superior performance compared to the existing content-based or simple browsing history-based recommendation model.

Topic modeling and topic change trend analysis for advanced construction technologies (건설신기술에 대한 토픽 모델링 및 토픽 변화추이 분석)

  • Jeong, Seong Yun;Kim, Nam Gon
    • Smart Media Journal
    • /
    • v.10 no.4
    • /
    • pp.102-110
    • /
    • 2021
  • Currently, the advanced construction technology endorsement system is being operated to promote the development of domestic construction technology. We tried to examine the implicit meanings inherent in advanced construction technologies by analyzing the relationship between emerging vocabularies with high importance in relation to the advanced construction technologies endorsed through this system. For this purpose, 918 cases of advanced construction technology information were collected. Based on the endorsed year and summary of the advanced construction technologies, the importance of the emerging vocabularies was measured for each advanced construction technology. And, based on the LDA model, the degree of influence between related vocabularies was evaluated for each of the four topic areas. Topics according to the technical application fields were analyzed. From 1990 to 2021, the trend of changes in highly influential vocabularies by each topic was inferred. In the future, changes in the degree of influence of the topics of environment, machinery, facilities, and maintenance and reinforcement of structures and related technology fields were predicted.