• Title/Summary/Keyword: Important Performance Analysis

A Study on Job Satisfaction of Records Managers (기록물관리 전문요원의 직무만족도에 관한 연구)

  • Yoo, Hyeon Gyeong; Kim, Soojung
    • The Korean Journal of Archival Studies / no.47 / pp.95-130 / 2016
  • The job satisfaction of records managers matters because it affects their work performance and retention. This study investigates records managers' job satisfaction and identifies the factors that affect it, in order to find ways to improve it. The specific research questions are: 1) How satisfied are records managers with their jobs? 2) Do the factors affecting job satisfaction differ by records managers' personal characteristics? 3) Which factors influence job satisfaction most? Questionnaires were used to gather data from 60 domestic records managers working in different types of records centers. Data analyses included descriptive statistics, one-way ANOVA, independent t-tests, and multiple regression analysis. In addition, interviews with 2 records managers were conducted to collect opinions on the factors behind job dissatisfaction and recommendations for improving job satisfaction. The main findings are as follows. First, the respondents are moderately satisfied with their jobs (3.2 out of 5 points). The level of job satisfaction differs by years of career, years of employment, the number of personnel the respondent works with in the records center, and other characteristics; the number of personnel the respondent works with was found to be the most influential of these. Second, the multiple regression analysis shows that motivation factors (satisfaction factors) influence the respondents' job satisfaction more than hygiene factors (dissatisfaction factors), which confirms Herzberg's two-factor theory. More specifically, 'work ethic,' one of the motivation factors, has the greatest influence, followed by 'procedural impartiality,' 'communication,' 'job characteristic,' 'distributive justice,' and 'working conditions.'
Based on the results, this study suggests several ways to improve records managers' job satisfaction. First, awareness of records management should be increased. The respondents indicated that their job dissatisfaction usually stems from a lack of awareness of records management. Therefore, the heads of organizations, the National Archives of Korea, and records managers themselves should all work to raise that awareness. In particular, records managers should make stronger efforts to attract their offices' attention. Second, records managers ought to establish their identity as records management professionals. They should also participate in various activities of the archival community to overcome the limitations of working as individuals.

Analysis of Optimal Resolution and Number of GCP Chips for Precision Sensor Modeling Efficiency in Satellite Images (농림위성영상 정밀센서모델링 효율성 재고를 위한 최적의 해상도 및 지상기준점 칩 개수 분석)

  • Choi, Hyeon-Gyeong; Kim, Taejung
    • Korean Journal of Remote Sensing / v.38 no.6_1 / pp.1445-1462 / 2022
  • Compact Advanced Satellite 500-4 (CAS500-4), which is scheduled to be launched in 2025, is a mid-resolution satellite with a 5 m resolution developed for wide-area agriculture and forest observation. To utilize satellite images, it is important to establish a precision sensor model and obtain accurate geometric information. Previous research reported that a precision sensor model could be established automatically by matching ground control point (GCP) chips against satellite images. Therefore, improving the geometric accuracy of satellite images requires improving GCP chip matching performance. This paper proposes an improved GCP chip matching scheme for precision sensor modeling of mid-resolution satellite images. When high-resolution GCP chips are matched against mid-resolution satellite images, there are two major issues: handling the resolution difference between the GCP chips and the satellite images, and finding the optimal number of GCP chips. To address these issues, this study compared and analyzed chip matching performance across various satellite image upsampling factors and various numbers of chips. RapidEye images with a resolution of 5 m were used as the mid-resolution satellite images. GCP chips were prepared from aerial orthoimages with a resolution of 0.25 m and satellite orthoimages with a resolution of 0.5 m. Accuracy analysis was performed using manually extracted reference points. The experimental results show that upsampling factors of two and three significantly improved sensor model accuracy. They also show that accuracy was maintained when the number of GCP chips was reduced to around 100. The results confirm that high-resolution GCP chips can be applied to automated precision sensor modeling of mid-resolution satellite images with improved accuracy. The results are expected to be useful for establishing a precise sensor model for CAS500-4.

Comparative assessment and uncertainty analysis of ensemble-based hydrologic data assimilation using airGRdatassim (airGRdatassim을 이용한 앙상블 기반 수문자료동화 기법의 비교 및 불확실성 평가)

  • Lee, Garim; Lee, Songhee; Kim, Bomi; Woo, Dong Kook; Noh, Seong Jin
    • Journal of Korea Water Resources Association / v.55 no.10 / pp.761-774 / 2022
  • Accurate hydrologic prediction is essential for analyzing the effects of drought, flood, and climate change on flow rates, water quality, and ecosystems. Disentangling the uncertainty of hydrological models is one of the important issues in hydrology and water resources research. Hydrologic data assimilation (DA), a technique that updates the states or parameters of a hydrological model to produce the most likely estimates of the model's initial conditions, is one way to reduce uncertainty in hydrological simulations and improve predictive accuracy. In this study, two ensemble-based sequential DA techniques, the ensemble Kalman filter and the particle filter, are compared for daily discharge simulation at the Yongdam catchment using airGRdatassim. The Kling-Gupta efficiency (KGE) improved from 0.799 in the open-loop simulation to 0.826 with the ensemble Kalman filter and to 0.933 with the particle filter. In addition, we analyzed the effects of hyper-parameters of the DA methods, such as the precipitation and potential evaporation forcing error parameters and the selection of perturbed and updated states. Across the forcing error conditions, the particle filter was superior to the ensemble Kalman filter in terms of KGE. The optimal forcing noise was smaller for the particle filter than for the ensemble Kalman filter. In addition, DA performance improved as more state variables were included in the updating step, implying that the selection of updating states can be treated as a hyper-parameter. The simulation experiments imply that DA hyper-parameters need to be carefully optimized to exploit the potential of DA methods.
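
The KGE scores quoted in this abstract can be made concrete with a small sketch. It assumes the commonly used 2009 formulation of the Kling-Gupta efficiency (correlation, variability ratio, and bias ratio combined as a Euclidean distance from the ideal point); the abstract does not say which KGE variant the authors used, and the function name and sample values below are illustrative only.

```python
import math

def kge(sim, obs):
    """Kling-Gupta efficiency (assumed 2009 form); 1.0 means a perfect match."""
    if len(sim) != len(obs) or len(sim) < 2:
        raise ValueError("sim and obs must have the same length (>= 2)")
    n = len(obs)
    mean_s = sum(sim) / n
    mean_o = sum(obs) / n
    # Population standard deviations and covariance of the two series.
    sd_s = math.sqrt(sum((s - mean_s) ** 2 for s in sim) / n)
    sd_o = math.sqrt(sum((o - mean_o) ** 2 for o in obs) / n)
    cov = sum((s - mean_s) * (o - mean_o) for s, o in zip(sim, obs)) / n
    r = cov / (sd_s * sd_o)   # linear correlation
    alpha = sd_s / sd_o       # variability ratio
    beta = mean_s / mean_o    # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)

observed = [1.0, 2.0, 3.0, 4.0]    # made-up discharge values
simulated = [2.0, 4.0, 6.0, 8.0]   # a simulation that doubles every observation
print(kge(observed, observed))
print(kge(simulated, observed))
```

A simulation that reproduces the observations scores 1; one that is perfectly correlated but doubles both the mean and the spread is penalized on the bias and variability terms, which is why KGE is more informative than correlation alone.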

Seasonal Variations of Microphytobenthos in Sediments of the Estuarine Muddy Sandflat of Gwangyang Bay: HPLC Pigment Analysis (광합성색소 분석을 통한 광양만 갯벌 퇴적물 중 저서미세조류의 계절변화)

  • Lee, Yong-Woo; Choi, Eun-Jung; Kim, Young-Sang; Kang, Chang-Keun
    • The Sea: Journal of the Korean Society of Oceanography / v.14 no.1 / pp.48-55 / 2009
  • Seasonal variations of microalgal biomass and community composition in both the sediment and the seawater were investigated by HPLC pigment analysis in an estuarine muddy sandflat of Gwangyang Bay from January to November 2002. Among the photosynthetic pigments, fucoxanthin, diadinoxanthin, and diatoxanthin were the most dominant all year round, indicating that diatoms were the predominant algal group in both the sediment and the seawater of Gwangyang Bay. Algal pigments other than the diatom-marker pigments showed relatively low concentrations. Microphytobenthic chlorophyll $a$ concentrations in the upper layer (0.5 cm) of sediments ranged from 3.44 (March at the middle site of the tidal flat) to 169 (July at the upper site) mg $m^{-2}$, with annual mean concentrations of $68.4 \pm 45.5$, $21.3 \pm 14.3$, and $22.9 \pm 15.6$ mg $m^{-2}$ at the upper, middle, and lower tidal sites, respectively. Depth-integrated chlorophyll $a$ concentrations in the overlying water column ranged from 1.66 (November) to 11.7 (July) mg $m^{-2}$, with an annual mean of $6.96 \pm 3.04$ mg $m^{-2}$. Microphytobenthic biomass was about 3 to 10 times higher than the depth-integrated phytoplankton biomass in the overlying water column. The physical characteristics of this shallow estuarine tidal flat, the similarity in taxonomic composition of the phytoplankton and microphytobenthos, and the similar seasonal patterns in their biomass suggest that resuspended microphytobenthos are an important component of phytoplankton biomass in Gwangyang Bay. Therefore, considering the importance of microphytobenthos as a possible food source for estuarine benthic and pelagic consumers, consistent monitoring of the behavior of microphytobenthos is needed in tidal flat ecosystems.

The Effect of Information Quality and System Quality on Knowledge Service Competence: Focusing on Knowledge Service Types (지식서비스의 정보품질과 시스템품질이 지식서비스 역량에 미치는 영향: 지식서비스 유형을 중심으로)

  • Geun-Wan Park; Hyun-Ji Park; Sung-Hoon Mo; Cheol-Hyun Lim; Hee-Seok Choi; Seok-Hyoung Lee; Hye-Jin Lee; Seung-June Hwang; Chang-Hee Han
    • Information Systems Review / v.21 no.4 / pp.1-29 / 2019
  • Knowledge resources promote the sustainable growth of an organization. It is therefore important for the members of an organization to acquire knowledge continuously so that the company can keep growing. Knowledge service is the field that provides the information and infrastructure that enable members of an organization to acquire new knowledge. Recognizing the importance of knowledge services, we analyzed the level of knowledge service management and development through the impact of knowledge quality on user capabilities. First, a matrix of knowledge patterns was presented based on the type of information and the level of customer interaction. According to these patterns, knowledge services were classified into three types (information providing, information analysis, and infrastructure), and the results of structural model analysis were presented for each type. The impact of knowledge service quality on user competence was found to differ by type of service. The results suggest new indicators for measuring the performance of knowledge services and provide information for reconstructing services around the user, considering the integrated operation of knowledge services and the organizational design of knowledge services.

Incorporating Social Relationship discovered from User's Behavior into Collaborative Filtering (사용자 행동 기반의 사회적 관계를 결합한 사용자 협업적 여과 방법)

  • Thay, Setha; Ha, Inay; Jo, Geun-Sik
    • Journal of Intelligence and Information Systems / v.19 no.2 / pp.1-20 / 2013
  • Social networks are now a huge communication platform that lets people connect with one another and brings users together to share common interests, experiences, and daily activities. Users spend hours per day maintaining personal information and interacting with other people through posts, comments, messages, games, social events, and applications. Because of the growth of users' distributed information in social networks, there is great potential to use social data to enhance the quality of recommender systems. Some research on social network analysis investigates how social networks can be used in the recommendation domain. Among this research, we are interested in taking advantage of the interactions between a user and others in a social network, which can be identified and are known as social relationships. Furthermore, users' decisions before purchasing a product mostly depend on the suggestions of people who have either the same preferences or a closer relationship. For this reason, we believe that users' relationships in a social network can provide an effective way to improve the quality of a recommender system's predictions of users' interests. Social relationships between users found in a social network are therefore a common factor for improving how the conventional approach predicts users' preferences. Recommender systems are rapidly increasing in popularity and are currently used by many e-commerce sites such as Amazon.com, Last.fm, and eBay.com. The collaborative filtering (CF) method is one of the essential and powerful techniques in recommender systems for suggesting appropriate items to a user by learning the user's preferences. CF focuses on user data and automatically predicts a user's interests by gathering information from users who share similar backgrounds and preferences.
Specifically, the intention of the CF method is to find users who have similar preferences and to suggest to the target user the items most preferred by those nearest-neighbor users. CF considers two basic units, the user and the item. Each user provides rating values for items (e.g., movies, products, books) to indicate interest in those items. CF then uses the user-rating matrix to find a group of users whose ratings are similar to the target user's, and predicts the unknown rating values for items that the target user has not rated. CF has been successfully implemented in both information filtering and e-commerce applications. However, important challenges remain, such as cold start, data sparsity, and scalability, which affect the quality and accuracy of prediction. To overcome these challenges, many researchers have proposed various kinds of CF methods such as hybrid CF, trust-based CF, and social network-based CF. To improve the recommendation performance and prediction accuracy of standard CF, in this paper we propose a method that integrates the traditional CF technique with social relationships between users discovered from users' behavior in a social network, namely Facebook. We identify users' relationships from their behavior, such as the posts and comments exchanged with friends on Facebook. We believe that social relationships implicitly inferred from users' behavior can compensate for the limitations of the conventional approach. Therefore, we extract the posts and comments of each user using the Facebook Graph API and calculate a feature score for each term to obtain a feature vector for computing user similarity. We then combine the result with the similarity value computed using the traditional CF technique. Finally, our system provides a list of recommended items based on the neighbor users who have the largest total similarity value to the target user.
To verify and evaluate the proposed method, we performed an experiment on data collected from our Movies Rating System. Prediction accuracy is evaluated in terms of MAE to show how correct our algorithm's recommendations are. Performance is then evaluated in terms of precision, recall, and F1-measure to show the effectiveness of our method. Coverage is also evaluated to assess the ability to generate recommendations. The experimental results show that the proposed method performs better and is more accurate in suggesting items to users. Using users' behavior in the social network improves recommendation accuracy by up to 6%. Moreover, the recommendation performance experiment shows that incorporating social relationships observed from users' behavior into CF improves performance by 7% compared with benchmark methods. Finally, we confirm that interaction between users in a social network can enhance accuracy and give better recommendations than the conventional approach.
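
The idea in this abstract of combining a rating-based similarity with a behavior-based social similarity can be sketched roughly as follows. This is not the authors' implementation: the linear blend, the `weight` parameter, cosine similarity as the measure, the neighbor count `k`, and all of the toy data are assumptions made for illustration, since the abstract does not specify how the two similarity values are combined.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    shared = set(u) & set(v)
    dot = sum(u[key] * v[key] for key in shared)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def total_similarity(a, b, ratings, social, weight=0.5):
    """Blend rating similarity with social similarity (the blend is hypothetical)."""
    rating_sim = cosine(ratings[a], ratings[b])
    social_sim = cosine(social[a], social[b])
    return weight * rating_sim + (1 - weight) * social_sim

def predict(target, item, ratings, social, k=2, weight=0.5):
    """Similarity-weighted average of the k most similar neighbors' ratings."""
    scored = [
        (total_similarity(target, other, ratings, social, weight), ratings[other][item])
        for other in ratings
        if other != target and item in ratings[other]
    ]
    top = sorted(scored, reverse=True)[:k]
    denom = sum(sim for sim, _ in top)
    return sum(sim * r for sim, r in top) / denom if denom > 0 else None

def mae(predicted, actual):
    """Mean absolute error between predicted and actual ratings."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

# Toy data: item ratings, and term scores standing in for posts and comments.
ratings = {
    "ann": {"m1": 5, "m2": 3},
    "bob": {"m1": 5, "m2": 3, "m3": 4},
    "cho": {"m1": 1, "m2": 5, "m3": 2},
}
social = {
    "ann": {"film": 2, "jazz": 1},
    "bob": {"film": 1, "jazz": 1},
    "cho": {"golf": 3},
}

blended = predict("ann", "m3", ratings, social)                  # ratings + social
ratings_only = predict("ann", "m3", ratings, social, weight=1.0) # ratings alone
print(blended, ratings_only)
```

In this toy data, ann interacts about the same topics as bob and shares nothing with cho, so the blended prediction is pulled further toward bob's rating of 4 than the ratings-only prediction is.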

A Prospective Randomized Comparative Clinical Trial Comparing the Efficacy between Ondansetron and Metoclopramide for Prevention of Nausea and Vomiting in Patients Undergoing Fractionated Radiotherapy to the Abdominal Region (복부 방사선치료를 받는 환자에서 발생하는 오심 및 구토에 대한 온단세트론과 메토클로프라미드의 효과 : 제 3상 전향적 무작위 비교임상시험)

  • Park Hee Chul; Suh Chang Ok; Seong Jinsil; Cho Jae Ho; Lim John Jihoon; Park Won; Song Jae Seok; Kim Gwi Eon
    • Radiation Oncology Journal / v.19 no.2 / pp.127-135 / 2001
  • Purpose: This study is a prospective randomized clinical trial comparing the efficacy and complications of anti-emetic drugs for preventing nausea and vomiting after radiotherapy with moderate emetogenic potential. The aim was to investigate whether the anti-emetic efficacy of ondansetron (Zofran®) 8 mg bid (Group O) is better than that of metoclopramide 5 mg tid (Group M) in patients undergoing fractionated radiotherapy to the abdominal region. Materials and Methods: Study entry was restricted to patients who met the following eligibility criteria: histologically confirmed malignant disease; no distant metastasis; performance status of not more than ECOG grade 2; no previous chemotherapy or radiotherapy. Between March 1997 and February 1998, 60 patients were enrolled in this study. All patients signed a written statement of informed consent prior to enrollment. Blinding was maintained by giving an identical number of tablets, including one dose of matching placebo for Group O. The extent of nausea, appetite loss, and the number of emetic episodes were recorded every day on a diary card. The mean scores of nausea and appetite loss and the mean number of emetic episodes were obtained at weekly intervals. Results: A prescription error occurred in one patient, and diary cards were not returned by 3 patients because of premature refusal of treatment. The card from one patient was excluded from the analysis because she had a history of treatment for neurosis. As a result, the analysis included 55 patients. Patient and radiotherapy characteristics were similar between groups, except that mean age was $52.9 \pm 11.2$ in Group M and $46.5 \pm 9.5$ in Group O; this age difference was statistically significant. The weekly mean scores of nausea and appetite loss and the weekly mean number of emetic episodes were higher in Group M than in Group O. In Group M, the symptoms were most pronounced in the 5th week.
In a panel data analysis using a mixed procedure, treatment group was the only significant factor explaining the difference in weekly scores for all three symptoms. Ondansetron (Zofran®) 8 mg bid and metoclopramide 5 mg tid were well tolerated without significant side effects. There were no clinically important changes in vital signs or clinical laboratory parameters with either drug. Conclusion: Given that younger patients have higher emetogenic potential, the age difference between the two treatment groups may have lowered the statistical power of the analysis. There were significant differences favoring the ondansetron group with respect to the severity of nausea, vomiting, and loss of appetite. We conclude that ondansetron is a more effective anti-emetic agent than the commonly used drug metoclopramide for controlling radiotherapy-induced nausea, vomiting, and loss of appetite, without significant toxicity. However, some patients suffered emesis despite the administration of ondansetron. Strategies to improve the prevention and treatment of radiotherapy-induced emesis must be studied further.

The Characteristics and Performances of Manufacturing SMEs that Utilize Public Information Support Infrastructure (공공 정보지원 인프라 활용한 제조 중소기업의 특징과 성과에 관한 연구)

  • Kim, Keun-Hwan; Kwon, Taehoon; Jun, Seung-pyo
    • Journal of Intelligence and Information Systems / v.25 no.4 / pp.1-33 / 2019
  • Small and medium-sized enterprises (hereinafter SMEs) are already at a competitive disadvantage compared to large companies with more abundant resources. Manufacturing SMEs need a great deal of information for new product development to sustain growth and survive, and they seek networking to overcome their resource limitations, but they face constraints because of their size. In a new era in which connectivity increases the complexity and uncertainty of the business environment, SMEs are increasingly urged to find information and solve networking problems. Government-funded research institutes have an important role and duty in solving the information asymmetry problem of SMEs. The purpose of this study is to identify the differentiating characteristics of SMEs that utilize the public information support infrastructure provided to SMEs to enhance their innovation capacity, and how that use contributes to corporate performance. We argue that an infrastructure for providing information support to SMEs is needed as part of the effort to strengthen the role of government-funded institutions; in this study, we specifically identify the target of such a policy and empirically demonstrate the effects of such policy-based efforts. Our goal is to help establish strategies for building the information support infrastructure. To this end, we first classified the characteristics of SMEs that have been found to utilize the information support infrastructure provided by government-funded institutions. This allows us to verify whether selection bias appears in the analyzed group, which helps clarify the interpretative limits of our results.
Next, we performed mediator and moderator effect analysis for multiple variables to analyze the process through which use of the information support infrastructure led to improved external networking capabilities and, in turn, enhanced product competitiveness. This analysis helps identify the key factors to focus on when offering indirect support to SMEs through the information support infrastructure, which in turn helps manage research related to SME support policies implemented by government-funded institutions more efficiently. The results are as follows. First, SMEs that used the information support infrastructure differed significantly in size from domestic R&D SMEs, but there was no significant difference in the cluster analysis that considered various variables. Based on these findings, we confirmed that SMEs that use the information support infrastructure are larger and include a relatively higher share of companies that transact to a greater degree with large companies, compared with the general group of SMEs. We also found that companies that already receive support from the information infrastructure include a high concentration of companies that need collaboration with government-funded institutions. Second, among the SMEs that use the information support infrastructure, increasing external networking capabilities contributed to enhancing product competitiveness. This was not a direct effect; rather, the contribution was made indirectly by increasing open marketing capabilities. In other words, it was the result of an indirect-only mediator effect.
In addition, the number of times a company received additional support in this process through mentoring on information utilization was found to have a mediated moderator effect on improving external networking capabilities and, in turn, strengthening product competitiveness. The results provide several insights for policy. KISTI's information support infrastructure may lead to the conclusion that marketing is already well underway, but it intentionally supports groups that are able to achieve good performance. As a result, the government should set clear priorities on whether to support underdeveloped companies or to aid better performance. Through our research, we have identified how public information infrastructure contributes to product competitiveness, and we draw some policy implications. First, the public information support infrastructure should be able to enhance the ability to interact with, or to find, the experts who provide the required information. Second, if the use of the public information support (online) infrastructure is effective, it is not necessary to continuously provide informational mentoring, which is a parallel offline support. Rather, offline support such as mentoring should be used as a device for monitoring abnormal symptoms. Third, SMEs should improve their ability to utilize the infrastructure, because the effects of enhancing networking capacity and product competitiveness through the public information support infrastructure appear in most types of companies rather than in specific SMEs.

Documentation of Intangible Cultural Heritage Using Motion Capture Technology Focusing on the documentation of Seungmu, Salpuri and Taepyeongmu (부록 3. 모션캡쳐를 이용한 무형문화재의 기록작성 - 국가지정 중요무형문화재 승무·살풀이·태평무를 중심으로 -)

  • Park, Weonmo; Go, Jungil; Kim, Yongsuk
    • Korean Journal of Heritage: History & Science / v.39 / pp.351-378 / 2006
  • With the development of media, the methods for documenting intangible cultural heritage have also developed and diversified. In addition to earlier analogue methods of documentation, new multimedia technologies focusing on digital pictures, sound sources, movies, and so on have recently been applied. Among the new technologies, documentation using 'Motion Capture' has proved prominent, especially in fields that require three-dimensional documentation such as dances and performances. Motion Capture refers to the documentation technology that records the time-varying position signals from sensors attached to the surface of an object. It converts the signals from the sensors into digital data that can be plotted as points in the virtual coordinates of a computer and records the movement of those points over a period of time as the object moves. It produces scientific data for the preservation of intangible cultural heritage by displaying digital data that represent the virtual motion of a holder of an intangible cultural heritage. The National Research Institute of Cultural Properties (NRICP) has been working on developing a new documentation method for the Important Intangible Cultural Heritage designated by the Korean government. This is being done using 'motion capture' equipment that is also widely used for computer graphics in the movie and game industries. The project is designed to apply motion capture technology over 3 years, from 2005 to 2007, to 11 performances from 7 traditional dances whose body gestures have considerable value among the Important Intangible Cultural Heritage performances. It is supported by lottery funds.
In 2005, the first year of the project, data were accumulated for solo dances such as Seungmu (monk's dance), Salpuri (a solo dance for spiritual cleansing), and Taepyeongmu (dance of peace), which are relatively easy in terms of performing skills. In 2006, group dances such as Jinju Geommu (Jinju sword dance), Seungjeonmu (dance for victory), and Cheoyongmu (dance of Lord Cheoyong) will be documented. In 2007, the last year of the project, an education programme for comparative studies, analysis, and transmission of intangible cultural heritage and three-dimensional content for public service will be devised based on the accumulated data, alongside the documentation of Hakyeonhwadae Habseolmu (crane dance combined with the lotus blossom dance). By describing the processes and results of the motion capture documentation of Salpuri (Lee Mae-bang), Taepyeongmu (Kang Seon-young), and Seungmu (Lee Mae-bang, Lee Ae-ju, and Jung Jae-man) conducted in 2005, this report introduces a new approach to the documentation of intangible cultural heritage. During the first year of the project, two questions were raised. First, how can we capture the motions of a holder (dancer) without interruption during a long performance? After many tests, the motion capture system proved stable, with continuous results. Second, how can we reproduce accurate motion without the re-targeting process? The project re-created the dancers' gestures as accurately as possible by applying, for the first time in Korea, a new technology that extracts digital data on the shape of the dancer's body before the motion capture process. The accurate three-dimensional body models of four holders obtained by body scanning enhanced the accuracy of the motion capture of the dances.

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon; Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems / v.28 no.1 / pp.155-174 / 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) were published. The rapid increase in the number of COVID-19 papers places time and technical constraints on healthcare professionals and policy makers who need to find important research quickly. In this study, we therefore propose a method for extracting useful information from the text of an extensive literature using the LDA and Word2vec algorithms. Papers related to the search keywords were extracted from the COVID-19 papers, and detailed topics were identified. The data came from the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic and updated weekly. The research method has two main parts. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of COVID-19 publications by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. The LDA and Word2vec algorithms were used to derive research topics related to COVID-19, and after related words were analyzed, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, yielding a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment'. For each collected paper, detailed topics were analyzed using the LDA and Word2vec algorithms, and a clustering method with PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy result is that topics that did not emerge from topic modeling over all COVID-19 papers were found to emerge from the topic modeling results for each research topic. For example, topic modeling of the papers related to 'vaccine' extracted a new topic, Topic 05 'neutralizing antibodies'. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and it is said to play an important role in the production of therapeutic agents and in vaccine development. In addition, extracting topics from the papers related to 'treatment' revealed a new topic, Topic 05 'cytokine'. A cytokine storm occurs when the body's immune cells attack normal cells instead of defending against attacks. Hidden topics that could not be found across the entire set of papers were classified by keyword, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using Skip-gram, the Word2vec model that predicts surrounding words from the central word. Combining the LDA and Word2vec models aimed for better performance by identifying the relationship between documents and LDA topics and the relationships among documents captured by Word2vec. In addition, as a clustering method with PCA dimension reduction, we presented a way to classify documents intuitively, using the t-SNE technique to group documents with similar themes into a structured organization. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of related academic papers, we hope this method will save healthcare professionals and policy makers precious time and effort and help them gain new insights quickly. It is also expected to be used as basic data for researchers exploring new research directions.
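
The Skip-gram model mentioned in this abstract learns word vectors from (center word, context word) pairs drawn from a sliding window over the text. The sketch below only shows how such training pairs are generated; it is an illustration under assumed settings (the function name, the window size, and the sample sentence are hypothetical), not the authors' pipeline, and it does not train any vectors.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every token within `window` of each center."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is not its own context
                pairs.append((center, tokens[j]))
    return pairs

# Made-up sentence echoing the 'neutralizing antibodies' topic in the abstract.
tokens = "neutralizing antibodies protect cells".split()
pairs = skipgram_pairs(tokens, window=1)
for center, context in pairs:
    print(center, "->", context)
```

Words that appear in similar contexts generate similar sets of pairs, which is what lets a trained Skip-gram model place them close together and return them as "similar words".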

