• Title/Summary/Keyword: exploratory data analysis

Search Result 1,337, Processing Time 0.03 seconds

An Exploratory Study on the Project Performance by PMO Capability (PMO 역량에 따른 프로젝트 성과에 관한 연구)

  • Bae, Jae-Kwon;Kim, Jin-Hwa;Kim, Sang-Yeoul
    • Asia pacific journal of information systems
    • /
    • v.18 no.1
    • /
    • pp.53-77
    • /
    • 2008
  • In recent years, although numbers of corporations are bringing in PMO, they seem to be indifferent to PMO performance measurement. This demonstrates that there are also other reasons beside performance measurement of information systems (IS) project being ambiguous by introducing PMO; the lack of acknowledging the concrete function of PMO, and the scarcity of empirical study about the effect of PMO on the project members and project performance. In this sense, this study is aimed at proposing a new research model in which project success factors (i.e., standardization, management advocacy, and staff expertise) affect PMO capability (i.e., knowledge management, resources management, and problem solving competency) positively, leading to project performance (i.e., task outcomes, psychological outcomes, and organizational outcomes) eventually. To empirically test the research model, data are surveyed from PMO department and IS department. To prove the validity of the proposed research model, PLS analysis is applied with valid 132 questionnaires. By employing PLS technique, the measurement reliability and validity of research variables are tested and the path analysis is conducted to do the hypothesis testing. The path analysis results can be organized into 7 ways in large scale. First, standardization of project success factors has a positive association with knowledge management, resources management, and problem solving competency of PMO capabilities. The findings of this result indicate that the multiple or single project management should satisfy standardization in order to operate an effective PMO. Second, management advocacy of project success factors has a positive association with knowledge management, resources management, and problem solving competency. Management advocacy refers to the willingness of management to provide the required resources and authority for project success. There is agreement among researchers regarding the importance of management advocacy for favorable PMO capability. Third, staff expertise of project success factors has a positive association with knowledge management, resources management, and problem solving competency. The findings of this result indicate that the formation of an exceptional consultant or members with a proficient knowledge for staff expertise of project member is the key factor to elevate the PMO capability. Past research suggests that experience and knowledge and the resultant familiarity with the problem faced can be an important determinant of PMO capability. A capable project with appropriate staff expertise means that it enjoys a diversity of abilities and experiences. Fourth, knowledge management competency of PMO capabilities has a positive impact on psychological outcomes but has no direct effect on task outcomes and organizational outcomes. In domestic case of S. Korea, PMO was finally introduced to many other corporations in 2005 though it started bringing in 2000. Therefore, it had neither a significant impact on the task outcomes nor organizational outcomes by lacking the contents and the infrastructure of the knowledge management because the knowledge consolidation and management period of PMO is comparatively shorter by terms than other foreign nations. Fifth, resources management competency of PMO capabilities has a positive association with task outcomes, psychological outcomes, and organizational outcomes. In addition, problem solving competency of PMO capabilities has a positive association with task outcomes, psychological outcomes, and organizational outcomes. Therefore, the findings of this results stress that PMO capabilities has a positive impact on project performance. Sixth, according to the path analysis of the hypothesis, which suggested in this research, problem solving competency is the PMO capability which is the key success factor for task, psychological, and organizational outcomes as an integrated performance model. Further, the analysis reveals that problem solving competency is an important factor for integrated performance model. The finding is in line with past IS research, which affirms that the work of IS projects is essentially a problem solving endeavor. Seventh, in the path analysis of the hypothesis in this research, the path of the management advocacy $\rightarrow$ problem solving competency $\rightarrow$ organizational outcomes appears to be the most important and strongest path. In brief, the finding of this study suggests that project success factors influence PMO capability positively, and project performance as well. From the results, it can be concluded that PMO helped great improve the project success rate and project performance. This study advances research on PMO capability in three important aspects. First, the findings of our study have implications for IS theory and future research. Our study contributes to IS theory by synthesizing concepts from PMO research and project management research with those in IS research. We proposed and tested PMO capability of IS projects and the findings of our investigation provided some preliminary answers to some of the questions raised. Secondly, this thesis does not only help depicting the concept of IT governance but also approaches empirically. It makes a gradual approach to the main content, step by step, in contrary of simple standard, scholastic way of thinking. Finally, we argued that this task-oriented(technical) view is not sufficient to adequately conceptualize IS project performance. Hence, we applied that the research on organization teams, which provides a flip viewpoint to that of project management research in that it gives more weight for psychological outcomes of organizational work groups, can be very helpful in reconceptualizing the IS project performance construct. The limitations of this study are also discussed to provide research directions for future research.

The Effect of Consumer's Perceptual Characteristics for PB Products on Relational Continuance Intention: Mediated by Brand Trust and Brand Equity (PB상품에 대한 소비자의 지각특성이 관계지속의도에 미치는 영향: 브랜드신뢰 및 브랜드자산을 매개로 한 정책적 접근)

  • Lim, Chaekwan
    • Journal of Distribution Research
    • /
    • v.17 no.5
    • /
    • pp.85-111
    • /
    • 2012
  • Introduction : The purpose of this study was to examine the relationship between perceptual characteristics of consumers and intent of relational continuance for PB(Private Brand) products in discount stores. This study was conducted as an empirical study based on survey. For the empirical study, factors of PB products as characteristics perceived by consumers such as perceived quality, store image, brand image and perceived value were deduced from preceding studies. The effect of such factors on intent of relational continuance mediated by brand trust and brand equity of PB products was structurally examined. Research Model : Based on theory analysis and hypotheses, constructed a Structural Equation Model(SEM). The research model is shown in Figure 1. Research Method : This paper is based on s qualitative study of selected literature and empirical data. The survey for empirical study was carried out on consumers in Gyeonggi and Busan between January 2012 and May 2012. 300 surveys were distributed and 253 (84.3%) of them were returned. After excluding omissions and insincere responses, 245 surveys (81.6%) were used for final analysis as effective samples. Result : First of all, the Reliability was carried out for instrument used. The lower limit of 0.7 for Cronbach's Alpha as suggested by Hair et al. (1998). And Construct validity was established by carrying out exploratory factor analysis by Varimax rotation for all. Four factor result for the consumer's perceptual characteristics of PB Products, two mediating factors and one dependent factor. All constructs included in research framework have acceptable validity and reliability. Table 1 shows the factor loading, eigen value, explained variance and Cronbach's alpha for each factor. In order to assure validity of constructs, I implemented Confirmatory Factor Analysis (CFA), using AMOS 20.0. In confirmatory factor analysis, researcher can take control over the specification of indicators for each factor by hypothesizing that a specific factor is loaded with the relevant indicators. Moreover, CFA is particularly useful in the validation of scale for the measurement of specific construct. CFA result summarized Table 2 shows that the fit measures of all constructs fulfill the recommended level and loadings are significant. To test causal relationship between constructs in the research model, used AMOS 20.0 that provides a graphic module as method for analysing Structural Equation Modeling. The result of hypothesis test is shown in Table 3. As a result of empirical study, perceived quality, brand image and perceived value as selected attributes for PB products showed significantly positive (+) effect on brand trust and brand equity. Furthermore, brand trust and brand equity showed significantly positive (+) effect on intent of relational continuance. However, store image of discount stores selling the PB products was analyzed to have positive (+) effect on brand trust and no significant effect on brand equity. Discussion : Based on the results of this study, the relationship between overall quality, store image, brand image and value perceived by consumers about PB products and intent of relational continuance was structurally verified as being mediated by brand trust and brand equity. Looking at the results, a strategic approach that maximizes brand trust and equity value for PB products by large discount stores is required on top of basic efforts to improve quality, brand image and value of PB products in order to maximize consumer's intent of relational continuance and to continuously attract repeated purchase of products.

  • PDF

Factorial Validity of the Korean Version of the Illness Intrusive Rating Scale among Psychiatric Outpatients Mainly Diagnosed with Anxiety or Depressive Disorders (불안 및 우울장애를 주요 진단으로 하는 정신건강의학과 외래환자 대상 한국판 질병침습도 평가척도의 요인 타당도 연구)

  • Cho, Yubin;Kim, Daeho;Kim, Eunkyung;Jo, Hwa Yeon;Yun, Mirim;Lee, Hoseon
    • Korean Journal of Psychosomatic Medicine
    • /
    • v.27 no.2
    • /
    • pp.77-84
    • /
    • 2019
  • Objectives : The Illness Intrusiveness Rating Scale (IIRS) is a well-validated self-report instrument for assessing negative impact of chronic illness and/or adverse effects of its treatment on everyday life domains. Although extensive literature probed its psychometric properties in medical illness, little attention was paid for its validity for psychiatric population. This study aimed to test factorial structure of the Korean Version of the IIRS (IIRS-K) in a consecutive sample of psychiatric outpatients. Methods : Data set of 307 first-visit patients of psychiatric clinic at Guri Hanyang univ. Hospital were used. Exploratory and confirmatory factor analysis, internal consistency were tested in IIRS-K. We also checked Spearman's correlation analysis between IIRS-K, Zung's self-report anxiety scale and Zung's self-report depression scale. Results : 76.9% of the patients were with anxiety disorder and depressive disorder. The principal component factor analysis of the IIRS-K extracted three-factor structure accounted for 63.2% of total variance that was contextually similar to the original English version. This three-factor solution showed the best fit when tested confirmatory factor analysis compared to the original IIRS, two-factor model of IIRS-K suggested from medical outpatients, and one-factor solution. The IIRS-K also showed good internal consistency (Cronbach's α=0.90) and good convergent validity with anxiety and depression scales. Conclusions : The IIRS-K showed the three-factor structure that was similar but not identical to original version. Overall, this study proved factorial validity of the IIRS-K and it can be used for Korean clinical population.

The Nature of Patient's Disagreement with Doctors among Some Rural Residents (일부 농촌주민에서 의사에 대한 환자의 의견불일치)

  • Lee, Moo-Sik;Cho, Hyong-Won;Kim, Eun-Young;Chun, Byung-Chul;Shin, Dong-Hoon
    • Journal of agricultural medicine and community health
    • /
    • v.24 no.2
    • /
    • pp.315-329
    • /
    • 1999
  • Recently, dissatisfaction with aspects of health care has been complemented by directly at complaints such as informal, formal and litigation. But some people take action and other not in spite of feeling of dissatisfaction. This study was to investigate an accounts of patient's disagreement with doctor's care from a community sample, and make a distinction between felt disagreement and disagreement actions. This study was done in six hundred forty residents in Sungjoo County of Kyungbuk Province and Nonman city of Chungnam Province. The questionnaires of interview included sociodemographic data, health status data, a nature of patient's disagreement with doctor and actions taken following or during the disagreement episode. Approximately sixteen percent of sample reported a disagreement, and nine percent reported action taken following or during the disagreement episode. Age, educational attainment, income and area were significantly related with experience of disagreement episode in univariate analysis. In people who experienced the disagreement episode, nearly forty-one percent reported on disagreement about the diagnosis related, twenty-eight percent reported doctor-patients relationship related, twenty percent reported treatment related, and eleven percent reported prescription drug related. In people who experienced actions taken following or during the disagreement episode, nearly fifty-four percent acted as 'sought a second opinion or visit other doctor', thirty-six percent acted as 'verbally challenged the doctor', thirty-two percent acted as 'stopped prescribed treatment or medication', twenty-nine percent acted as 'made repeat visits to the same doctor', twenty-five percent acted as 'eventually left and changed doctor'. Results of multivariate analysis, age, marital status, have or haven't chronic disease, and general satisfaction with health service were significantly related with experience of disagreement episode and marital status was significantly related with experience of actions taken following or during the disagreement episode. This study is experimental and exploratory trial about a relationship between patient's disagreement with doctor and actions taken following or during the disagreement episode in some community of Korea. We find that patient's disagreement with doctor and actions taken following or during the disagreement episode is latent in our community. We suggest that the relationship between felt disagreement and disagreement action is more complicated and worthy of further study.

  • PDF

A Study on Evaluation of Visual Factor for Measuring Subjective Virtual Realization (주관적인 가상 실감화 측정 방법에 대한 시각적 요소 평가 연구)

  • Won, Myeung-Ju;Park, Sang-In;Kim, Chi-Jung;Lee, Eui-Chul;Whang, Min-Cheol
    • Science of Emotion and Sensibility
    • /
    • v.15 no.3
    • /
    • pp.389-398
    • /
    • 2012
  • Virtual worlds have pursued reality as if they actually exist. In order to evaluate the sense of reality in the computer-simulated worlds, several subjective questionnaires, which include specific independent variables, have been proposed in the literature. However, the questionnaires lack reliability and validity necessary for defining and measuring the virtual realization. Few studies have been conducted to investigate the effect of visual factors on the sense of reality experienced by exposing to a virtual environment. Therefore, this study was aimed at reinvestigating the variables and proposing a more reliable and advisable questionnaire for evaluating the virtual realization, focusing on visual factors. Twenty-one questions were gleaned from the literature and subjective interviews with focused groups. Exploratory factor analysis with oblique rotation was performed on the data obtained from 200 participants(females: 100) after exposing to a virtual character image described in an extreme way. After removing poorly loading items, remained subsets were subjected to confirmatory factor analysis on the data obtained from the same participants. As a result, 3 significant factors were determined to efficiently measure the virtual realization. The determined factors included visual presence(3 subset items), visual immersion(7 subset items), and visual interactivity(4 subset items). The proposed factors were verified by conducting a subjective evaluation in which participants were asked to evaluate a 3D virtual eyeball model based on the visual presence. The results implicated that the measurement method was suitable for evaluating the degree of the virtual realization. The proposed method is expected to reasonably measure the degree of the virtual realization.

  • PDF

The Effect of Participation Degree in Sports for all of People with Physical Disabilities on Positive Psychological Capital(PPC) (지체장애인의 생활체육 참여정도가 긍정심리자본(PPC)에 미치는 영향)

  • Kim, Dae-Kyung;Park, Jin-Woo;Kim, Hye-Min;Lee, Hyun-Su
    • 한국체육학회지인문사회과학편
    • /
    • v.54 no.5
    • /
    • pp.867-876
    • /
    • 2015
  • This study was intended to closely examine an effect that the level of physically challenged person's participation in community sports had on positive psychological capital. In order to accomplish the purpose of study, data on 212 physically challenged persons who lived in B city and participated in community sports were analyzed. Korean version of positive psychological capital created by Taehong Lim (2014) through the reconstruction of scales developed by Luthans, Youssef and Avolio(2007) and Sangwan Jeon and Jonghun Yang's (2009) level of participation in community sports was reconstructed through modification·improvement as measurement instrument. An exploratory factor analysis, reliability test, paired difference test, and multiple regression analysis was carried out by using SPSS 18.0 program for data processing. First, It was shown that there was a significant difference in positive psychological capital according to gender, age, and disability grade among physically challenged persons' socio-demographic characteristics. Second, it was shown that, among sub-variables (period, frequency and intensity) of level of physically challenged persons' participation in community sports, the frequency of participation and the intensity of participation had a significant effect on self efficacy. On the other hand, it was shown that the period of participation didn't have a significant effect. Third, it was shown that the frequency of participation had a significant effect on optimism. On the other hand, it was shown that the period of participation and the intensity of participation didn't have a significant effect. Fourth, it was shown that the frequency of participation and the intensity of participation had a significant effect on hope. On the other hand, it was shown that no significant effect was produced on the period of participation. Fifth, it was shown that the frequency of participation had a significant effect on resilience. On the other hand, it was shown that no significant effect was produced on the period of participation and the intensity of participation. Sixth, it was shown that the frequency of participation and the intensity of participation had a significant effect on positive psychological capital. And it was shown that no significant effect was produced on the period of participation.

The Effect of Chinese Customer Coffee Benefit Sought on Korean Coffee Shop Satisfaction, Attachment, and Loyalty - Based on Mediating Effect of Korean Wave Attitude - (중국소비자의 커피제품 추구편익이 한국 커피전문점 만족도와 애착 및 충성도에 미치는 영향에 관한 연구 - 한류태도 매개효과를 중심으로 -)

  • Lee, Hyung-Ju;Suh, Ji-Youn
    • Culinary science and hospitality research
    • /
    • v.22 no.5
    • /
    • pp.151-166
    • /
    • 2016
  • The purpose of this study is to understand the influence of Chinese customer coffee sought benefits on satisfaction with, and attachment and loyalty to Korean coffee shops. Based on a total of 200 samples obtained for empirical research from 10 Mar. to 25 July, 2015, of self-administrated questionaries completed by patrons in Beijing, Shanghai, Haerbin in China, data were analyzed for frequency, exploratory factor analysis, reliability analysis, correlation analysis, multiple regression and hierarchical multiple regression analysis. The results of this study are summarized as follows. First, it was found that Chinese customer sought pursuits (functional & experimental benefits, symbolic benefit) had an effect on satisfaction of Korea coffee shop. Second, satisfaction influenced Korean coffee shop attachment and loyalty. Third, Korean wave attitude had a mediating effect between satisfaction, attachment and loyalty. From the results, we can conclude following implications: First, by providing atmosphere of South Korea, menu, barista service, we can predict that Korean coffee brands can prevail in competition through active promotions of their brands. Second, Korean coffee brands can make a strategy that includes providing full service from trained South Korean baristas and hosting talk shows between baristas from South Korea. Third, providing the opportunity to visit South Korea for local cafe tours is a good social event. These results will help control marketing strategies in China. Limitations and future research directions are also discussed.

A Validating Academic Engagement as a Multidimensional Construct for Korean College Students: Academic Motivation, Engagement, and Satisfaction (대학생용 학업참여 척도(UWES-S)의 타당화: 학업동기, 참여 및 만족도의 구조적 관계)

  • Choo, Huntaek;Sohn, Wonsook
    • Korean Journal of School Psychology
    • /
    • v.9 no.3
    • /
    • pp.485-503
    • /
    • 2012
  • Academic engagement has been known as a strong predictor of students' cognitive and affective outcomes in an educational context. Despite increasing interest and theoretical usefulness of this construct, a few researchers seem to be interested in the validation of instruments to measure academic engagement for Korean students. Thus, this study would like to introduce one of academic scales widely used, UWES-S(Utrecht Work Engagement Scale-Student) (Schaufeli et al., 2002a: 2002b) and to validate the UWES-S for Korean college students. To validate the Korean version of the UWES-S, 651 college students (285 for Field Trial, 366 for Main Study) were used. The procedure is as follows. First, we used an integrated adaptation procedure to produce a Korean version of the UWES-S. Second, EFA(exploratory factor analyses) was applied to explore the factor structure of the UWES-S on the field trial data. Third, the psychometric properties of the UWES-S items were examined by graded response model(GRM). Also CFA(confirmatory factor analysis) was used to examine its internal construct validity for the data from the main study. Finally, the external validity of the UWES-S was scrutinized with the related variables such as academic motivation and satisfaction. As a result, the Korean version of the UWES-S with 13 items was accepted that the four items were excluded from its original version. Second, the internal validity was supported that the 3 factor CFA model(vigor, dedication, absorption) fit the data well. Third, we supported the partial mediation model that academic engagement played as a mediating variable between academic motivation(internal/external) and academic satisfaction. Finally, the differences between a validation of UWES-S for Korean college and high school students, the necessity of construct equivalence testing, and direction for future research of scale validating were discussed.

Service Quality, Customer Satisfaction and Customer Loyalty of Mobile Communication Industry in China (중국이동통신산업중적복무질량(中国移动通信产业中的服务质量), 고객만의도화고객충성도(顾客满意度和顾客忠诚度))

  • Zhang, Ruijin;Li, Xiangyang;Zhang, Yunchang
    • Journal of Global Scholars of Marketing Science
    • /
    • v.20 no.3
    • /
    • pp.269-277
    • /
    • 2010
  • Previous studies have shown that the most important factor affecting customer loyalty in the service industry is service quality. However, on the subject of whether service quality has a direct or indirect effect on customer loyalty, scholars' views apparently vary. Some studies suggest that service quality has a direct and fundamental influence on customer loyalty (Bai and Liu, 2002). However, others have shown that service quality not only directly affects customer loyalty, it also has an indirect impact on customer loyalty by influencing customer satisfaction and perceived value (Cronin, Brady, and Hult, 2000). Currently, there are few domestic articles that specifically address the relationship between service quality and customer loyalty in the mobile communication industry. Moreover, research has studied customer loyalty as a whole variable, rather than breaking it down further into multiple dimensions. Based on this analysis, this paper summarizes previous study results, establishes an effect mechanism model among service quality, customer satisfaction, and customer loyalty in the mobile communication industry, and presents a statistical test on model assumptions by using customer investigation data from Heilongjiang Mobile Company. It provides theoretical guidance for mobile service management based on the discussion of the hypothesis test results. For data collection, the sample comprised mobile users in Harbin city, and the survey was taken by random sampling. Out of a total of 300 questionnaires, 276 (92.9%) were recovered. After excluding invalid questionnaires, 249 remained, for an effective rate of 82.6 percent for the study. Cronbach's ${\alpha}$ coefficient was adapted to assess the scale reliability, and validity testing was conducted on the questionnaire from three aspects: content validity, construct validity. and convergent validity. The study tested for goodness of fit mainly from the absolute and relative fit indexes. From the hypothesis testing results, overall, four assumptions have not been supported. The ultimate affective relationship of service quality, customer satisfaction, and customer loyalty is demonstrated in Figure 2. On the whole, the service quality of the communication industry not only has a direct positive significant effect on customer loyalty, it also has an indirect positive significant effect on customer loyalty through service quality; the affective mechanism and extent of customer loyalty are different, and are influenced by each dimension of service quality. This study used the questionnaires of existing literature from home and abroad and tested them in empirical research, with all questions adapted to seven-point Likert scales. With the SERVQUAL scale of Parasuraman, Zeithaml, and Berry (1988), or PZB, as a reference point, service quality was divided into five dimensions-tangibility, reliability, responsiveness, assurance, and empathy-and the questions were simplified down to nineteen. The measurement of customer satisfaction was based mainly on Fornell (1992) and Wang and Han (2003), ending up with four questions. Based on the study’s three indicators of price tolerance, first choice, and complaint reaction were used to measure attitudinal loyalty, while repurchase intention, recommendation, and reputation measured behavioral loyalty. The collection and collation of literature data produced a model of the relationship among service quality, customer satisfaction, and customer loyalty in mobile communications, and China Mobile in the city of Harbin in Heilongjiang province was used for conducting an empirical test of the model and obtaining some useful conclusions. First, service quality in mobile communication is formed by the five factors mentioned earlier: tangibility, reliability, responsiveness, assurance, and empathy. On the basis of PZB SERVQUAL, the study designed a measurement scale of service quality for the mobile communications industry, and obtained these five factors through exploratory factor analysis. The factors fit basically with the five elements, indicating the concept of five elements of service quality for the mobile communications industry. Second, service quality in mobile communications has both direct and indirect positive effects on attitudinal loyalty, with the indirect effect being produced through the intermediary variable, customer satisfaction. There are also both direct and indirect positive effects on behavioral loyalty, with the indirect effect produced through two intermediary variables: customer satisfaction and attitudinal loyalty. This shows that better service quality and higher customer satisfaction will activate the attitudinal to service providers more active and show loyalty to service providers much easier. In addition, the effect mechanism of all dimensions of service quality on all dimensions of customer loyalty is different. Third, customer satisfaction plays a significant intermediary role among service quality and attitudinal and behavioral loyalty, indicating that improving service quality can boost customer satisfaction and make it easier for satisfied customers to become loyal customers. Moreover, attitudinal loyalty plays a significant intermediary role between service quality and behavioral loyalty, indicating that only attitudinally and behaviorally loyal customers are truly loyal customers. The research conclusions have some indications for Chinese telecom operators and others to upgrade their service quality. Two limitations to the study are also mentioned. First, all data were collected in the Heilongjiang area, so there might be a common method bias that skews the results. Second, the discussion addresses the relationship between service quality and customer loyalty, setting customer satisfaction as mediator, but does not consider other factors, like customer value and consumer features, This research will be continued in the future.

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.1
    • /
    • pp.155-174
    • /
    • 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (Coronavirus-2, a fatal respiratory syndrome) have been published. The rapid increase in the number of papers related to COVID-19 is putting time and technical constraints on healthcare professionals and policy makers to quickly find important research. Therefore, in this study, we propose a method of extracting useful information from text data of extensive literature using LDA and Word2vec algorithm. Papers related to keywords to be searched were extracted from papers related to COVID-19, and detailed topics were identified. The data used the CORD-19 data set on Kaggle, a free academic resource prepared by major research groups and the White House to respond to the COVID-19 pandemic, updated weekly. The research methods are divided into two main categories. First, 41,062 articles were collected through data filtering and pre-processing of the abstracts of 47,110 academic papers including full text. For this purpose, the number of publications related to COVID-19 by year was analyzed through exploratory data analysis using a Python program, and the top 10 journals under active research were identified. LDA and Word2vec algorithm were used to derive research topics related to COVID-19, and after analyzing related words, similarity was measured. Second, papers containing 'vaccine' and 'treatment' were extracted from among the topics derived from all papers, and a total of 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment' were extracted. did For each collected paper, detailed topics were analyzed using LDA and Word2vec algorithms, and a clustering method through PCA dimension reduction was applied to visualize groups of papers with similar themes using the t-SNE algorithm. A noteworthy point from the results of this study is that the topics that were not derived from the topics derived for all papers being researched in relation to COVID-19 (

    ) were the topic modeling results for each research topic (
    ) was found to be derived from For example, as a result of topic modeling for papers related to 'vaccine', a new topic titled Topic 05 'neutralizing antibodies' was extracted. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and is said to play an important role in the production of therapeutic agents and vaccine development. In addition, as a result of extracting topics from papers related to 'treatment', a new topic called Topic 05 'cytokine' was discovered. A cytokine storm is when the immune cells of our body do not defend against attacks, but attack normal cells. Hidden topics that could not be found for the entire thesis were classified according to keywords, and topic modeling was performed to find detailed topics. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using the Skip-gram method that predicts the similar words as the central word among the Word2vec models. The combination of the LDA model and the Word2vec model tried to show better performance by identifying the relationship between the document and the LDA subject and the relationship between the Word2vec document. In addition, as a clustering method through PCA dimension reduction, a method for intuitively classifying documents by using the t-SNE technique to classify documents with similar themes and forming groups into a structured organization of documents was presented. In a situation where the efforts of many researchers to overcome COVID-19 cannot keep up with the rapid publication of academic papers related to COVID-19, it will reduce the precious time and effort of healthcare professionals and policy makers, and rapidly gain new insights. We hope to help you get It is also expected to be used as basic data for researchers to explore new research directions.


  • (34141) Korea Institute of Science and Technology Information, 245, Daehak-ro, Yuseong-gu, Daejeon
    Copyright (C) KISTI. All Rights Reserved.