• Title/Summary/Keyword: Research Topic


A Prestigious University Students' Perceptions of their Educational Attainment by a Topic model (토픽모델을 활용한 명문대 재학생의 학벌에 관한 인식 분석)

  • Young Son Jung;Seung-Yun Lee
    • The Journal of the Convergence on Culture Technology / v.10 no.3 / pp.503-512 / 2024
  • This study examines essays on academic background written by students at a university that is regarded as prestigious in Korean society. Using Latent Dirichlet Allocation, 172 essays were analyzed to explore the students' perspectives on academic fractionalism. The analysis identified five topics: functional aspects (Topic 1), double-edged nature (Topic 2), power communities (Topic 3), symbols of victory (Topic 4), and dysfunctional aspects (Topic 5). The most frequently appearing keywords are 'individual,' 'status,' and 'means' in Topic 1; 'definition,' 'school,' and 'meaning' in Topic 2; 'people,' 'origin,' and 'power' in Topic 3; 'university,' 'ability,' and 'effort' in Topic 4; and 'academic achievement,' 'South Korea,' and 'origin' in Topic 5. By exploring the topics, we found that students regarded class reproduction through education as an important social issue and showed little interest in other factors influencing academic fractionalism, such as race or ethnicity. These findings suggest that professors who teach the impact of education on academic fractionalism should address the influence of diverse factors on academic fractionalism.

Generative probabilistic model with Dirichlet prior distribution for similarity analysis of research topic

  • Milyahilu, John;Kim, Jong Nam
    • Journal of Korea Multimedia Society / v.23 no.4 / pp.595-602 / 2020
  • We propose a generative probabilistic model with a Dirichlet prior distribution for topic modeling and text similarity analysis. It assigns a topic to each document and calculates the text correlation between documents within a corpus. It also provides the posterior probabilities assigned to each topic of a document, based on the prior distribution in the corpus. We then present a Gibbs sampling algorithm for inference about the posterior distribution and compute the text correlation among 50 abstracts from papers published by IEEE. We also conduct supervised learning to set a benchmark that justifies the performance of LDA (Latent Dirichlet Allocation). The experiments show that the accuracy of topic assignment to a given document is 76% for LDA. The results for supervised learning show an accuracy of 61%, a precision of 93%, and an F1-score of 96%. The discussion of the experimental results justifies the topic assignments in terms of probabilities, distributions, evaluation metrics, and correlation coefficients.
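The abstract above names Gibbs sampling for LDA inference and a text correlation between documents, but gives no implementation details. The following is a minimal sketch of one common variant (collapsed Gibbs sampling), followed by a Pearson correlation between document-topic distributions. The function names `lda_gibbs` and `pearson`, the hyperparameters `alpha` and `beta`, and the toy corpus are all illustrative assumptions, not the authors' code or data.

```python
import random
from math import sqrt


def lda_gibbs(docs, n_topics, n_iters=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (illustrative sketch).

    docs is a list of token lists. Returns (theta, phi, vocab), where
    theta[d][k] estimates P(topic k | doc d) and phi[k][v] estimates
    P(word v | topic k).
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    ndk = [[0] * n_topics for _ in docs]            # doc-topic counts
    nkw = [[0] * n_words for _ in range(n_topics)]  # topic-word counts
    nk = [0] * n_topics                             # tokens per topic
    z = []                                          # topic of every token

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z.append([])
        for w in doc:
            k = rng.randrange(n_topics)
            z[d].append(k)
            ndk[d][k] += 1
            nkw[k][word_id[w]] += 1
            nk[k] += 1

    for _ in range(n_iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v, k = word_id[w], z[d][i]
                # Remove the token's current assignment from the counts.
                ndk[d][k] -= 1
                nkw[k][v] -= 1
                nk[k] -= 1
                # Full conditional P(z = t | all other assignments), unnormalized.
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t][v] + beta) / (nk[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                ndk[d][k] += 1
                nkw[k][v] += 1
                nk[k] += 1

    theta = [
        [(ndk[d][t] + alpha) / (len(doc) + n_topics * alpha) for t in range(n_topics)]
        for d, doc in enumerate(docs)
    ]
    phi = [
        [(nkw[t][v] + beta) / (nk[t] + n_words * beta) for v in range(n_words)]
        for t in range(n_topics)
    ]
    return theta, phi, vocab


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (0.0 if either is constant)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return cov / den if den else 0.0


# Hypothetical toy corpus standing in for the abstracts.
docs = [
    "topic model corpus document".split(),
    "gibbs sampling posterior topic model".split(),
    "wind turbine blade generator".split(),
    "turbine blade wind farm offshore".split(),
]
theta, phi, vocab = lda_gibbs(docs, n_topics=3)
corr = [[pearson(a, b) for b in theta] for a in theta]
```

Here `corr[i][j]` compares the topic mixtures of documents `i` and `j`; whether the paper correlates topic mixtures or some other document representation is not stated in the abstract.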

Research on the Movie Reviews Regarded as Unsuccessful in Box Office Outcomes in Korea: Based on Big Data Posted on Naver Movie Portal

  • Jeon, Ho-Seong
    • Asia-Pacific Journal of Business / v.12 no.3 / pp.51-69 / 2021
  • Purpose - Based on the literature on movie reviews and movie ratings, this study raised two research questions about the content of online word of mouth and the number of movie screens as mediator variables. Research question 1 asked which topic word groups had a positive or negative impact on movie ratings. Research question 2 sought to identify the role of the number of movie screens between movie ratings and box office outcomes. Design/methodology/approach - Using R, this study collected about 82,000 movie reviews and movie ratings posted on Naver's movie website to examine the role of online word of mouth and movie screen counts for 10 movies that were considered commercially unsuccessful, with fewer than 2 million viewers despite securing about 1,000 movie screens. To address research question 1, topic modeling, a text mining technique, was conducted on the movie reviews. In addition, this study linked the movie ratings posted on Naver with KOBIS information by date to address research question 2. Findings - Topic modeling identified 5 topics. These topics fell largely into two groups: the content of the movie (topics 1, 2, 3) and the evaluation of the movie (topics 4, 5). When the relationship between movie reviews and movie ratings was analyzed with the 5 mediators identified by topic modeling to probe research question 1, the topic word groups related to topics 2, 3, and 5 appeared to have a negative effect on netizens' movie ratings. In addition, the two secondary data sources were connected by date to analyze research question 2. The outcomes showed that the causal relationship between movie ratings and audience numbers was mediated by the number of movie screens. Research implications or Originality - The results suggested that information presented in text format was harder to quantify than information provided as scores, but if content information could be digitized through text mining techniques, it could become a variable and be analyzed to identify causality with other variables. The outcomes for research question 2 showed that movie ratings had a direct impact on the number of viewers, but also had indirect effects through changes in the number of movie screens. An interesting point is that the direct effect of movie ratings on the number of viewers is found in most American films released in Korea.

A Study on Identifying Topics and Trends in International Cadastral Research Using LDA: With Special Reference to the FIG Peer Review Journal (LDA를 이용한 국제지적연구의 주제와 추세확인에 관한 연구: 특히 FIG Peer Review Journal을 중심으로)

  • Kim, Yun-Ki
    • Journal of Cadastre & Land InformatiX / v.48 no.1 / pp.15-33 / 2018
  • The main purpose of this study was to identify the topics and research trends of international cadastral research using LDA. To achieve this goal, I reviewed the literature on LDA and international cadastral studies and formulated four research questions concerning the topics studied by cadastral researchers, the distribution of topics, the most influential topics, and changes in topics over time. To answer these research questions, I used LDA to analyze 370 papers published in the FIG Peer Review Journal between January 1, 2008, and October 31, 2017. The analysis confirmed that there are twelve major topics in international cadastral research. The most influential of these was topic 2 (cadastral information systems), and topic 5 (land development and land administration) was also confirmed as playing an important role across the documents. These two topics have been the most popular, with very active trendlines over the past decade, and will play a leading role in future cadastral research.

The Trend of Published Articles to the Korean Journal of Oriental Preventive Medicine - From 1997 to 2010 - (대한예방한의학회지 게재논문의 경향성에 대한 연구 - 창간호(1997년)로부터 2010년까지 -)

  • Park, Hae-Mo
    • Journal of Society of Preventive Korean Medicine / v.15 no.1 / pp.17-27 / 2011
  • Objective: The purpose of this study was to identify research trends in the Korean Journal of Oriental Preventive Medicine and to suggest future perspectives for oriental preventive medicine research. Method: The contents of 344 articles published in this journal were reviewed, from its first year, 1997, to 2010. Result: The number of articles increased over time. By research design, experimental research (in vivo or in vitro) accounted for 36.9%, survey research for 26.5%, and reviews for 20.1%. By major topic classification, health management accounted for 28.5%, oriental medicine effectiveness for 25.3%, herbal safety and toxicity for 13.1%, and environmental and occupational medicine for 9.0%. Conclusion: Topics on health preservation (Yang-saeng), epidemiology, and health statistics have been lacking. Further research needs qualitative studies and work on each subject area of oriental preventive medicine.

Review of Wind Energy Publications in Korea Citation Index using Latent Dirichlet Allocation (잠재디리클레할당을 이용한 한국학술지인용색인의 풍력에너지 문헌검토)

  • Kim, Hyun-Goo;Lee, Jehyun;Oh, Myeongchan
    • New & Renewable Energy / v.16 no.4 / pp.33-40 / 2020
  • The research topics of more than 1,900 wind energy papers registered in the Korea Citation Index (KCI) were modeled into 25 topics using latent Dirichlet allocation (LDA), and their consistency was cross-validated through principal component analysis (PCA) of the document-word matrix. Key research topics in the wind energy field were identified as "offshore, wind farm," "blade, design," "generator, voltage, control," "dynamic, load, noise," and "performance test." As a new method to determine the similarity between research topics in journals, a systematic evaluation method was proposed that analyzes the correlation between topics by constructing a journal-topic matrix (JTM) and clustering journals based on their topic similarity. An evaluation of 24 journals that each published more than 20 wind energy papers confirmed that they were classified into meaningful clusters of mechanical engineering, electrical engineering, marine engineering, and renewable energy. It is expected that the proposed systematic method can be applied to the evaluation of the specificity of subsequent journals.
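The journal-topic matrix (JTM) idea in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: the JTM rows are taken to be the mean document-topic proportions per journal, similarity is cosine similarity, and clustering is a simple threshold-based single-linkage grouping. The function names, the `threshold` value, and the journal names and numbers are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def journal_topic_matrix(doc_topics, doc_journals):
    """Average the per-document topic proportions within each journal."""
    sums, counts = {}, defaultdict(int)
    for theta, journal in zip(doc_topics, doc_journals):
        row = sums.setdefault(journal, [0.0] * len(theta))
        sums[journal] = [s + t for s, t in zip(row, theta)]
        counts[journal] += 1
    return {j: [s / counts[j] for s in row] for j, row in sums.items()}


def cosine(u, v):
    """Cosine similarity of two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster_journals(jtm, threshold=0.8):
    """Group journals whose topic profiles are linked by similarity >= threshold."""
    journals = sorted(jtm)
    parent = {j: j for j in journals}

    def find(j):
        # Union-find lookup with path halving.
        while parent[j] != j:
            parent[j] = parent[parent[j]]
            j = parent[j]
        return j

    for i, a in enumerate(journals):
        for b in journals[i + 1:]:
            if cosine(jtm[a], jtm[b]) >= threshold:
                parent[find(a)] = find(b)

    clusters = defaultdict(list)
    for j in journals:
        clusters[find(j)].append(j)
    return sorted(clusters.values())


# Hypothetical document-topic proportions (2 topics) and journal labels.
doc_topics = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8], [0.85, 0.15]]
doc_journals = ["J-Mech", "J-Mech", "J-Elec", "J-Elec", "J-Marine"]

jtm = journal_topic_matrix(doc_topics, doc_journals)
clusters = cluster_journals(jtm)
print(clusters)  # [['J-Elec'], ['J-Marine', 'J-Mech']]
```

In this toy data "J-Mech" and "J-Marine" share a topic profile and fall into one cluster, while "J-Elec" stands alone. A hierarchical clustering routine could replace the threshold grouping without changing the JTM construction.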

Analysis of sustainable fashion research trends using topic modeling (토픽 모델링을 이용한 지속가능패션 연구 동향 분석)

  • Lee, Hana
    • The Research Journal of the Costume Culture / v.29 no.4 / pp.538-553 / 2021
  • As interest in the sustainable fashion industry continues to increase along with climate issues, it is necessary to identify research trends in sustainable fashion and seek new development directions. Therefore, this study aims to analyze research trends on sustainable fashion. For this purpose, related papers were collected from the KCI (Korea Citation Index) and Scopus, and 340 articles were used for the study. The collected data went through data transformation, data preprocessing, topic modeling analysis, core topic derivation, and visualization through a Python algorithm. A total of eight topics were obtained from the comprehensive analysis: consumer clothing consumption behavior and environment, upcycle product development, product types by environmental approach, ESG business activities, materials and material development, process-based approach, lifestyle and consumer experience, and brand strategy. Topics were related to consumption, production, and education of sustainable fashion, respectively. The separate KCI and Scopus analyses each derived eight topics but differed from the results of the comprehensive analysis. This study provides primary data for exploring various themes of sustainable fashion. It is significant in that the data were analyzed based on probability using a research method that excluded the subjective value of the researcher. It is recommended that follow-up studies be conducted to examine social trends.

Cancer Research Trends in Traditional Korean Medical Journals since 2000 - Topic Modeling Using Latent Dirichlet Allocation and Keyword Network Analysis (2000년 이후 국내 한의학 암 관련 연구 동향 분석 - Latent Dirichlet Allocation 기반 토픽 모델링 및 연관어 네트워크 분석)

  • Kyeore Bae
    • The Journal of Internal Korean Medicine / v.43 no.6 / pp.1075-1088 / 2022
  • Objectives: The aim of this study is to analyze cancer research trends in traditional Korean medical journals indexed in the Korea Citation Index since 2000. Methods: Cancer research papers published in traditional Korean medical journals were searched in databases from inception to October 2022. The numbers of publications by journal and by year were descriptively assessed. After natural language processing, topic modeling (based on Latent Dirichlet allocation) and keyword network analysis were conducted. Results: This research trend analysis involved 1,265 papers. Six topics were identified by topic modeling: case reports on symptom management, literature reviews, experiments on apoptosis, herbal extract treatments of breast carcinoma cell lines, anti-proliferative effects of herbal extracts, and anti-tumor effects. Keyword network analysis found that the effects of herbal medicine were assessed in clinical and experimental studies, while acupuncture was mainly mentioned in clinical reports. Conclusions: Cancer research papers in traditional Korean medical journals have contributed to evidence-based medicine. Further experimental studies are needed to elucidate the effects of on different hallmarks of cancer. Rigorous clinical studies are needed to support clinical guidelines.

A Study on the Trends of Construction Safety Accident in Unstructured Text Using Topic Modeling (비정형 텍스트 기반의 토픽 모델링을 이용한 건설 안전사고 동향 분석)

  • Lee, Sang-Gyu
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.10 / pp.176-182 / 2018
  • To understand and track trends in construction safety accidents, this study examines topic trends in construction safety accidents using an LDA (Latent Dirichlet Allocation)-based topic modeling method. In particular, it identifies the main issues in construction safety accidents through topic-modeling-based analysis of unstructured data, rather than the various structured data analyses used to prevent safety accidents in the construction industry. To apply this methodology, I randomly collected 540 news articles about construction accidents published from January 2017 to February 2018. Applying LDA-based topic modeling to this unstructured data, I found 10 topics and identified key issues through 10 keywords for each of the 10 topics. I forecasted the topic issues related to construction safety accidents based on a time-series trend analysis of the news data from January 2017 to February 2018. This research suggests ways of using unstructured news article data to anticipate safety policy and research fields and to respond to future construction accident safety issues.

Combining Ego-centric Network Analysis and Dynamic Citation Network Analysis to Topic Modeling for Characterizing Research Trends (자아 중심 네트워크 분석과 동적 인용 네트워크를 활용한 토픽모델링 기반 연구동향 분석에 관한 연구)

  • Yu, So-Young
    • Journal of the Korean Society for Information Management / v.32 no.1 / pp.153-169 / 2015
  • This study proposed and examined a combined approach that uses ego-centric network analysis and dynamic citation network analysis to refine the results of LDA-based topic modeling. Two datasets were constructed by collecting Web of Science bibliographic records on White LED, and topic modeling was performed with a different number of topics on each dataset. Top keywords assigned to multiple topics were re-assigned to one specific topic by applying an ego-centric network analysis algorithm. Topical cohesion was strongest, with the best values of the internal clustering evaluation indices, when topic modeling was run on the dataset extracted by SPLC network analysis with the number of topics that gave the lowest perplexity. Furthermore, the study demonstrates the possibility of developing the suggested approach into a method of multi-faceted research trend detection.
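The abstract above selects the number of topics by the lowest perplexity. As a rough sketch of that criterion, assuming fitted document-topic (`theta`) and topic-word (`phi`) distributions are already available, perplexity is the exponential of the negative mean per-token log-likelihood. The two candidate models below are hand-made toy values, not results from the paper, and the function name `perplexity` is illustrative.

```python
from math import exp, log


def perplexity(docs, theta, phi):
    """exp(-(sum of log p(w | d)) / N), with p(w | d) = sum_k theta[d][k] * phi[k][w].

    docs holds word ids; N is the total token count.
    """
    log_lik, n_tokens = 0.0, 0
    for d, doc in enumerate(docs):
        for w in doc:
            p = sum(theta[d][k] * phi[k][w] for k in range(len(phi)))
            log_lik += log(p)
            n_tokens += 1
    return exp(-log_lik / n_tokens)


# Toy corpus over a 4-word vocabulary (word ids 0..3).
docs = [[0, 1, 0, 1], [2, 3, 2, 3]]

# Hypothetical fitted models, keyed by the number of topics.
candidates = {
    1: ([[1.0], [1.0]], [[0.25, 0.25, 0.25, 0.25]]),
    2: ([[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]]),
}

scores = {k: perplexity(docs, theta, phi) for k, (theta, phi) in candidates.items()}
best_k = min(scores, key=scores.get)
print(scores, best_k)
```

In this toy case the two-topic model gives each token probability 0.5 (perplexity 2) against 0.25 for the one-topic model (perplexity 4), so `best_k` is 2. In practice perplexity is usually computed on held-out documents rather than on the training data used here.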