• Title/Summary/Keyword: LDA Method

Search Result 270, Processing Time 0.028 seconds

A first-principles theoretical investigation of the structural, electronic and magnetic properties of cubic thorium carbonitrides ThCxN(1-x)

  • Siddique, Muhammad;Rahman, Amin Ur;Iqbal, Azmat;Azam, Sikander
    • Nuclear Engineering and Technology
    • /
    • v.51 no.5
    • /
    • pp.1373-1380
    • /
    • 2019
  • Besides promising implications as fertile nuclear materials, thorium carbonitrides are of great interest owing to their peculiar physical and chemical properties, such as high density, high melting point, good thermal conductivity. This paper reports first-principles simulation results on the structural, electronic and magnetic properties of cubic thorium carbonitrides $ThC_xN_{(1-x)}$ (X = 0.03125, 0.0625, 0.09375, 0.125, 0.15625) employing formalism of density-functional-theory. For the simulation of physical properties, we incorporated full-potential linearized augmented plane-wave (FPLAPW) method while the exchange-correlation potential terms in Kohn-Sham Equation (KSE) are treated within Generalized-Gradient-Approximation (GGA) in conjunction with Perdew-Bruke-Ernzerhof (PBE) correction. The structural parameters were calculated by fitting total energy into the Murnaghan's equation of state. The lattice constants, bulk moduli, total energy, electronic band structure and spin magnetic moments of the compounds show dependence on the C/N concentration ratio. The electronic and magnetic properties have revealed non-magnetic but metallic character of the compounds. The main contribution to density of states at the Fermi level stems from the comparable spectral intensity of Th (6d+5f) and (C+N) 2p states. In comparison with spin magnetic moments of ThSb and ThBi calculated earlier with LDA+U approach, we observed an enhancement in the spin magnetic moments after carbon-doping into ThN monopnictide.

Detection of Complaints of Non-Face-to-Face Work before and during COVID-19 by Using Topic Modeling and Sentiment Analysis (동적 토픽 모델링과 감성 분석을 이용한 COVID-19 구간별 비대면 근무 부정요인 검출에 관한 연구)

  • Lee, Sun Min;Chun, Se Jin;Park, Sang Un;Lee, Tae Wook;Kim, Woo Ju
    • The Journal of Information Systems
    • /
    • v.30 no.4
    • /
    • pp.277-301
    • /
    • 2021
  • Purpose The purpose of this study is to analyze the sentiment responses of the general public to non-face-to-face work using text mining methodology. As the number of non-face-to-face complaints is increasing over time, it is difficult to review and analyze in traditional methods such as surveys, and there is a limit to reflect real-time issues. Approach This study has proposed a method of the research model, first by collecting and cleansing the data related to non-face-to-face work among tweets posted on Twitter. Second, topics and keywords are extracted from tweets using LDA(Latent Dirichlet Allocation), a topic modeling technique, and changes for each section are analyzed through DTM(Dynamic Topic Modeling). Third, the complaints of non-face-to-face work are analyzed through the classification of positive and negative polarity in the COVID-19 section. Findings As a result of analyzing 1.54 million tweets related to non-face-to-face work, the number of IDs using non-face-to-face work-related words increased 7.2 times and the number of tweets increased 4.8 times after COVID-19. The top frequently used words related to non-face-to-face work appeared in the order of remote jobs, cybersecurity, technical jobs, productivity, and software. The words that have increased after the COVID-19 were concerned about lockdown and dismissal, and business transformation and also mentioned as to secure business continuity and virtual workplace. New Normal was newly mentioned as a new standard. Negative opinions found to be increased in the early stages of COVID-19 from 34% to 43%, and then stabilized again to 36% through non-face-to-face work sentiment analysis. The complaints were, policies such as strengthening cybersecurity, activating communication to improve work productivity, and diversifying work spaces.

A Study on Analysis of National Petition Data for Deriving Current Issues in Education (교육관련 이슈 도출을 위한 국민청원 데이터 분석 연구)

  • Min, Jeongwon;Shim, Jaekwoun
    • Journal of Creative Information Culture
    • /
    • v.6 no.2
    • /
    • pp.57-64
    • /
    • 2020
  • As the information society gradually advances, various opinions overflow and their complexity increases. As the results, it was made more difficult to derive important issues and properly respond to those problems. Accordingly, it is necessary to get a handle on emerging problems in education in addition to existing discourses and issues. This study aimed at examining the issues of education by analyzing the petitions posted under 'parenting and education' category on National Petition board. In order to offer objective and detailed results, we employed the topic modeling based LDA algorithm, which is an effective method to extract topics in multiple documents. Nine topics were derived as the result of the analysis and the relationship among those topics was visualized. The values of this study exist in that the derived topics represent important issues that reflect the public opinions.

An Analysis of the International Trends of Research on Artificial Intelligence in Education Using Topic Modeling (인공지능 활용 교육의 토픽모델링 분석을 통한 수학교육 연구 방향의 함의)

  • Noh, Jihwa;Ko, Ho Kyoung;Kim, Byeongsoo;Huh, Nan
    • Journal of the Korean School Mathematics Society
    • /
    • v.26 no.1
    • /
    • pp.1-19
    • /
    • 2023
  • This study analyzed the international trends of research concerning artificial intelligence in education by examining 352 papers recently published in the International Journal of Artificial Intelligence in Education(IJAIED) with the topic modeling method. The IJAIED is the official, SCOPUS-indexed journal of the International AIED Society. The analysis revealed that international AIED research trends could be categorized into eight topics with topics such as analyzing student behavior model in learning systems and designing feedback to student solutions being increased over time, whereas research focusing on data handling methods was decreased over time. Based on the findings implications and suggestions for the research and development of the applications of AIED were provided.

3D Face Recognition using Wavelet Transform Based on Fuzzy Clustering Algorithm (펴지 군집화 알고리즘 기반의 웨이블릿 변환을 이용한 3차원 얼굴 인식)

  • Lee, Yeung-Hak
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.11
    • /
    • pp.1501-1514
    • /
    • 2008
  • The face shape extracted by the depth values has different appearance as the most important facial information. The face images decomposed into frequency subband are signified personal features in detail. In this paper, we develop a method for recognizing the range face images by multiple frequency domains for each depth image using the modified fuzzy c-mean algorithm. For the proposed approach, the first step tries to find the nose tip that has a protrusion shape on the face from the extracted face area. And the second step takes into consideration of the orientated frontal posture to normalize. Multiple contour line areas which have a different shape for each person are extracted by the depth threshold values from the reference point, nose tip. And then, the frequency component extracted from the wavelet subband can be adopted as feature information for the authentication problems. The third step of approach concerns the application of eigenface to reduce the dimension. And the linear discriminant analysis (LDA) method to improve the classification ability between the similar features is adapted. In the last step, the individual classifiers using the modified fuzzy c-mean method based on the K-NN to initialize the membership degree is explained for extracted coefficient at each resolution level. In the experimental results, using the depth threshold value 60 (DT60) showed the highest recognition rate among the extracted regions, and the proposed classification method achieved 98.3% recognition rate, incase of fuzzy cluster.

  • PDF

A Proposal of a Keyword Extraction System for Detecting Social Issues (사회문제 해결형 기술수요 발굴을 위한 키워드 추출 시스템 제안)

  • Jeong, Dami;Kim, Jaeseok;Kim, Gi-Nam;Heo, Jong-Uk;On, Byung-Won;Kang, Mijung
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.3
    • /
    • pp.1-23
    • /
    • 2013
  • To discover significant social issues such as unemployment, economy crisis, social welfare etc. that are urgent issues to be solved in a modern society, in the existing approach, researchers usually collect opinions from professional experts and scholars through either online or offline surveys. However, such a method does not seem to be effective from time to time. As usual, due to the problem of expense, a large number of survey replies are seldom gathered. In some cases, it is also hard to find out professional persons dealing with specific social issues. Thus, the sample set is often small and may have some bias. Furthermore, regarding a social issue, several experts may make totally different conclusions because each expert has his subjective point of view and different background. In this case, it is considerably hard to figure out what current social issues are and which social issues are really important. To surmount the shortcomings of the current approach, in this paper, we develop a prototype system that semi-automatically detects social issue keywords representing social issues and problems from about 1.3 million news articles issued by about 10 major domestic presses in Korea from June 2009 until July 2012. Our proposed system consists of (1) collecting and extracting texts from the collected news articles, (2) identifying only news articles related to social issues, (3) analyzing the lexical items of Korean sentences, (4) finding a set of topics regarding social keywords over time based on probabilistic topic modeling, (5) matching relevant paragraphs to a given topic, and (6) visualizing social keywords for easy understanding. In particular, we propose a novel matching algorithm relying on generative models. The goal of our proposed matching algorithm is to best match paragraphs to each topic. Technically, using a topic model such as Latent Dirichlet Allocation (LDA), we can obtain a set of topics, each of which has relevant terms and their probability values. In our problem, given a set of text documents (e.g., news articles), LDA shows a set of topic clusters, and then each topic cluster is labeled by human annotators, where each topic label stands for a social keyword. For example, suppose there is a topic (e.g., Topic1 = {(unemployment, 0.4), (layoff, 0.3), (business, 0.3)}) and then a human annotator labels "Unemployment Problem" on Topic1. In this example, it is non-trivial to understand what happened to the unemployment problem in our society. In other words, taking a look at only social keywords, we have no idea of the detailed events occurring in our society. To tackle this matter, we develop the matching algorithm that computes the probability value of a paragraph given a topic, relying on (i) topic terms and (ii) their probability values. For instance, given a set of text documents, we segment each text document to paragraphs. In the meantime, using LDA, we can extract a set of topics from the text documents. Based on our matching process, each paragraph is assigned to a topic, indicating that the paragraph best matches the topic. Finally, each topic has several best matched paragraphs. Furthermore, assuming there are a topic (e.g., Unemployment Problem) and the best matched paragraph (e.g., Up to 300 workers lost their jobs in XXX company at Seoul). In this case, we can grasp the detailed information of the social keyword such as "300 workers", "unemployment", "XXX company", and "Seoul". In addition, our system visualizes social keywords over time. Therefore, through our matching process and keyword visualization, most researchers will be able to detect social issues easily and quickly. Through this prototype system, we have detected various social issues appearing in our society and also showed effectiveness of our proposed methods according to our experimental results. Note that you can also use our proof-of-concept system in http://dslab.snu.ac.kr/demo.html.

Wavelet based Fuzzy Integral System for 3D Face Recognition (퍼지적분을 이용한 웨이블릿 기반의 3차원 얼굴 인식)

  • Lee, Yeung-Hak;Shim, Jae-Chang
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.10
    • /
    • pp.616-626
    • /
    • 2008
  • The face shape extracted by the depth values has different appearance as the most important facial feature information and the face images decomposed into frequency subband are signified personal features in detail. In this paper, we develop a method for recognizing the range face images by combining the multiple frequency domains for each depth image and depth fusion using fuzzy integral. For the proposed approach, the first step tries to find the nose tip that has a protrusion shape on the face from the extracted face area. It is used as the reference point to normalize for orientated facial pose and extract multiple areas by the depth threshold values. In the second step, we adopt as features for the authentication problem the wavelet coefficient extracted from some wavelet subband to use feature information. The third step of approach concerns the application of eigenface and Linear Discriminant Analysis (LDA) method to reduce the dimension and classify. In the last step, the aggregation of the individual classifiers using the fuzzy integral is explained for extracted coefficient at each resolution level. In the experimental results, using the depth threshold value 60 (DT60) show the highest recognition rate among the regions, and the depth fusion method achieves 98.6% recognition rate, incase of fuzzy integral.

Research Trend Analysis of Publications in the Journal of Home Economics Education Association Using Network Text Analysis (네트워크 텍스트 분석을 이용한 한국가정과교육학회지 논문의 연구 동향 분석)

  • Lee, Yoon-Jung;Kim, Eun Jeung;Kim, Ji sun
    • Journal of Korean Home Economics Education Association
    • /
    • v.31 no.4
    • /
    • pp.1-18
    • /
    • 2019
  • The purpose of this study was to analyze the research trend in home economics education using network text analysis method. The 586 research articles published in the Journal of Home Economics Education Association between July, 2003 and December 2018 were examined using Neckinger 4, a social network analysis software. The frequency and centrality measures(degree centrality, closeness centrality, and betweenness centrality) were calculated for the words appeared throughout the whole period, and the centrality analysis and LAD(Latent Dirichlet Allocation) were conducted for the four sub-periods. The results are as follows: first, the most frequently appeared words are parents, culture, unit, health, career, consumption, practicality, etc. The words such as parents and management scored high in degree centrality; parents and male students in closeness centrality; and male students and units in betweenness centrality. Second, when divided into four periods, the words such as education, family, purpose, class, middle school, and school appeared most frequently across the periods; but some words such as 'purpose' (in period 3 and 4), or 'process' (in period 4) were salient only in certain periods. Third, the words with high centrality were consistent regardless of the types of centrality within each period. Fourth, the topic analysis using LAD showed that curriculum, textbook, family healthiness, teaching-learning, evaluation, dietary life, appearance management, and consumption were the topics consistently appeared across all periods. The topics have become diversified and deepened. New topics such as teacher training and safety appeared in later periods, possibly due to the curriculum and national policy changes, and housing as a less represented topic is suggested as an area that needs further research attention. This study has implication in that it allows researchers to identify the major research interests and the trends in research by researchers in home economic education.

Comparative Analysis of the Keywords in Taekwondo News Articles by Year: Applying Topic Modeling Method (태권도 뉴스기사의 연도별 주제어 비교분석: 토픽모델링 적용)

  • Jeon, Minsoo;Lim, Hyosung
    • Journal of Digital Convergence
    • /
    • v.19 no.11
    • /
    • pp.575-583
    • /
    • 2021
  • This study aims to analyze Taekwondo trends according to news articles by year by applying topic modeling. In order to examine the Taekwondo trend through media reports, articles including news articles and Taekwondo specialized media articles were collected through Big Kinds of the Korea Press Foundation. The search period was divided into three sections: before 2000, 2001~2010, and 2011~2020. A total of 12,124 items were selected as research data. For topic analysis, pre-processing was performed, and topic analysis was performed using the LDA algorithm. In this case, python 3 was applied for all analysis. First, as a result of analyzing the topics of media articles by year, 'World' was the most common keyword before 2000. 'South and North Korea' was next common and 'Olympic' was the third commonest topic. From 2001 to 2010, 'World' was the most common topic, followed by 'Association' and 'World Taekwondo'. From 2011 to 2020, 'World', 'Demonstration', and 'Kukkiwon' was the most common topic in that order. Second, as a result of analyzing news articles before 2000 by topic modeling, topics were divided into two categories. Specifically, Topic 1 was selected as 'South-North Korea sports exchange' and Topic 2 was selected as 'Adoption of Olympic demonstration events'. Third, as a result of analyzing news articles from 2001 to 2010 by topic modeling, three topics were selected. Topic 1 was selected as 'Taekwondo Demonstration Performance and Corruption', Topic 2 was selected as 'Muju Taekwondo Park Creation', and Topic 3 was selected as 'World Taekwondo Festival'. Fourth, as a result of analyzing news articles from 2011 to 2020 by topic modeling, three topics were selected. Topic 1 was selected as 'Successful Hosting of the 2018 Pyeongchang Winter Olympics', Topic 2 was selected as 'North-South Korea Taekwondo Joint Demonstration Performance', and Topic 3 was selected as '2017 Muju World Taekwondo Championships'.

Topic Modeling Insomnia Social Media Corpus using BERTopic and Building Automatic Deep Learning Classification Model (BERTopic을 활용한 불면증 소셜 데이터 토픽 모델링 및 불면증 경향 문헌 딥러닝 자동분류 모델 구축)

  • Ko, Young Soo;Lee, Soobin;Cha, Minjung;Kim, Seongdeok;Lee, Juhee;Han, Ji Yeong;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.2
    • /
    • pp.111-129
    • /
    • 2022
  • Insomnia is a chronic disease in modern society, with the number of new patients increasing by more than 20% in the last 5 years. Insomnia is a serious disease that requires diagnosis and treatment because the individual and social problems that occur when there is a lack of sleep are serious and the triggers of insomnia are complex. This study collected 5,699 data from 'insomnia', a community on 'Reddit', a social media that freely expresses opinions. Based on the International Classification of Sleep Disorders ICSD-3 standard and the guidelines with the help of experts, the insomnia corpus was constructed by tagging them as insomnia tendency documents and non-insomnia tendency documents. Five deep learning language models (BERT, RoBERTa, ALBERT, ELECTRA, XLNet) were trained using the constructed insomnia corpus as training data. As a result of performance evaluation, RoBERTa showed the highest performance with an accuracy of 81.33%. In order to in-depth analysis of insomnia social data, topic modeling was performed using the newly emerged BERTopic method by supplementing the weaknesses of LDA, which is widely used in the past. As a result of the analysis, 8 subject groups ('Negative emotions', 'Advice and help and gratitude', 'Insomnia-related diseases', 'Sleeping pills', 'Exercise and eating habits', 'Physical characteristics', 'Activity characteristics', 'Environmental characteristics') could be confirmed. Users expressed negative emotions and sought help and advice from the Reddit insomnia community. In addition, they mentioned diseases related to insomnia, shared discourse on the use of sleeping pills, and expressed interest in exercise and eating habits. As insomnia-related characteristics, we found physical characteristics such as breathing, pregnancy, and heart, active characteristics such as zombies, hypnic jerk, and groggy, and environmental characteristics such as sunlight, blankets, temperature, and naps.