• Title/Summary/Keyword: Latent class model

Search Result 69, Processing Time 0.02 seconds

Learning Probabilistic Kernel from Latent Dirichlet Allocation

  • Lv, Qi;Pang, Lin;Li, Xiong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.6
    • /
    • pp.2527-2545
    • /
    • 2016
  • Measuring the similarity of given samples is a key problem of recognition, clustering, retrieval and related applications. A number of works, e.g. kernel method and metric learning, have been contributed to this problem. The challenge of similarity learning is to find a similarity robust to intra-class variance and simultaneously selective to inter-class characteristic. We observed that, the similarity measure can be improved if the data distribution and hidden semantic information are exploited in a more sophisticated way. In this paper, we propose a similarity learning approach for retrieval and recognition. The approach, termed as LDA-FEK, derives free energy kernel (FEK) from Latent Dirichlet Allocation (LDA). First, it trains LDA and constructs kernel using the parameters and variables of the trained model. Then, the unknown kernel parameters are learned by a discriminative learning approach. The main contributions of the proposed method are twofold: (1) the method is computationally efficient and scalable since the parameters in kernel are determined in a staged way; (2) the method exploits data distribution and semantic level hidden information by means of LDA. To evaluate the performance of LDA-FEK, we apply it for image retrieval over two data sets and for text categorization on four popular data sets. The results show the competitive performance of our method.

A Longitudinal Analysis of the Number of Checked-out Books Using Latent Growth Model and Growth Mixture Modeling (잠재성장모형과 성장혼합모형을 이용한 도서관 대출권수의 종단적 분석)

  • Heejin Park;Sungjae Park
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.1
    • /
    • pp.45-68
    • /
    • 2023
  • The purpose of this study is to longitudinally analyze impact factors on library use. One of library use indicators, the number of circulated books was statistically analyzed with latent growth model and growth mixture model. Library data from 2014 to 2019 were collected from the National Library Statistics System, and 846 public libraries were analyzed. As results, the number of circulated books were decreased, but it was tempered. Next, with controlling the factor affecting the dependent variable, the size of collection and the number of participants in reading programs provided by public libraries were statistically significant. Lastly, 5 classes were identified by applying the growth mixture model, and the number of librarians was significantly associated with trajectory class membership.

Label Embedding for Improving Classification Accuracy UsingAutoEncoderwithSkip-Connections (다중 레이블 분류의 정확도 향상을 위한 스킵 연결 오토인코더 기반 레이블 임베딩 방법론)

  • Kim, Museong;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.175-197
    • /
    • 2021
  • Recently, with the development of deep learning technology, research on unstructured data analysis is being actively conducted, and it is showing remarkable results in various fields such as classification, summary, and generation. Among various text analysis fields, text classification is the most widely used technology in academia and industry. Text classification includes binary class classification with one label among two classes, multi-class classification with one label among several classes, and multi-label classification with multiple labels among several classes. In particular, multi-label classification requires a different training method from binary class classification and multi-class classification because of the characteristic of having multiple labels. In addition, since the number of labels to be predicted increases as the number of labels and classes increases, there is a limitation in that performance improvement is difficult due to an increase in prediction difficulty. To overcome these limitations, (i) compressing the initially given high-dimensional label space into a low-dimensional latent label space, (ii) after performing training to predict the compressed label, (iii) restoring the predicted label to the high-dimensional original label space, research on label embedding is being actively conducted. Typical label embedding techniques include Principal Label Space Transformation (PLST), Multi-Label Classification via Boolean Matrix Decomposition (MLC-BMaD), and Bayesian Multi-Label Compressed Sensing (BML-CS). However, since these techniques consider only the linear relationship between labels or compress the labels by random transformation, it is difficult to understand the non-linear relationship between labels, so there is a limitation in that it is not possible to create a latent label space sufficiently containing the information of the original label. Recently, there have been increasing attempts to improve performance by applying deep learning technology to label embedding. Label embedding using an autoencoder, a deep learning model that is effective for data compression and restoration, is representative. However, the traditional autoencoder-based label embedding has a limitation in that a large amount of information loss occurs when compressing a high-dimensional label space having a myriad of classes into a low-dimensional latent label space. This can be found in the gradient loss problem that occurs in the backpropagation process of learning. To solve this problem, skip connection was devised, and by adding the input of the layer to the output to prevent gradient loss during backpropagation, efficient learning is possible even when the layer is deep. Skip connection is mainly used for image feature extraction in convolutional neural networks, but studies using skip connection in autoencoder or label embedding process are still lacking. Therefore, in this study, we propose an autoencoder-based label embedding methodology in which skip connections are added to each of the encoder and decoder to form a low-dimensional latent label space that reflects the information of the high-dimensional label space well. In addition, the proposed methodology was applied to actual paper keywords to derive the high-dimensional keyword label space and the low-dimensional latent label space. Using this, we conducted an experiment to predict the compressed keyword vector existing in the latent label space from the paper abstract and to evaluate the multi-label classification by restoring the predicted keyword vector back to the original label space. As a result, the accuracy, precision, recall, and F1 score used as performance indicators showed far superior performance in multi-label classification based on the proposed methodology compared to traditional multi-label classification methods. This can be seen that the low-dimensional latent label space derived through the proposed methodology well reflected the information of the high-dimensional label space, which ultimately led to the improvement of the performance of the multi-label classification itself. In addition, the utility of the proposed methodology was identified by comparing the performance of the proposed methodology according to the domain characteristics and the number of dimensions of the latent label space.

Analysis of Change Patterns in Assistive Technology Device Use of the Workers with Disabilities (취업장애인의 보조공학기기 사용의 변화형태 분석)

  • Jun, Y.H.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.6 no.1
    • /
    • pp.83-87
    • /
    • 2012
  • This study is aimed to identify latent classes which are based the change patterns in assistive technology device use among worker with disabilities and to test the effects of independent variables(gender, education, disability type, disability density, activity and participation of ICF: ICF, subjective socioeconomic status: SES, job satisfaction, life satisfaction) on determining latents classes. This study applied Nagin's(1999) semi-parametric group based approach to the panel survey of employment for the disabled. Because dependant variable has dichotomous scale, logit model was used. The results identified three latent classes, which could be defined based on the patterns as follows; assistive device continued use group, assistive device mid-level use group, assistive device sharp decline use group. The effects of the independent variables on the latent classes was tested by multinomial logit analysis. The results showed that education, disability type, ICF, SES, and life satisfaction were significant determinants of the latent classes. Finally, the implications based on analysis results were suggested.

  • PDF

Sensitivity of Typhoon Simulation to Physics Parameterizations in the Global Model (전구 모델의 물리과정에 따른 태풍 모의 민감도)

  • Kim, Ki-Byung;Lee, Eun-Hee;Seol, Kyung-Hee
    • Atmosphere
    • /
    • v.27 no.1
    • /
    • pp.17-28
    • /
    • 2017
  • The sensitivity of the typhoon track and intensity simulation to physics schemes of the global model are examined for the typhoon Bolaven and Tembin cases by using the Global/Regional Integrated Model System-Global Model Program (GRIMs-GMP) with the physics package version 2.0 of the Korea Institute of Atmospheric Prediction Systems. Microphysics, Cloudiness, and Planetary boundary Layer (PBL) parameterizations are changed and the impact of each scheme change to typhoon simulation is compared with the control simulation and observation. It is found that change of microphysics scheme from WRF Single-Moment 5-class (WSM5) to 1-class (WSM1) affects to the typhoon simulation significantly, showing the intensified typhoon activity and increased precipitation amount, while the effect of the prognostic cloudiness and PBL enhanced mixing scheme is not noticeable. It appears that WSM1 simulates relatively unstable and drier atmospheric structure than WSM5, which is induced by the latent heat change and the associated radiative effect due to not considering ice cloud. And WSM1 results the enhanced typhoon intensity and heavy rainfall simulation. It suggests that the microphysics is important to improve the capability for typhoon simulation of a global model and to increase the predictability of medium range forecast.

Longitudinal Patterns of Stages of Changes in Smoking Behaviors among Korean Adult Smokers: Applying the Transtheoretical Model of Change (범이론적 모델에 기반을 둔 흡연자의 금연행동 변화단계에 대한 탐색적 연구)

  • Park, Hyunyong;Jun, Jina;Sohn, Sunju
    • Korean Journal of Social Welfare Studies
    • /
    • v.49 no.1
    • /
    • pp.5-28
    • /
    • 2018
  • Smoking is one of the important public health concerns because it is preventable causes regarding individuals' negative health consequences and increased social and economic cost. However, few studies have examined longitudinal patterns of stages of changes(SOC) in smoking behaviors among the general population. The purpose of the study is to explore the latent patterns of SOC over time among Korean adult smokers using the 2008-2016 Korea Welfare Panel Study. A repeated measure latent class analysis is employed in the present study. The finding of the present study are as follows: First, four latent groups were identified: (1) action/maintenance stage(33.6%), (2) contemplation/preparation to action/maintenance stage(14.8%), (3) continuously contemplation/preparation stage(29.6%), and (4) continuously pre-contemplation stage(22.1%). Second, the results of a multinomial logistic regression found that socio-demographic and clinical characteristics were associated with the identified longitudinal patterns of smoking behaviors. Compared to a continuously pre-contemplation stage, higher levels of depressive symptoms and drinking behavior were associated with increased odds of being in action/maintenance stage. The findings of the present study highlight that a tailored intervention is needed for individuals with continuously pre-contemplation stage and contemplation stage.

Identifying latent classes about the changing trajectories of child maltreatment by child developmental period (아동발달시기별 아동학대 변화궤적 유형 비교 연구)

  • Han, Jihyeon;Choi, Okchae
    • Journal of the Korean Society of Child Welfare
    • /
    • no.59
    • /
    • pp.183-208
    • /
    • 2017
  • The purpose of this study was to identify groups depending on the change trajectory of child maltreatment in childhood and early adolescence. For this study, the data from waves two through six (2011-2015) of the Korean Children and Youth Panel Survey (KCYPS) were used. Participants included first-grade (n=2,300) and fourth-grade (n=2,325) elementary school students. A latent class growth model (LCGM) using Mplus 7.21 was adopted to classify the types of developmental trajectories of child maltreatment. The main results were as follows: First, in physical abuse, childhood from the second to the sixth grades was classified into four groups: decreased, low maintenance, increased, and no maltreatment type. In addition, early adolescence from the fifth grade of elementary school to the third grade of middle school was also classified into the same types. Second, in emotional abuse, childhood was classified into three groups: decreased, increased, and no maltreatment type. Early adolescence was classified into four groups: decreased, low maintenance, increased, and no maltreatment type. Third, in neglect, childhood was classified into four groups: maintenance, low decreasing, low increasing, and no maltreatment type. Early adolescence was classified into three groups: maintenance, low increasing, and no maltreatment type. According to the change of child maltreatment by developmental period, physical abuse continued from childhood to early adolescence, whereas emotional abuse and neglect increased in early adolescence compared to childhood. This study is meaningful in classifying latent classes depending on maltreatment types. Theoretical and practical implications were suggested based on the study results.

A Latent Profile Analysis of Stress Coping Strategies among Korean Adults at the Early Stage of the Coronavirus Pandemic(COVID-19) and Verification of Influencing Factors (코로나 팬데믹 초기 한국인의 스트레스 대처 양상에 따른 잠재계층 분류와 영향요인 검증)

  • Nam, Seulki;Lee, Dong Hun
    • Korean Journal of Culture and Social Issue
    • /
    • v.28 no.3
    • /
    • pp.483-512
    • /
    • 2022
  • This study examined the patterns of coping strategies among Koreans during the early stage of the COVID-19 pandemic, explored the influence of demographic information (gender, age, economic level, household type), along with the unusual experiences due to COVID-19 (fear, stress of COVID, constraints of routine, income risk) on the classification of subclasses, and analyzed the latent profile differences in psychological wellbeing (life satisfaction, depression, and anxiety). An online survey was conducted among Korean Adults(n=600) between April 13, 2020 and 21, when WHO declared COVID-19 a global pandemic and Daegu as well as Gyeongsangbuk-do was nominated as a special disaster zone. First, Latent Profile Analysis (LPA) was used to identify subclasses of coping strategies and results suggested that the 4-class model had the best fit. Second, Class memberships were predicted by gender, age, economic level, as well as fear, stress, constraints of routine, and income risk, among the unusual experiences due to COVID-19. Finally, there are differences in psychological wellbeing among latent profiles. 'High level of adaptive coping group 3' showed the highest level of life satisfaction, 'Adaptive-maladaptive coping group 4' showed the highest level of depression, anxiety. Implications and suggestions are discussed based on the study results.

Online anomaly detection algorithm based on deep support vector data description using incremental centroid update (점진적 중심 갱신을 이용한 deep support vector data description 기반의 온라인 비정상 탐지 알고리즘)

  • Lee, Kibae;Ko, Guhn Hyeok;Lee, Chong Hyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.2
    • /
    • pp.199-209
    • /
    • 2022
  • Typical anomaly detection algorithms are trained by using prior data. Thus the batch learning based algorithms cause inevitable performance degradation when characteristics of newly incoming normal data change over time. We propose an online anomaly detection algorithm which can consider the gradual characteristic changes of incoming normal data. The proposed algorithm based on one-class classification model includes both offline and online learning procedures. In offline learning procedure, the algorithm learns the prior data to be close to centroid of the latent space and then updates the centroid of the latent space incrementally by new incoming data. In the online learning, the algorithm continues learning by using the updated centroid. Through experiments using public underwater acoustic data, the proposed online anomaly detection algorithm takes only approximately 2 % additional learning time for the incremental centroid update and learning. Nevertheless, the proposed algorithm shows 19.10 % improvement in Area Under the receiver operating characteristic Curve (AUC) performance compared to the offline learning model when new incoming normal data comes.

Development of Social Work Strategies for School-linked services - Based on Latent Class Growth Analysis of Delinquent Behaviors in adolescence - (학교연계 서비스를 위한 사회복지실천 전략 개발 - 청소년기 경비행행동의 차별적 발달궤적에 대한 잠재계층성장분석 -)

  • Lee, Sang-Gyun
    • Korean Journal of Social Welfare Studies
    • /
    • v.40 no.3
    • /
    • pp.377-406
    • /
    • 2009
  • This study used laten class growth analysis to identify discrete developmental patterns of delinquent behaviors in adolescence. This present article also examined associations among these trajectories to determine how the development of delinquent behaviors relates to protective and risk factors, which include parental monitoring, attachment with parent, association with deviant peers, self-control, and negative stigma from others. Four-wave panel data from a Korea Youth Panel Study were used for the latent class growth model analysis. The sample consisted of 3,446 adolescents who were assessed at 4 measurement waves with approximately 1-year interval. Four trajectories of delinquent behaviors emerged: delinquency persistence, delinquency increaser, delinquency decreaser, normative group(almost no delinquent behaviors). Association with deviant peers had the most proximal strong influence on the probability of being in the delinquency increaser and delinquency persistence group compared, noed to the normative group. Parental monitoring, self-efficacy and negative stigma also differentiated the four delinquent behavior trajectories from one another after controllig for socio-demographic variables. The study suggested that there is a significant heterogeneity in the timing and change rate of delinquency progression. Adolescent delinquency prevention and intervention programs will need to consider this heterogeneity and enhance attention to protective and risk factors depending on the subpopulation.