• Title/Summary/Keyword: Gower coefficients

Search Result 1, Processing Time 0.013 seconds

Reinforced Generator GAN Model for Tabular Data Learning (Tabular Data 학습을 위한 강화형 생성자 GAN Mode)

  • Chan-sik Sung;Joon-sik Lim
    • Journal of Internet Computing and Services
    • /
    • v.25 no.5
    • /
    • pp.121-130
    • /
    • 2024
  • Tabular Data is a mixture of numerical and categorical data, and machine learning models have been evaluated to be more suitable than generative models in performing learning using such tabular data. This evaluation is because the generative model had a problem of excessively increasing parameters or not finding the direction of learning due to the numerical multimodal distribution and categorical frequency imbalance, which are characteristics of Tabular Data. However, as data gradually becomes big data and becomes real-time, existing machine learning models have shown limitations in their application. In this paper, as a methodology for applying generative models to tabular data, we propose RGGAN (Reinforced Generator GAN), a reinforced generator adversarial neural network that Clustering sampling that leverages conjugate prior distributions and the loss function improved with Gower coefficients and mutual information. As a result of measuring the AUC by detecting fraudulent transactions in the IEEE-CIS Fraud Detection Dataset by constructing an anomaly detector with the discriminators learned from the RGGAN proposed in this paper, it showed a performance improvement effect of 1-7% over the existing generative models, proving that the proposed model is effective for learning tabular data and also effective in detecting fraudulent transactions.