• Title/Summary/Keyword: Credit Prediction

Search Result 81, Processing Time 0.062 seconds

Deep Learning-based Delinquent Taxpayer Prediction: A Scientific Administrative Approach

  • YongHyun Lee;Eunchan Kim
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제18권1호
    • /
    • pp.30-45
    • /
    • 2024
  • This study introduces an effective method for predicting individual local tax delinquencies using prevalent machine learning and deep learning algorithms. The evaluation of credit risk holds great significance in the financial realm, impacting both companies and individuals. While credit risk prediction has been explored using statistical and machine learning techniques, their application to tax arrears prediction remains underexplored. We forecast individual local tax defaults in Republic of Korea using machine and deep learning algorithms, including convolutional neural networks (CNN), long short-term memory (LSTM), and sequence-to-sequence (seq2seq). Our model incorporates diverse credit and public information like loan history, delinquency records, credit card usage, and public taxation data, offering richer insights than prior studies. The results highlight the superior predictive accuracy of the CNN model. Anticipating local tax arrears more effectively could lead to efficient allocation of administrative resources. By leveraging advanced machine learning, this research offers a promising avenue for refining tax collection strategies and resource management.

A customer credit Prediction Researched to Improve Credit Stability based on Artificial Intelligence

  • MUN, Ji-Hui;JUNG, Sang Woo
    • 한국인공지능학회지
    • /
    • 제9권1호
    • /
    • pp.21-27
    • /
    • 2021
  • In this Paper, Since the 1990s, Korea's credit card industry has steadily developed. As a result, various problems have arisen, such as careless customer information management and loans to low-credit customers. This, in turn, had a high delinquency rate across the card industry and a negative impact on the economy. Therefore, in this paper, based on Azure, we analyze and predict the delinquency and delinquency periods of credit loans according to gender, own car, property, number of children, education level, marital status, and employment status through linear regression analysis and enhanced decision tree algorithm. These predictions can consequently reduce the likelihood of reckless credit lending and issuance of credit cards, reducing the number of bad creditors and reducing the risk of banks. In addition, after classifying and dividing the customer base based on the predicted result, it can be used as a basis for reducing the risk of credit loans by developing a credit product suitable for each customer. The predicted result through Azure showed that when predicting with Linear Regression and Boosted Decision Tree algorithm, the Boosted Decision Tree algorithm made more accurate prediction. In addition, we intend to increase the accuracy of the analysis by assigning a number to each data in the future and predicting again.

신용카드 대손회원 예측을 위한 SVM 모형 (Credit Card Bad Debt Prediction Model based on Support Vector Machine)

  • 김진우;지원철
    • 한국IT서비스학회지
    • /
    • 제11권4호
    • /
    • pp.233-250
    • /
    • 2012
  • In this paper, credit card delinquency means the possibility of occurring bad debt within the certain near future from the normal accounts that have no debt and the problem is to predict, on the monthly basis, the occurrence of delinquency 3 months in advance. This prediction is typical binary classification problem but suffers from the issue of data imbalance that means the instances of target class is very few. For the effective prediction of bad debt occurrence, Support Vector Machine (SVM) with kernel trick is adopted using credit card usage and payment patterns as its inputs. SVM is widely accepted in the data mining society because of its prediction accuracy and no fear of overfitting. However, it is known that SVM has the limitation in its ability to processing the large-scale data. To resolve the difficulties in applying SVM to bad debt occurrence prediction, two stage clustering is suggested as an effective data reduction method and ensembles of SVM models are also adopted to mitigate the difficulty due to data imbalance intrinsic to the target problem of this paper. In the experiments with the real world data from one of the major domestic credit card companies, the suggested approach reveals the superior prediction accuracy to the traditional data mining approaches that use neural networks, decision trees or logistics regressions. SVM ensemble model learned from T2 training set shows the best prediction results among the alternatives considered and it is noteworthy that the performance of neural networks with T2 is better than that of SVM with T1. These results prove that the suggested approach is very effective for both SVM training and the classification problem of data imbalance.

Diffusion Model을 활용한 신용 예측 데이터 불균형 해결 기법 (Mitigating Data Imbalance in Credit Prediction using the Diffusion Model)

  • 오상민;이주홍
    • 스마트미디어저널
    • /
    • 제13권2호
    • /
    • pp.9-15
    • /
    • 2024
  • 본 논문에서는 신용 예측에서 발생하는 불균형 문제를 해결하기 위해 Diffusion Multi-step Classifier(DMC)를 제안한다. DMC는 Diffusion Model을 통해 신용 예측 데이터의 연속적인 수치형 데이터들을 생성하고 생성된 데이터들을 Multi-step Classifier로 구분하는 것으로 범주형 데이터를 생성한다. DMC를 통해 기존의 데이터를 생성하는 다른 알고리즘보다 실제 데이터와 유사한 분포를 가지는 데이터를 생성할 수 있었다. 이렇게 생성된 데이터를 사용하여 실험을 진행하였을 때 연체를 예측할 확률이 20%이상 상승하였으며, 전체적으로 예측 정확성은 약 4%정도 상승하였다. 이러한 연구 결과는 실제 금융기관에 적용 시 연체율 감소와 수익 증가에 큰 기여를 할 수 있을것으로 예상된다.

Incorporating BERT-based NLP and Transformer for An Ensemble Model and its Application to Personal Credit Prediction

  • Sophot Ky;Ju-Hong Lee;Kwangtek Na
    • 스마트미디어저널
    • /
    • 제13권4호
    • /
    • pp.9-15
    • /
    • 2024
  • Tree-based algorithms have been the dominant methods used build a prediction model for tabular data. This also includes personal credit data. However, they are limited to compatibility with categorical and numerical data only, and also do not capture information of the relationship between other features. In this work, we proposed an ensemble model using the Transformer architecture that includes text features and harness the self-attention mechanism to tackle the feature relationships limitation. We describe a text formatter module, that converts the original tabular data into sentence data that is fed into FinBERT along with other text features. Furthermore, we employed FT-Transformer that train with the original tabular data. We evaluate this multi-modal approach with two popular tree-based algorithms known as, Random Forest and Extreme Gradient Boosting, XGBoost and TabTransformer. Our proposed method shows superior Default Recall, F1 score and AUC results across two public data sets. Our results are significant for financial institutions to reduce the risk of financial loss regarding defaulters.

Corporate credit rating prediction using support vector machines

  • 이영찬
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2005년도 공동추계학술대회
    • /
    • pp.571-578
    • /
    • 2005
  • Corporate credit rating analysis has drawn a lot of research interests in previous studies, and recent studies have shown that machine learning techniques achieved better performance than traditional statistical ones. This paper applies support vector machines (SVMs) to the corporate credit rating problem in an attempt to suggest a new model with better explanatory power and stability. To serve this purpose, the researcher uses a grid-search technique using 5-fold cross-validation to find out the optimal parameter values of kernel function of SVM. In addition, to evaluate the prediction accuracy of SVM, the researcher compares its performance with those of multiple discriminant analysis (MDA), case-based reasoning (CBR), and three-layer fully connected back-propagation neural networks (BPNs). The experiment results show that SVM outperforms the other methods.

  • PDF

Support Vector Machine을 이용한 지능형 신용평가시스템 개발 (Development of Intelligent Credit Rating System using Support Vector Machines)

  • 김경재
    • 한국정보통신학회논문지
    • /
    • 제9권7호
    • /
    • pp.1569-1574
    • /
    • 2005
  • In this paper, I propose an intelligent credit rating system using a bankruptcy prediction model based on support vector machines (SVMs). SVMs are promising methods because they use a risk function consisting of the empirical error and a regularized term which is derived from the structural risk minimization principle. This study examines the feasibility of applying SVM in Predicting corporate bankruptcies by comparing it with other data mining techniques. In addition. this study presents architecture and prototype of intelligeht credit rating systems based on SVM models.

Frequency Matrix 기법을 이용한 결측치 자료로부터의 개인신용예측 (Predicting Personal Credit Rating with Incomplete Data Sets Using Frequency Matrix technique)

  • 배재권;김진화;황국재
    • Journal of Information Technology Applications and Management
    • /
    • 제13권4호
    • /
    • pp.273-290
    • /
    • 2006
  • This study suggests a frequency matrix technique to predict personal credit rate more efficiently using incomplete data sets. At first this study test on multiple discriminant analysis and logistic regression analysis for predicting personal credit rate with incomplete data sets. Missing values are predicted with mean imputation method and regression imputation method here. An artificial neural network and frequency matrix technique are also tested on their performance in predicting personal credit rating. A data set of 8,234 customers in 2004 on personal credit information of Bank A are collected for the test. The performance of frequency matrix technique is compared with that of other methods. The results from the experiments show that the performance of frequency matrix technique is superior to that of all other models such as MDA-mean, Logit-mean, MDA-regression, Logit-regression, and artificial neural networks.

  • PDF

A Study on the Prediction Model for International Trade Payment Using Logistic Regression

  • Joo, Hye-Young;Lee, Dong-Jun
    • Journal of Korea Trade
    • /
    • 제25권2호
    • /
    • pp.111-133
    • /
    • 2021
  • Purpose - Although remittance payment in international trade settlements has played a bigger role in recent years, scant research is being done. This study is to zero in on analyzing determinants of international trade payments focused on remittance by constructing a payment prediction model. Design/methodology - This study categorizes the types of trade payments into advance remittance, post remittance, linked remittance, letter of credit, and mixed payment, and analyzes these after constructing a logit model. For empirical analysis, 147 survey data were collected for export manufacturers in Korea, and binominal logistic regression analysis was used to analyze the type of payment method the exporter chooses for trade transactions. Findings - The likelihood of choosing advance remittance increased as the exporters had non-recovery experiences with payments, and decreased as the market power of importers increased. The possibility of post remittance increased when the export amount was large and the character of the buyer was reliable. In the case of linked remittance, it was highly likely to be selected when payment efficiency was important in trade settlement. In addition, when competition among companies in the global market is intense and market uncertainty is high, the possibility of using a letter of credit decreases. It was also found that the greater the export amount, the greater the possibility of choosing advance remittance, and even if the transaction period was longer, exporters using a letter of credit continued to use it. Originality/value - Despite the high proportion of remittances in international trade settlements, it has been hard to find studies that reflect the practical characteristics of remittances. This study classified the types of remittance into advance remittance, post remittance, and linked remittance, and built a trade payment prediction model by adding a letter of credit and mixed payment. In addition, the originality of this study is recognized in that a logistic model was constructed and meaningful results were derived.