• Title/Summary/Keyword: Feature Variables

Search Result 366, Processing Time 0.03 seconds

Counterfactual image generation by disentangling data attributes with deep generative models

  • Jieon Lim;Weonyoung Joo
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.6
    • /
    • pp.589-603
    • /
    • 2023
  • Deep generative models target to infer the underlying true data distribution, and it leads to a huge success in generating fake-but-realistic data. Regarding such a perspective, the data attributes can be a crucial factor in the data generation process since non-existent counterfactual samples can be generated by altering certain factors. For example, we can generate new portrait images by flipping the gender attribute or altering the hair color attributes. This paper proposes counterfactual disentangled variational autoencoder generative adversarial networks (CDVAE-GAN), specialized for data attribute level counterfactual data generation. The structure of the proposed CDVAE-GAN consists of variational autoencoders and generative adversarial networks. Specifically, we adopt a Gaussian variational autoencoder to extract low-dimensional disentangled data features and auxiliary Bernoulli latent variables to model the data attributes separately. Also, we utilize a generative adversarial network to generate data with high fidelity. By enjoying the benefits of the variational autoencoder with the additional Bernoulli latent variables and the generative adversarial network, the proposed CDVAE-GAN can control the data attributes, and it enables producing counterfactual data. Our experimental result on the CelebA dataset qualitatively shows that the generated samples from CDVAE-GAN are realistic. Also, the quantitative results support that the proposed model can produce data that can deceive other machine learning classifiers with the altered data attributes.

AI Performance Based On Learning-Data Labeling Accuracy (인공지능 학습데이터 라벨링 정확도에 따른 인공지능 성능)

  • Ji-Hoon Lee;Jieun Shin
    • Journal of Industrial Convergence
    • /
    • v.22 no.1
    • /
    • pp.177-183
    • /
    • 2024
  • The study investigates the impact of data quality on the performance of artificial intelligence (AI). To this end, the impact of labeling error levels on the performance of artificial intelligence was compared and analyzed through simulation, taking into account the similarity of data features and the imbalance of class composition. As a result, data with high similarity between characteristic variables were found to be more sensitive to labeling accuracy than data with low similarity between characteristic variables. It was observed that artificial intelligence accuracy tended to decrease rapidly as class imbalance increased. This will serve as the fundamental data for evaluating the quality criteria and conducting related research on artificial intelligence learning data.

Multi-objective structural optimization of spatial steel frames with column orientation and bracing system as design variables

  • Claudio H. B. de Resende;Luiz F. Martha;Afonso C. C. Lemonge;Patricia H. Hallak;Jose P. G. Carvalho;Julia C. Motta
    • Advances in Computational Design
    • /
    • v.8 no.4
    • /
    • pp.327-351
    • /
    • 2023
  • This article explores how multi-objective optimization techniques can be used to design cost-effective and structurally optimal spatial steel structures, highlighting that optimizing performance can be as important as minimizing costs in real-world engineering problems. The study includes the minimization of maximum horizontal displacement, the maximization of the first natural frequency of vibration, the maximization of the critical load factor concerning the first global buckling mode of the structure, and weight minimization as the objectives. Additionally, it outlines a systematic approach to selecting the best design by employing four different evolutionary algorithms based on differential evolution and a multi-criteria decision-making methodology. The paper's contribution lies in its comprehensive consideration of multiple conflicting objectives and its novel approach to simultaneous consideration of bracing system, column orientation, and commercial profiles as design variables.

Optimization of Multiclass Support Vector Machine using Genetic Algorithm: Application to the Prediction of Corporate Credit Rating (유전자 알고리즘을 이용한 다분류 SVM의 최적화: 기업신용등급 예측에의 응용)

  • Ahn, Hyunchul
    • Information Systems Review
    • /
    • v.16 no.3
    • /
    • pp.161-177
    • /
    • 2014
  • Corporate credit rating assessment consists of complicated processes in which various factors describing a company are taken into consideration. Such assessment is known to be very expensive since domain experts should be employed to assess the ratings. As a result, the data-driven corporate credit rating prediction using statistical and artificial intelligence (AI) techniques has received considerable attention from researchers and practitioners. In particular, statistical methods such as multiple discriminant analysis (MDA) and multinomial logistic regression analysis (MLOGIT), and AI methods including case-based reasoning (CBR), artificial neural network (ANN), and multiclass support vector machine (MSVM) have been applied to corporate credit rating.2) Among them, MSVM has recently become popular because of its robustness and high prediction accuracy. In this study, we propose a novel optimized MSVM model, and appy it to corporate credit rating prediction in order to enhance the accuracy. Our model, named 'GAMSVM (Genetic Algorithm-optimized Multiclass Support Vector Machine),' is designed to simultaneously optimize the kernel parameters and the feature subset selection. Prior studies like Lorena and de Carvalho (2008), and Chatterjee (2013) show that proper kernel parameters may improve the performance of MSVMs. Also, the results from the studies such as Shieh and Yang (2008) and Chatterjee (2013) imply that appropriate feature selection may lead to higher prediction accuracy. Based on these prior studies, we propose to apply GAMSVM to corporate credit rating prediction. As a tool for optimizing the kernel parameters and the feature subset selection, we suggest genetic algorithm (GA). GA is known as an efficient and effective search method that attempts to simulate the biological evolution phenomenon. By applying genetic operations such as selection, crossover, and mutation, it is designed to gradually improve the search results. Especially, mutation operator prevents GA from falling into the local optima, thus we can find the globally optimal or near-optimal solution using it. GA has popularly been applied to search optimal parameters or feature subset selections of AI techniques including MSVM. With these reasons, we also adopt GA as an optimization tool. To empirically validate the usefulness of GAMSVM, we applied it to a real-world case of credit rating in Korea. Our application is in bond rating, which is the most frequently studied area of credit rating for specific debt issues or other financial obligations. The experimental dataset was collected from a large credit rating company in South Korea. It contained 39 financial ratios of 1,295 companies in the manufacturing industry, and their credit ratings. Using various statistical methods including the one-way ANOVA and the stepwise MDA, we selected 14 financial ratios as the candidate independent variables. The dependent variable, i.e. credit rating, was labeled as four classes: 1(A1); 2(A2); 3(A3); 4(B and C). 80 percent of total data for each class was used for training, and remaining 20 percent was used for validation. And, to overcome small sample size, we applied five-fold cross validation to our dataset. In order to examine the competitiveness of the proposed model, we also experimented several comparative models including MDA, MLOGIT, CBR, ANN and MSVM. In case of MSVM, we adopted One-Against-One (OAO) and DAGSVM (Directed Acyclic Graph SVM) approaches because they are known to be the most accurate approaches among various MSVM approaches. GAMSVM was implemented using LIBSVM-an open-source software, and Evolver 5.5-a commercial software enables GA. Other comparative models were experimented using various statistical and AI packages such as SPSS for Windows, Neuroshell, and Microsoft Excel VBA (Visual Basic for Applications). Experimental results showed that the proposed model-GAMSVM-outperformed all the competitive models. In addition, the model was found to use less independent variables, but to show higher accuracy. In our experiments, five variables such as X7 (total debt), X9 (sales per employee), X13 (years after founded), X15 (accumulated earning to total asset), and X39 (the index related to the cash flows from operating activity) were found to be the most important factors in predicting the corporate credit ratings. However, the values of the finally selected kernel parameters were found to be almost same among the data subsets. To examine whether the predictive performance of GAMSVM was significantly greater than those of other models, we used the McNemar test. As a result, we found that GAMSVM was better than MDA, MLOGIT, CBR, and ANN at the 1% significance level, and better than OAO and DAGSVM at the 5% significance level.

The Extended Technology Acceptance Model According to Smart Clothing Types (스마트 의류제품 유형에 따른 확장된 혁신기술수용모델)

  • Chae, Jin-Mie
    • Korean Journal of Human Ecology
    • /
    • v.19 no.2
    • /
    • pp.375-387
    • /
    • 2010
  • The Technology Acceptance Model (TAM) presented by Davis (1989) has been regarded as highly explanatory as well as the clearest model in explaining consumers' adoption of innovative technology or products. Existing studies have expanded the model by adding related external variables to improve the explanation depending on the type of innovative technology. This study expanded TAM by adding two more variables, namely consumers' technology innovation and clothing involvement considering the feature of smart clothing. The objectives of this study are as follows: 1. to suggest the extended TAM in explaining the adoption process of smart clothing, 2. to verify the differences in the path hypotheses according to the type of smart clothing. A total of 815 effective samples were collected from adults over 20 years old, and AMOS 5.0 package was employed for data analysis. As a result, it was proved that the extended TAM was appropriate for explaining the process of adopting smart clothing according to the path hypotheses of smart clothing types. Technology innovation and clothing involvement were confirmed as antecedent variables in affecting TAM. The perceived usefulness appeared to be a more crucial variable than the perceived ease of use and attitude was found to be an important parameter in adopting smart clothing. Considering the path hypotheses of MP3 playing clothes, perceived usefulness had a direct influence on acceptance intention unlike other types of smart clothing. As for photonic clothes, the influence of perceived ease of use on attitude was supported while it was rejected in the case of MP3 playing clothes and sensing sportswear.

A Generalized Method for Extracting Characters and Video Captions (일반화된 문자 및 비디오 자막 영역 추출 방법)

  • Chun, Byung-Tae;Bae, Young-Lae;Kim, Tai-Yun
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.6
    • /
    • pp.632-641
    • /
    • 2000
  • Conventional character extraction methods extract character regions using methods such as color reduction, region split and merge and texture analysis from the whole image. Because these methods use many heuristic variables and thresholding values derived from a priori knowledge, it is difficult to generalize them algorithmically. In this paper, we propose a method that can extract character regions using a topographical feature extraction method and a point-line-region extension method. The proposed method can also solve the problems of conventional methods by reducing heuristic variables and generalizing thresholding values. We see that character regions can be extracted by generalized variables and thresolding values without using a priori knowledge of character region. Experimental results show that the candidate region extraction rate is 100%, and the character region extraction rate is over 98%.

  • PDF

A Study of the Landscape Preference for 'Oreu'm by Analyses of Features & Visual Elements ("오름"의 형태와 시각량 분석을 통한 경관선호성 평가)

  • Kim, Sang-Beom;Sim, Woo-Kyung;Rho, Jae-Hyun
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.35 no.1 s.120
    • /
    • pp.48-58
    • /
    • 2007
  • The purpose of this study was not only to clearly examine the features of the scenery and visual elements of Oreum (parasitic cones) but also to investigate primary factors in landscape preferences for these cones. This study further attempted to gain basic information for examining the preservation of Oreum in regards to the influence of scenery on the general public. A Multiple Regression Analysis was used for this study for which the independent variable was the area ratio of the fore-, mid-, and background of the view under the feature element and the structure of the scenery at the Oreum. The dependent variables were the preference value, the number of summits, and the highest altitude of the Oreum. All but the highest inclination were positive variables. The area ratio of the Oreum was found to be the most important variable. The area of sky and the area of the distant scenery were shown to be positive explanation variables, while the area to the fore of the view and the area to the middle of the view were shown as negative explanation variable. In the preference for Oreum scenery, which has a high visibility and is clearly outlined against the skyline, it was found that as the hindrance element of visibility near to a visual point or the area ratio increased, the preference for the Oreum scenery decreased.

An Analysis for the Adjustment Process of Market Variations by the Formulation of Time tag Structure (시차구조의 설정에 따른 시장변동의 조정과정 분석)

  • 김태호;이청림
    • The Korean Journal of Applied Statistics
    • /
    • v.16 no.1
    • /
    • pp.87-100
    • /
    • 2003
  • Most of statistical data are generated by a set of dynamic, stochastic, and simultaneous relations. An important question is how to specify statistical models so that they are consistent with the dynamic feature of those data. A general hypothesis is that the lagged effect of a change in an explanatory variable is not felt all at once at a single point in time, but The impact is distributed over a number of future points in time. In other words, current control variables are determined by a function that can be reduced to a distributed lag function of past observations. It is possible to explain the relationship between variables in different points of time and to estimate the long-run impacts of a change in a variable on another if time lag series of explanatory variables are incorporated in the model specification. In this study, distributed lag structure is applied to the domestic stock market model to capture the dynamic response of the market by exogenous shocks. The Domestic market is found more responsive to the changes in foreign market factors both in the short and the long run.

Performance Improvement of General Regression Neural Network Using Principal Component Analysis (주요성분분석에 의한 일반회귀 신경망의 성능개선)

  • Cho, Yong-Hyun
    • The Transactions of the Korea Information Processing Society
    • /
    • v.7 no.11
    • /
    • pp.3408-3416
    • /
    • 2000
  • This paper proposes an efficient method for improving the performance of a general regression neural network by using the feature to the independent variables as the center for partern-layer neurons. The adaptive principal component analysis is applied for extracting, efficiently the fcarures by reducing the dimension of given independent variables. In can acluevc a supertor property of the principal component analysis that converts input data into set of statistically independent features and the general regression neuralnetwork, espedtively. The proposed general regression neural network has been applied to regress the Solow's economy(2-independent variable set) and the wie elephone(1-independent vanable set). The simulation results show that the proposed meural networks have better performances of the regressionfor the lest data, in comparison with those using the means or the weighted means of independent variables. Also,it is affected less by the number of neurons and the scope of the smoothing factor.

  • PDF

Multi-alternative Retrofit Modelling and its Application to Korean Generation Capacity Expansion Planning (발전설비확장계획에서 다중대안 리트로핏 모형화 방안 및 사례연구)

  • Chung, Yong Joo
    • The Journal of Information Systems
    • /
    • v.29 no.1
    • /
    • pp.75-91
    • /
    • 2020
  • Purpose Retrofit, defined to be addition of new technologies or features to the old system to increase efficiency or to abate GHG emissions, is considered as an important alternative for the old coal-fired power plant. The purpose of this study is to propose mathematical method to model multiple alternative retrofit in Generation Capacity Expansion Planning(GCEP) problem, and to get insight to the retrofit patterns from realistic case studies. Design/methodology/approach This study made a multi-alternative retrofit GECP model by adopting some new variables and equations to the existing GECP model. Added variables and equations are to ensure the retrofit feature that the life time of retrofitted plant is the remaining life time of the old power plant. We formulated such that multiple retrofit alternatives are simultaneously compared and the best retrofit alternative can be selected. And we found that old approach to model retrofit has a problem that old plant with long remaining life time is retrofitted earlier than the one with short remaining life time, fixed the problem by some constraints with some binary variables. Therefore, the proposed model is formulated into a mixed binary programming problem, and coded and run using the GAMS/cplex. Findings According to the empirical analysis result, we found that approach to model the multiple alternative retrofit proposed in this study is comparing simultaneously multiple retrofit alternatives and select the best retrofit satisfying the retrofit features related to the life time. And we found that retrofit order problem is cleared. In addition, the model is expected to be very useful in evaluating and developing the national policies concerning coal-fired power plant retrofit.