Search | Korea Science

Comparison of Feature Selection Methods in Support Vector Machines (지지벡터기계의 변수 선택방법 비교)

Kim, Kwangsu;Park, Changyi
- The Korean Journal of Applied Statistics
- /
- v.26 no.1
- /
- pp.131-139
- /
- 2013
Support vector machines(SVM) may perform poorly in the presence of noise variables; in addition, it is difficult to identify the importance of each variable in the resulting classifier. A feature selection can improve the interpretability and the accuracy of SVM. Most existing studies concern feature selection in the linear SVM through penalty functions yielding sparse solutions. Note that one usually adopts nonlinear kernels for the accuracy of classification in practice. Hence feature selection is still desirable for nonlinear SVMs. In this paper, we compare the performances of nonlinear feature selection methods such as component selection and smoothing operator(COSSO) and kernel iterative feature extraction(KNIFE) on simulated and real data sets.
https://doi.org/10.5351/KJAS.2013.26.1.131 인용 PDF KSCI

내구소비재 보유함수의 추정: 이진수 종속변수를 이용한 회귀분석

Yoon, Suk Bum;Lee, Hoe Kyung
- Journal of the Korean Statistical Society
- /
- v.6 no.2
- /
- pp.117-154
- /
- 1977
본논문에서는 첫째로 단일방정식 모형에서 종속변수가 양자택일(binary choice)의 이산확률변수일 때 이러한 이진적 종속변수(binary dependent variable)의 변동을 설명하는데 사용되는 몇 가지 모형을 소개하고 각각의 표기 및 추정방법, 추정량의 성질, 예측 및 검정 문제 등에 관하여 비교 서술하고자 한다. 둘째, 종속변수가 이산과 연속의 혼합형태일 때 앞에 소개된 모형이 어떻게 적용될 수 있는가를 살펴보며, 셋째, 선택대상 및 종속변수의 수가 증가하여 일반화된 선다형모형(multiple choice model)의 경우, 표기 및 추정방법을 단일방정식 기법을 이용하여 추가로 총람하고자 한다. 넷째, 본논문에서는 또한 내구소비재 구입에 관한 조사자료를 이용하여 실제 많이 사용되는 몇 개의 모형을 선택하여 적용하고 각각의 예측력을 분석함으로써 각 모형을 비교 검토하는데 목적이 있다.
PDF

신경망기법을 이용한 기업부실예측에 관한 연구

Jeong, Gi-Ung;Hong, Gwan-Su
- The Korean Journal of Financial Management
- /
- v.12 no.2
- /
- pp.1-23
- /
- 1995
본 연구의 목적은 특정 금융기관의 주거래기업들에 대한 부실예측을 위해 주거래기업들을 잠식, 도산, 그리고 건전기업과 같이 세집단으로 구분하여 예측하고자 하며, 기업부실 예측력에 영향을 미치는 세 가지 요인으로서 표본구성, 투입 변수, 분석 기법의 관점에서 다음을 살펴보는 것이다. 첫째, 기업부실예측에서 전통적인 delta learning rule과 sigmoid함수를 사용한 역전파학습(신경망 I)과 이들의 변형형태인 normalized cumulative delta learning rule과 hyperbolic tangent함수를 사용한 역전파 학습(신경망 II)과의 예측력의 차이를 살펴보고 또한 이러한 두가지 신경망기법의 예측력을 MDA(다변량판별분석) 결과와 비교하여 신경망기법에 대한 예측력의 유용성을 살펴보고자 한다. 둘째, 세집단분류문제에서는 잠식, 도산, 건전기업의 구성비율이 위의 세가지 예측기법의 결과에 어떠한 영향을 미치는지를 살펴보고자 한다. 세째, 투입 변수선정은 기존연구 또는 이론을 바탕으로 연구자의 판단에 의해 선택하는 방법과 다수의 변수를 가지고 통계적기법에 의해 좋은 판별변수의 집합을 찾는 것이다. 본 연구에서는 이러한 방법들에 의해 선정된 투입변수들이 세가지 예측기법의 결과에 어떠한 영향을 미치는지를 살펴보고자 한다. 이러한 관점에서 본 연구의 실증분석 결과를 요약하면 다음과 같다. 1) 신경망기법이 두집단에서와 같이 세집단 분류문제에서도 MDA보다는 더 높은 예측력을 보였다. 2) 잠식과 도산기업의 수는 비슷하게 그리고 건전기업의 수는 잠식과 도산기업을 합한 수와 비슷하게 표본을 구성하는 것이 예측력을 향상하는데 도움이 된다고 할 수 있다. 3) 속성별로 고르게 투입변수로 선정한 경우가 그렇지 않은 경우보다 더 높은 예측력을 보였다. 4) 전통적인 delta learning rule과 sigmoid함수를 사용한 역전파학습 보다는 normalized cumulative delta learning rule과 hyperbolic tangent함수를 사용한 역전파 학습이 더 높은 예측력을 보였다. 이러한 현상은 두집단문제에서 보다 세집단문제에서 더 큰 차이를 나타내고 있다.
PDF

Influence of a Choice Attribute of Hotel Banquet Event Menu on Customer Satisfaction - Focusing on the P Hotel - (호텔 연회장 이벤트 메뉴 선택 속성이 고객 만족에 미치는 영향 - P호텔을 중심으로 -)

Lee, In-Sung;Lee, Sang-Won;Lee, Kwang-Ock
- Culinary science and hospitality research
- /
- v.15 no.3
- /
- pp.15-28
- /
- 2009
The purpose of this study is to examine customer behavior when choosing event menu at the banquet restaurant of a five-star hotel and analyze the factors of choosing the menu and its customer satisfaction. Based on the results of this study, it is possible for hotel mangers to develop good banquet event menu choices and use them when changing menu. This study adopts the Enter Method, and "t" defines 3 variables such as physiological intent, quality of food, and reasonable price. However, the other factors such as sensory images of food, contents of menu, promotion menu, cleanliness and services prove not to be important variables in this study. Among most important 3 variables, quality of food with the highest figure($\beta$ .416) is the most important variable to customer satisfaction followed by physiological intent ($\beta$ .283) and resonable price($\beta$ .134).
PDF

Selected Factors Associated with Formal and Informal Learning During the First Pregnancy (첫 임신 동안 정규와 비정규 학습과 관련되는 선택된 요소들)

Lee, Connie W.
- Women's Health Nursing
- /
- v.8 no.3
- /
- pp.446-458
- /
- 2002
목적: 현 연구는 6-12주된 첫 아기를 둔 여성 152명을 대상으로 첫 임신기간 동안 참여했던 정규와 비정규 학습활동의 정도와 범위를 알아보기 위해 시행된 것이다. 설계와 방법: 연구를 위한 정보수집을 위해 다측면의 자가선택 연구도구를 사용하였고 4가지 구성개념을 측정하였다: 독립변수로 학습에 대한 성향, 학습을 위한 사회적 지지, 임신에 대한 기대, 그리고 종속변수로서 첫 임신 동안 정규와 비정규 학습활동의 참여정도. 또한 일반적 특성으로는 6가지 변수(연령, 인종, 결혼상태, 교육정도, 직업유무, 가족의 수입)를 측정하였다. 3개의 독립변수와 1개의 종속변수사이의 이변량관계를 측정하기 위해 단순회귀분석을 시행하였다. 일반적 특성과 종속변수와의 관계는 t-Test, Point Biserial, Spearman Correlation Coefficients를 이용해 조사하였고 임신 중 학습을 위한 설명적 모델을 유도하기 위해 다중회귀분석을 사용하였다. 결과: 연구결과 3가지 이변량관계가 모두 통계적으로 유의한 것으로 나타났다: 학습에 대한 성향($r^2$=.17, p=.001), 사회적 지지($r^2$=.27, p=.0001), 그리고 기대($r^2$=.17, p=.0001). 교육정도와 결혼상태는 강한 설명력을 가졌고 연령, 인종, 직업유무와 가족의 수입은 통계적으로 유의하였으나 설명력이 약한 변수들이었다. 임상과의 관계: 이러한 결과는 임신 동안 여성의 학습욕구를 만족시키기를 바라는 성인대상의 교육자들 만큼이나 건강관리제공자들을 위한 실제적인 관련성을 보여주고 있으며 또한 중요한 생활사인 임신 동안 자발적인 학습을 하려는 성인을 이해하기 위한 이론적으로 유용한 모델을 제공한다.
PDF

Multi-objective Genetic Algorithm for Variable Selection in Linear Regression Model and Application (선형회귀모델의 변수선택을 위한 다중목적 유전 알고리즘과 응용)

Kim, Dong-Il;Park, Cheong-Sool;Baek, Jun-Geol;Kim, Sung-Shick
- Journal of the Korea Society for Simulation
- /
- v.18 no.4
- /
- pp.137-148
- /
- 2009
The purpose of this study is to implement variable selection algorithm which helps construct a reliable linear regression model. If we use all candidate variables to construct a linear regression model, the significance of the model will be decreased and it will cause 'Curse of Dimensionality'. And if the number of data is less than the number of variables (dimension), we cannot construct the regression model. Due to these problems, we consider the variable selection problem as a combinatorial optimization problem, and apply GA (Genetic Algorithm) to the problem. Typical measures of estimating statistical significance are $R^2$, F-value of regression model, t-value of regression coefficients, and standard error of estimates. We design GA to solve multi-objective functions, because statistical significance of model is not to be estimated by a single measure. We perform experiments using simulation data, designed to consider various kinds of situations. As a result, it shows better performance than LARS (Least Angle Regression) which is an algorithm to solve variable selection problems. We modify algorithm to solve portfolio selection problem which construct portfolio by selecting stocks. We conclude that the algorithm is able to solve real problems.
https://doi.org/10.9709/JKSS.2009.18.4.137 인용 PDF

A Study on Optimal Identification of Fuzzy Polynomial Neural Networks Model Using Genetic Algorithms (유전자 알고리즘을 이용한 FPNN 모델의 최적 동정에 관한 연구)

이인태;박호성;오성권
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2004.10a
- /
- pp.429-432
- /
- 2004
본 논문은 기존의 퍼지 다항식 뉴럴 네트워크 (Fuzzy Polynomial Neural Networks ; FPNN) 모델을 이용하여 비선형성 데이터에 대한 추론을 제안한다. 복잡한 비선형 시스템의 모델동정을 위하여 생성된 GMDH 방법에 기초한 FPNN의 각 노드는 퍼지 규칙을 기반으로 구축되었으며, 층이 진행되는 동안 모델 스스로 노드의 선택과 제거를 통해 최적의 네트워크 구조를 생성할 수 있는 유연성을 가지고 있다. FPNN 각각의 활성노드를 퍼지다항식 뉴론(Fuzzy Polynomial Neuron ; FPN)이라고 표현한다. FPNN의 후반부 구조는 입출력 변수 사이 의 간략과 회귀다항식 (1차, 2차, 변형된 2차식) 함수에 의해 구현된다. 규칙의 전반부 멤버쉽 함수는 삼각형과 가우시안형의 멤버쉽 함수가 사용된다. 또한 유전자 알고리즘을 사용하여 각노드의 부분표현식을 구성하는 입력변수의 수, 입력변수와 차수의 선택 동조를 통하여 최적의 Genetic Algorithms(GAs)을 이용한 FPNN모델을 설계하는 것이 유용하고 효과적임을 보인다.
PDF

Groundwater level prediction model using artificial neural network technique (인공신경망기법을 이용한 지하수위 예측모형)

Chung, Il-Moon;Lee, Jeongwoo;Kim, Jitae;Park, Inchan
- Proceedings of the Korea Water Resources Association Conference
- /
- 2016.05a
- /
- pp.562-562
- /
- 2016
신경망 모형에서 학습이란 주어진 입출력시스템에 대하여 원하는 동작을 수행할 수 있도록 연결 강도를 최적의 상태로 적응(adaptation)시키는 과정을 의미한다. 따라서 강수와 지하수위의 관계를 연계시킨 인공신경망기법은 선택적으로 예측 지하수위에 영향을 미치는 변수들을 학습에 의하여 택함으로써 예측모형을 구성할 수 있다. 즉, 예측 지하수위와의 상관관계에 의하여 입력되는 변수와의 연결강도를 조정하여 매개변수 조정 및 모형의 최적화를 자동화할 수 있다. 본 연구에서는 지하수위에 영향을 주는 요소는 지하수위와 강우량이라고 가정하고, 지하수위의 입출력과정을 시계열 분석에 의하여 모형화하였으며 예측지하수위는 강우 및 지하수위의 선행조건과 매우 밀접한 관계를 갖는다. 따라서 선행강우 및 지하수위의 상태에 따라 이를 입력하여 미래의 지하수위를 예측하게 된다. 이 모형을 제주지역의 관측소에 적용한 결과 관측소별로 타당한 예측결과를 도출하였다.
PDF

The Effect on Firm's Performance of Employee Stock Option (종업원의 주식보상시스템이 기업성과에 미치는 영향)

Park, Jong-Hyuk
- Management & Information Systems Review
- /
- v.28 no.1
- /
- pp.71-97
- /
- 2009
In this study, I compare the ability of alternative accounting method for employee stock option to reflect firm value using the Ohlson's(1995) valuation model for 200 firms. The each methods, I compare are employee stock option expense recognition based on the K-GAAP disclosures, and asset recognition at the grant date based on the SFAS No. 123 Exposure Draft: Accounting for stock-based compensation. The model include: (1) a model that uses reported earnings, equity book value, and compensation expense based on the K-GAAP disclosures; (2) a model that uses pro-forma earnings, equity book value and adds a measure of the unrecognized asset arising form granting of employee stock options. Finding form estimating equations that the K-GAAP method for calculating compensation has no explanatory power, and the SFAS No.123 Draft Exposure method for arising asset and fair value compensation better captures than market's perception of the economic impact of stock options on firm values. However, the correlation of employee stock option compensation expense is positive. These results suggest that incentive benefits derived from employee stock option plans outweigh the cost associated with plan. In addition, I couldn't find evidence that company in KOSDAQ that have high growth potential benefit more from employee stock option plan compared to lager, more mature firm in SEC.
PDF

The impact of the change in the splitting method of decision trees on the prediction power (의사결정나무의 분기법 변화가 예측력에 미치는 영향)

Chang, Youngjae
- The Korean Journal of Applied Statistics
- /
- v.35 no.4
- /
- pp.517-525
- /
- 2022
In the era of big data, various data mining techniques have been proposed as major analysis methodologies. As complex and diverse data is mass-produced, data mining techniques have attracted attention as a method that forms the foundation of data science. In this paper, we focused on the decision tree, which is frequently used in practice and easy to understand as one of representative data mining methods. Specifically, we analyzed the effect of the splitting method of decision trees on the model performance. We compared the prediction power and structures of decision tree models with different split methods based on various simulated data. The results show that the linear combination split method can improve the prediction accuracy of decision trees in the case of data simulated from nonlinear models with complex structure.
https://doi.org/10.5351/KJAS.2022.35.4.517 인용 PDF KSCI

Search Result 235, Processing Time 0.037 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)