• 제목/요약/키워드: regression analysis method

검색결과 4,587건 처리시간 0.028초

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제6권4호
    • /
    • pp.277-281
    • /
    • 2006
  • Cyber counseling, one of the most compatible type of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require face-to-face interview between a counsellor and a client. However, there are few cyber counseling centers which provide high quality and trustworthy service, although the number of cyber counseling center has highly increased. Therefore, this paper is intended to enable an appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to stepwise regression analysis method in finding out a regression model. Stepwise regression analysis method, which has been generally used to analyze individual propensity in linear regression model, is not efficient since it is hard to select a proper model for its own defects. In this paper, based on the case database of current cyber counseling centers in the web, we will analyze clients' propensities using Bayesian variable selection to enable individually target counseling and to activate cyber counseling programs.

Support Vector Machine for Interval Regression

  • Hong Dug Hun;Hwang Changha
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2004년도 학술발표논문집
    • /
    • pp.67-72
    • /
    • 2004
  • Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval linear and nonlinear regression models combining the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, the possibility and necessity models have been recently utilized, which are based on quadratic programming approach giving more diverse spread coefficients than a linear programming one. SVM also uses quadratic programming approach whose another advantage in interval regression analysis is to be able to integrate both the property of central tendency in least squares and the possibilistic property In fuzzy regression. However this is not a computationally expensive way. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high dimensional feature space. In particular, SVM is a very attractive approach to model nonlinear interval data. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function for interval nonlinear regression model with crisp inputs and interval output. Experimental results are then presented which indicate the performance of this algorithm.

  • PDF

냉방열원의 경제성 평가를 위한 건물에너지 회귀식 산출 (Energy Regression Analysis for Economic Evaluation of Cooling Plants)

  • 김영섭;김강수
    • 설비공학논문집
    • /
    • 제14권5호
    • /
    • pp.377-384
    • /
    • 2002
  • For economic evaluation of cooling plant equipments, it is necessary to simplify energy Prediction method, which should includes efficiency corrected by part-load ratio. This study proposed simplified method with regression equations of time-average partial loads and refrigerator capacity. DOE-2 Program was used to carry out a parametric study of twelve design variables. Five input variables were considered to be significant and were used in the regression equations. To test accuracy of simplified method, calculated results were compared with DOE-2 simulated results. Test result showes a good agreement with the simulation result with an error of 5.9∼7.6%. It is expected that this method can be used as an easy prediction tool for comparing energy use of different cooling plants during the early design stage.

확장형 칼만필터 알고리즘을 활용한 차량 주행에 따른 마찰소음의 총 음압레벨 예측 (Estimation of Total Sound Pressure Level for Friction Noise Regarding a Driving Vehicle using the Extended Kalman Filter Algorithm)

  • 김도완;한범수;문성호;안덕순
    • 한국도로학회논문집
    • /
    • 제16권5호
    • /
    • pp.59-66
    • /
    • 2014
  • PURPOSES : This study is to predict the Sound Pressure Level(SPL) obtained from the Noble Close ProXimity(NCPX) method by using the Extended Kalman Filter Algorithm employing the taylor series and Linear Regression Analysis based on the least square method. The objective of utilizing EKF Algorithm is to consider stochastically the effect of error because the Regression analysis is not the method for the statical approach. METHODS : For measuring the friction noise between the surface and vehicle's tire, NCPX method was used. With NCPX method, SPL can be obtained using the frequency analysis such as Discrete Fourier Transform(DFT), Fast Fourier Transform(FFT) and Constant Percentage Bandwidth(CPB) Analysis. In this research, CPB analysis was only conducted for deriving A-weighted SPL from the sound power level in terms of frequencies. EKF Algorithm and Regression analysis were performed for estimating the SPL regarding the vehicle velocities. RESULTS : The study has shown that the results related to the coefficient of determination and RMSE from EKF Algorithm have been improved by comparing to Regression analysis. CONCLUSIONS : The more the vehicle is fast, the more the SPL must be high. But in the results of EKF Algorithm, SPLs are irregular. The reason of that is the EKF algorithm can be reflected by the error covariance from the measurements.

A comparative study of the Gini coefficient estimators based on the regression approach

  • Mirzaei, Shahryar;Borzadaran, Gholam Reza Mohtashami;Amini, Mohammad;Jabbari, Hadi
    • Communications for Statistical Applications and Methods
    • /
    • 제24권4호
    • /
    • pp.339-351
    • /
    • 2017
  • Resampling approaches were the first techniques employed to compute a variance for the Gini coefficient; however, many authors have shown that an analysis of the Gini coefficient and its corresponding variance can be obtained from a regression model. Despite the simplicity of the regression approach method to compute a standard error for the Gini coefficient, the use of the proposed regression model has been challenging in economics. Therefore in this paper, we focus on a comparative study among the regression approach and resampling techniques. The regression method is shown to overestimate the standard error of the Gini index. The simulations show that the Gini estimator based on the modified regression model is also consistent and asymptotically normal with less divergence from normal distribution than other resampling techniques.

Bayesian Semi-Parametric Regression for Quantile Residual Lifetime

  • Park, Taeyoung;Bae, Wonho
    • Communications for Statistical Applications and Methods
    • /
    • 제21권4호
    • /
    • pp.285-296
    • /
    • 2014
  • The quantile residual life function has been effectively used to interpret results from the analysis of the proportional hazards model for censored survival data; however, the quantile residual life function is not always estimable with currently available semi-parametric regression methods in the presence of heavy censoring. A parametric regression approach may circumvent the difficulty of heavy censoring, but parametric assumptions on a baseline hazard function can cause a potential bias. This article proposes a Bayesian semi-parametric regression approach for inference on an unknown baseline hazard function while adjusting for available covariates. We consider a model-based approach but the proposed method does not suffer from strong parametric assumptions, enjoying a closed-form specification of the parametric regression approach without sacrificing the flexibility of the semi-parametric regression approach. The proposed method is applied to simulated data and heavily censored survival data to estimate various quantile residual lifetimes and adjust for important prognostic factors.

산업재해 사례인자의 범주형 분석 (Categorical Analysis for the Factors of Incustrial Accident Cases)

  • 지경택;송영호;정국삼
    • 한국안전학회지
    • /
    • 제17권1호
    • /
    • pp.94-98
    • /
    • 2002
  • This study aimed to search for the fundamental accident causes using a categorical analysis, a kind of statistical methods. As the analysis methods, correlation analysis, independence test and logistic regression analysis were used. And the SPSS package, a general-purpose mathematical library, was used to obtain statistical characteristics. As the result of this study, the accident causes associated with factor of 'lost working days' were factors such as 'employed periods', 'sex', 'type of accident', 'month'. In case of applying independence test method, the most important cause was the factor of 'month'. In case that logistic regression analysis method was applied, the cause contributed to the increase structure'. 'less than 6 month'. On the basis of these results, the plan for accident prevention and the proper investment for accident prevention expenditure could be carried out in each workshop.

유전알고리즘을 이용한 능형회귀모형의 검정 : 빈도별 홍수량의 지역분석을 대상으로 (Calibration of the Ridge Regression Model with the Genetic Algorithm:Study on the Regional Flood Frequency Analysis)

  • 성기원
    • 한국수자원학회논문집
    • /
    • 제31권1호
    • /
    • pp.59-69
    • /
    • 1998
  • 빈도별 홍수량의 지역분석을 위하여 유역의 지형특성을 독립변수로 이용하는 회귀모형을 검정하였다. 그런데 이들 독립변수들간의 상관관계가 존재할 경우 능형회귀모형이 이용되기도 하는 이 방법은 다중공선성 문제를 극복하는데 적합한 방법으로 알려져 있다. 능형회귀모형을 최적화하기 위해서는 조정변수가 포함되는 비용함수를 최소화하여야 한다. 본 연구에서는 이 최적화를 위하여 유전알고리즘을 이용하였다. 유전알고리즘은 자연 생물의 유전 및 진화과정을 모방한 추계학적 탐색방법을 말한다. 이러한 유전알고리즘을 이용하여 지역분석 모형을 검정한 결과 안정된 매개변수의 가중치를 얻을 수 있었다.

  • PDF

경쟁적 위험하에서의 회귀분석 (Competing Risks Regression Analysis)

  • 백재욱
    • 한국신뢰성학회지:신뢰성응용연구
    • /
    • 제18권2호
    • /
    • pp.130-142
    • /
    • 2018
  • Purpose: The purpose of this study is to introduce regression method in the presence of competing risks and to show how you can use the method with hypothetical data. Methods: Survival analysis has been widely used in biostatistics division. But the same method has not been utilized in reliability division. Especially competing risks, where more than a couple of causes of failure occur and the occurrence of one event precludes the occurrence of the other events, are scattered in reliability field. But they are not utilized in the area of reliability or they are analysed in the wrong way. Specifically Kaplan-Meier method is used to calculate the probability of failure in the presence of competing risks, thereby overestimating the real probability of failure. Hence, cumulative incidence function is introduced. In addition, sample competing risks data are analysed using cumulative incidence function along with some graphs. Lastly we compare cumulative incidence functions with regression type analysis briefly. Results: We used cumulative incidence function to calculate the survival probability or failure probability in the presence of competing risks. We also drew some useful graphs depicting the failure trend over the lifetime. Conclusion: This research shows that Kaplan-Meier method is not appropriate for the evaluation of survival or failure over the course of lifetime in the presence of competing risks. Cumulative incidence function is shown to be useful in stead. Some graphs using the cumulative incidence functions are also shown to be informative.

패널자료의 무응답 대체법 (Non-Response Imputation for Panel Data)

  • 박기덕;신기일
    • Communications for Statistical Applications and Methods
    • /
    • 제17권6호
    • /
    • pp.899-907
    • /
    • 2010
  • 무응답 대체(non-response imputation) 방법에 관한 많은 이론과 방법이 제안되었으며 실제 자료 분석에 이용되고 있다. 흔히 횡단면 무응답 대체를 위하여 다중대체법(multiple imputation)이 사용되고 있으며 2차년도 이상의 패널자료에는 종시점회귀대체법(cross-wave regression imputation)이 사용되고 있다. 본 연구에서는 패널자료 분석을 위하여 종시점회귀대체법의 일반형태인 시계열 대체법과 횡단면 무응답 대체법을 결합한 시계열-횡단면 다중 대체법을 제안하였다. 노동부의 매월노동통계 자료를 이용하여 제안한 방법과 기존의 종시점회귀대체법을 비교하여 우수함을 보였다.