• Title/Summary/Keyword: regression analysis method

Search Result 4,587, Processing Time 0.032 seconds

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.6 no.4
    • /
    • pp.277-281
    • /
    • 2006
  • Cyber counseling, one of the most compatible type of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require face-to-face interview between a counsellor and a client. However, there are few cyber counseling centers which provide high quality and trustworthy service, although the number of cyber counseling center has highly increased. Therefore, this paper is intended to enable an appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to stepwise regression analysis method in finding out a regression model. Stepwise regression analysis method, which has been generally used to analyze individual propensity in linear regression model, is not efficient since it is hard to select a proper model for its own defects. In this paper, based on the case database of current cyber counseling centers in the web, we will analyze clients' propensities using Bayesian variable selection to enable individually target counseling and to activate cyber counseling programs.

Support Vector Machine for Interval Regression

  • Hong Dug Hun;Hwang Changha
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.67-72
    • /
    • 2004
  • Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval linear and nonlinear regression models combining the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, the possibility and necessity models have been recently utilized, which are based on quadratic programming approach giving more diverse spread coefficients than a linear programming one. SVM also uses quadratic programming approach whose another advantage in interval regression analysis is to be able to integrate both the property of central tendency in least squares and the possibilistic property In fuzzy regression. However this is not a computationally expensive way. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high dimensional feature space. In particular, SVM is a very attractive approach to model nonlinear interval data. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function for interval nonlinear regression model with crisp inputs and interval output. Experimental results are then presented which indicate the performance of this algorithm.

  • PDF

Energy Regression Analysis for Economic Evaluation of Cooling Plants (냉방열원의 경제성 평가를 위한 건물에너지 회귀식 산출)

  • 김영섭;김강수
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • v.14 no.5
    • /
    • pp.377-384
    • /
    • 2002
  • For economic evaluation of cooling plant equipments, it is necessary to simplify energy Prediction method, which should includes efficiency corrected by part-load ratio. This study proposed simplified method with regression equations of time-average partial loads and refrigerator capacity. DOE-2 Program was used to carry out a parametric study of twelve design variables. Five input variables were considered to be significant and were used in the regression equations. To test accuracy of simplified method, calculated results were compared with DOE-2 simulated results. Test result showes a good agreement with the simulation result with an error of 5.9∼7.6%. It is expected that this method can be used as an easy prediction tool for comparing energy use of different cooling plants during the early design stage.

Estimation of Total Sound Pressure Level for Friction Noise Regarding a Driving Vehicle using the Extended Kalman Filter Algorithm (확장형 칼만필터 알고리즘을 활용한 차량 주행에 따른 마찰소음의 총 음압레벨 예측)

  • Dowan, Kim;Beomsoo, Han;Sungho, Mun;Deok-Soon, An
    • International Journal of Highway Engineering
    • /
    • v.16 no.5
    • /
    • pp.59-66
    • /
    • 2014
  • PURPOSES : This study is to predict the Sound Pressure Level(SPL) obtained from the Noble Close ProXimity(NCPX) method by using the Extended Kalman Filter Algorithm employing the taylor series and Linear Regression Analysis based on the least square method. The objective of utilizing EKF Algorithm is to consider stochastically the effect of error because the Regression analysis is not the method for the statical approach. METHODS : For measuring the friction noise between the surface and vehicle's tire, NCPX method was used. With NCPX method, SPL can be obtained using the frequency analysis such as Discrete Fourier Transform(DFT), Fast Fourier Transform(FFT) and Constant Percentage Bandwidth(CPB) Analysis. In this research, CPB analysis was only conducted for deriving A-weighted SPL from the sound power level in terms of frequencies. EKF Algorithm and Regression analysis were performed for estimating the SPL regarding the vehicle velocities. RESULTS : The study has shown that the results related to the coefficient of determination and RMSE from EKF Algorithm have been improved by comparing to Regression analysis. CONCLUSIONS : The more the vehicle is fast, the more the SPL must be high. But in the results of EKF Algorithm, SPLs are irregular. The reason of that is the EKF algorithm can be reflected by the error covariance from the measurements.

A comparative study of the Gini coefficient estimators based on the regression approach

  • Mirzaei, Shahryar;Borzadaran, Gholam Reza Mohtashami;Amini, Mohammad;Jabbari, Hadi
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.4
    • /
    • pp.339-351
    • /
    • 2017
  • Resampling approaches were the first techniques employed to compute a variance for the Gini coefficient; however, many authors have shown that an analysis of the Gini coefficient and its corresponding variance can be obtained from a regression model. Despite the simplicity of the regression approach method to compute a standard error for the Gini coefficient, the use of the proposed regression model has been challenging in economics. Therefore in this paper, we focus on a comparative study among the regression approach and resampling techniques. The regression method is shown to overestimate the standard error of the Gini index. The simulations show that the Gini estimator based on the modified regression model is also consistent and asymptotically normal with less divergence from normal distribution than other resampling techniques.

Bayesian Semi-Parametric Regression for Quantile Residual Lifetime

  • Park, Taeyoung;Bae, Wonho
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.4
    • /
    • pp.285-296
    • /
    • 2014
  • The quantile residual life function has been effectively used to interpret results from the analysis of the proportional hazards model for censored survival data; however, the quantile residual life function is not always estimable with currently available semi-parametric regression methods in the presence of heavy censoring. A parametric regression approach may circumvent the difficulty of heavy censoring, but parametric assumptions on a baseline hazard function can cause a potential bias. This article proposes a Bayesian semi-parametric regression approach for inference on an unknown baseline hazard function while adjusting for available covariates. We consider a model-based approach but the proposed method does not suffer from strong parametric assumptions, enjoying a closed-form specification of the parametric regression approach without sacrificing the flexibility of the semi-parametric regression approach. The proposed method is applied to simulated data and heavily censored survival data to estimate various quantile residual lifetimes and adjust for important prognostic factors.

Categorical Analysis for the Factors of Incustrial Accident Cases (산업재해 사례인자의 범주형 분석)

  • Jhee, Kyung-Tek;Song, Young-Ho;Chung, Kook-Sam
    • Journal of the Korean Society of Safety
    • /
    • v.17 no.1
    • /
    • pp.94-98
    • /
    • 2002
  • This study aimed to search for the fundamental accident causes using a categorical analysis, a kind of statistical methods. As the analysis methods, correlation analysis, independence test and logistic regression analysis were used. And the SPSS package, a general-purpose mathematical library, was used to obtain statistical characteristics. As the result of this study, the accident causes associated with factor of 'lost working days' were factors such as 'employed periods', 'sex', 'type of accident', 'month'. In case of applying independence test method, the most important cause was the factor of 'month'. In case that logistic regression analysis method was applied, the cause contributed to the increase structure'. 'less than 6 month'. On the basis of these results, the plan for accident prevention and the proper investment for accident prevention expenditure could be carried out in each workshop.

Calibration of the Ridge Regression Model with the Genetic Algorithm:Study on the Regional Flood Frequency Analysis (유전알고리즘을 이용한 능형회귀모형의 검정 : 빈도별 홍수량의 지역분석을 대상으로)

  • Seong, Gi-Won
    • Journal of Korea Water Resources Association
    • /
    • v.31 no.1
    • /
    • pp.59-69
    • /
    • 1998
  • A regression model with basin physiographic characteristics as independent variables was calibrated for regional flood frequency analysis. In case that high correlations existing among the independent variables the ridge regression has been known to have capability of overcoming the problems of multicollinearity. To optimize the ridge regression model the cost function including regularization parameter must be minimized. In this research the genetic algorithm was applied on this optimization problem. The genetic algorithm is a stochastic search method that mimic the metaphor of natural biological heredity. Using this method the regression model could have optimized and stable weights of variables.

  • PDF

Competing Risks Regression Analysis (경쟁적 위험하에서의 회귀분석)

  • Baik, Jaiwook
    • Journal of Applied Reliability
    • /
    • v.18 no.2
    • /
    • pp.130-142
    • /
    • 2018
  • Purpose: The purpose of this study is to introduce regression method in the presence of competing risks and to show how you can use the method with hypothetical data. Methods: Survival analysis has been widely used in biostatistics division. But the same method has not been utilized in reliability division. Especially competing risks, where more than a couple of causes of failure occur and the occurrence of one event precludes the occurrence of the other events, are scattered in reliability field. But they are not utilized in the area of reliability or they are analysed in the wrong way. Specifically Kaplan-Meier method is used to calculate the probability of failure in the presence of competing risks, thereby overestimating the real probability of failure. Hence, cumulative incidence function is introduced. In addition, sample competing risks data are analysed using cumulative incidence function along with some graphs. Lastly we compare cumulative incidence functions with regression type analysis briefly. Results: We used cumulative incidence function to calculate the survival probability or failure probability in the presence of competing risks. We also drew some useful graphs depicting the failure trend over the lifetime. Conclusion: This research shows that Kaplan-Meier method is not appropriate for the evaluation of survival or failure over the course of lifetime in the presence of competing risks. Cumulative incidence function is shown to be useful in stead. Some graphs using the cumulative incidence functions are also shown to be informative.

Non-Response Imputation for Panel Data (패널자료의 무응답 대체법)

  • Pak, Gi-Deok;Shin, Key-Il
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.6
    • /
    • pp.899-907
    • /
    • 2010
  • Several non-response imputation methods are suggested, however, mainly cross-sectional imputations are studied and applied to this analysis. A simple and common imputation method for panel data is the cross-wave regression imputation or carry-over imputation as a special case of cross-wave regression imputation. This study suggests a multiple imputation method combined time series analysis and cross-sectional multiple imputation method. We compare this method and the cross-wave regression imputation method using MSE, MAE, and Bias. The 2008 monthly labor survey data is used for this study.