• 제목/요약/키워드: regression analysis method

검색결과 4,587건 처리시간 0.034초

Outlier Identification in Regression Analysis using Projection Pursuit

  • Kim, Hyojung;Park, Chongsun
    • Communications for Statistical Applications and Methods
    • /
    • 제7권3호
    • /
    • pp.633-641
    • /
    • 2000
  • In this paper, we propose a method to identify multiple outliers in regression analysis with only assumption of smoothness on the regression function. Our method uses single-linkage clustering algorithm and Projection Pursuit Regression (PPR). It was compared with existing methods using several simulated and real examples and turned out to be very useful in regression problem with the regression function which is far from linear.

  • PDF

통계해석에 의한 G/T 4톤급 연안어선의 유효마력 추정 (Prediction of Effective Horsepower for G/T 4 ton Class Coast Fishing Boat Using Statistical Analysis)

  • 박충환;심상목;조효제
    • 한국해양공학회지
    • /
    • 제23권6호
    • /
    • pp.71-76
    • /
    • 2009
  • This paper describes a statistical analysis method for predicting a coast fishing boat's effective horsepower. The EHP estimation method for small coast fishing boats was developed, based on a statistical regression analysis of model test results in a circulating water channel. The statistical regression formula of a fishing boat's effective horsepower is determined from the regression analysis of the resistance test results for 15 actual coast fishing boats. This method was applied to the effective horsepower prediction of a G/T 4 ton class coast fishing boat. From the estimation of the effective horsepower using this regression formula and the experimental model test of the G/T 4 ton class coast fishing boat, the estimation accuracy was verified under 10 percent of the design speed. However, the effective horsepower prediction method for coast fishing boats using the regression formula will be used at the initial design and hull-form development stage.

가중치 부여 방법에 따른 가중 비선형 회귀 쌍곡선법의 침하 예측 정확도 분석 (Settlement Prediction Accuracy Analysis of Weighted Nonlinear Regression Hyperbolic Method According to the Weighting Method)

  • 곽태영;우상인;홍성호;이주형;백성하
    • 한국지반공학회논문집
    • /
    • 제39권4호
    • /
    • pp.45-54
    • /
    • 2023
  • 설계 단계에서의 침하 예측은 주로 이론적 침하 예측 방법에 의해 수행되지만, 정확도의 문제로 인해 시공 단계에서는 주로 시간에 따른 침하량 계측 결과를 토대로 장래 침하량을 예측하는 계측 기반 침하 예측 방법을 적용하고 있다. 계측 기반 침하 예측 방법 중에서도 쌍곡선법이 주로 쓰이고 있으나 기존의 쌍곡선법은 정확도가 떨어지며 통계적 측면에서 한계점이 명확하기 때문에, 가중 비선형 회귀 분석 기반의 쌍곡선법이 제안된 바 있다. 본 연구에서는 가중 비선형 회귀 쌍곡선법에 두 가지 가중치 부여 방식을 적용하여 침하 예측 정확도를 비교 분석하였다. 부산 신항에 위치한 두 현장에서 측정한 지표침하판 데이터를 활용했으며, 회귀분석 구간을 전체 데이터에 30, 50, 70%로 설정해 나머지 구간의 침하를 예측했다. 그 결과, 가중치 부여 방식과 무관하게 쌍곡선법 기반의 침하 예측 방법은 모두 회귀 분석 구간이 증가할수록 정확도가 높게 나타났으며, 가중 비선형 회귀 쌍곡선법을 통해 기존 선형 회귀 쌍곡선법 보다 정확하게 침하를 예측할 수 있었다. 특히 더 작은 회귀분석 구간이 적용되었음에도 가중 비선형 회귀 쌍곡선법이 기존 선형 회귀 쌍곡선법에 비해 높은 침하 예측 성능을 보여, 가중 비선형 회귀 쌍곡선법을 통해 훨씬 빠르고 정확하게 침하량을 예측할 수 있음을 확인했다.

성향점수매칭 방법을 사용한 로지스틱 회귀분석에 관한 연구 (On Logistic Regression Analysis Using Propensity Score Matching)

  • 김소연;백종일
    • 한국신뢰성학회지:신뢰성응용연구
    • /
    • 제16권4호
    • /
    • pp.323-330
    • /
    • 2016
  • Purpose: Recently, propensity score matching method is used in a large number of research paper, nonetheless, there is no research using fitness test of before and after propensity score matching. Therefore, comparing fitness of before and after propensity score matching by logistic regression analysis using data from 'online survey of adolescent health' is the main significance of this research. Method: Data that has similar propensity in two groups is extracted by using propensity score matching then implement logistic regression analysis on before and after matching separately. Results: To test fitness of logistic regression analysis model, we use Model summary, -2Log Likelihood and Hosmer-Lomeshow methods. As a result, it is confirmed that the data after matching is more suitable for logistic regression analysis than data before matching. Conclusion: Therefore, better result which has appropriate fitness will be shown by using propensity score matching shows better result which has better fitness.

ON THEIL'S METHOD IN FUZZY LINEAR REGRESSION MODELS

  • Choi, Seung Hoe;Jung, Hye-Young;Lee, Woo-Joo;Yoon, Jin Hee
    • 대한수학회논문집
    • /
    • 제31권1호
    • /
    • pp.185-198
    • /
    • 2016
  • Regression analysis is an analyzing method of regression model to explain the statistical relationship between explanatory variable and response variables. This paper propose a fuzzy regression analysis applying Theils method which is not sensitive to outliers. This method use medians of rate of increment based on randomly chosen pairs of each components of ${\alpha}$-level sets of fuzzy data in order to estimate the coefficients of fuzzy regression model. An example and two simulation results are given to show fuzzy Theils estimator is more robust than the fuzzy least squares estimator.

회귀방정식과 PID제어기에 의한 DC모터 제어 (DC Motor Control using Regression Equation and PID Controller)

  • 서기영;이수흠;문상필;이내일;최종수
    • 융합신호처리학회 학술대회논문집
    • /
    • 한국신호처리시스템학회 2000년도 하계종합학술대회논문집
    • /
    • pp.129-132
    • /
    • 2000
  • We propose a new method to deal with the optimized auto-tuning for the PID controller which is used to the process -control in various fields. First of all, in this method, initial values of DC motor are determined by the Ziegler-Nichols method. Finally, after studying the parameters of PID controller by input vector of multiple regression analysis, when we give new K, L, T values to multiple regression model, the optimized parameters of PID controller is found by multiple regression analysis program.

  • PDF

Hybrid Fuzzy Least Squares Support Vector Machine Regression for Crisp Input and Fuzzy Output

  • Shim, Joo-Yong;Seok, Kyung-Ha;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • 제17권2호
    • /
    • pp.141-151
    • /
    • 2010
  • Hybrid fuzzy regression analysis is used for integrating randomness and fuzziness into a regression model. Least squares support vector machine(LS-SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate hybrid fuzzy linear and nonlinear regression models with crisp inputs and fuzzy output using weighted fuzzy arithmetic(WFA) and LS-SVM. LS-SVM allows us to perform fuzzy nonlinear regression analysis by constructing a fuzzy linear regression function in a high dimensional feature space. The proposed method is not computationally expensive since its solution is obtained from a simple linear equation system. In particular, this method is a very attractive approach to modeling nonlinear data, and is nonparametric method in the sense that we do not have to assume the underlying model function for fuzzy nonlinear regression model with crisp inputs and fuzzy output. Experimental results are then presented which indicate the performance of this method.

Regression analysis and recursive identification of the regression model with unknown operational parameter variables, and its application to sequential design

  • Huang, Zhaoqing;Yang, Shiqiong;Sagara, Setsuo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1990년도 한국자동제어학술회의논문집(국제학술편); KOEX, Seoul; 26-27 Oct. 1990
    • /
    • pp.1204-1209
    • /
    • 1990
  • This paper offers the theory and method for regression analysis of the regression model with operational parameter variables based on the fundamentals of mathematical statistics. Regression coefficients are usually constants related to the problem of regression analysis. This paper considers that regression coefficients are not constants but the functions of some operational parameter variables. This is a kind of method of two-step fitting regression model. The second part of this paper considers the experimental step numbers as recursive variables, the recursive identification with unknown operational parameter variables, which includes two recursive variables, is deduced. Then the optimization and the recursive identification are combined to obtain the sequential experiment optimum design with operational parameter variables. This paper also offers a fast recursive algorithm for a large number of sequential experiments.

  • PDF

라소를 이용한 간편한 주성분분석 (Simple principal component analysis using Lasso)

  • 박철용
    • Journal of the Korean Data and Information Science Society
    • /
    • 제24권3호
    • /
    • pp.533-541
    • /
    • 2013
  • 이 연구에서는 라소를 이용한 간편한 주성분분석을 제안한다. 이 방법은 다음의 두 단계로 구성되어 있다. 먼저 주성분분석에 의해 주성분을 구한다. 다음으로 각 주성분을 반응변수로 하고 원자료를 설명변수로 하는 라소 회귀모형에 의한 회귀계수 추정량을 구한다. 이 회귀계수 추정량에 기반한 새로운 주성분을 사용한다. 이 방법은 라소 회귀분석의 성질에 의해 회귀계수 추정량이 보다 쉽게 0이 될 수 있기 때문에 해석이 쉬운 장점이 있다. 왜냐하면 주성분을 반응변수로 하고 원자료를 설명변수로 하는 회귀모형의 회귀계수가 고유벡터가 되기 때문이다. 라소 회귀모형을 위한 R 패키지를 이용하여 모의생성된 자료와 실제 자료에 이 방법을 적용하여 유용성을 보였다.

Robustness of model averaging methods for the violation of standard linear regression assumptions

  • Lee, Yongsu;Song, Juwon
    • Communications for Statistical Applications and Methods
    • /
    • 제28권2호
    • /
    • pp.189-204
    • /
    • 2021
  • In a regression analysis, a single best model is usually selected among several candidate models. However, it is often useful to combine several candidate models to achieve better performance, especially, in the prediction viewpoint. Model combining methods such as stacking and Bayesian model averaging (BMA) have been suggested from the perspective of averaging candidate models. When the candidate models include a true model, it is expected that BMA generally gives better performance than stacking. On the other hand, when candidate models do not include the true model, it is known that stacking outperforms BMA. Since stacking and BMA approaches have different properties, it is difficult to determine which method is more appropriate under other situations. In particular, it is not easy to find research papers that compare stacking and BMA when regression model assumptions are violated. Therefore, in the paper, we compare the performance among model averaging methods as well as a single best model in the linear regression analysis when standard linear regression assumptions are violated. Simulations were conducted to compare model averaging methods with the linear regression when data include outliers and data do not include them. We also compared them when data include errors from a non-normal distribution. The model averaging methods were applied to the water pollution data, which have a strong multicollinearity among variables. Simulation studies showed that the stacking method tends to give better performance than BMA or standard linear regression analysis (including the stepwise selection method) in the sense of risks (see (3.1)) or prediction error (see (3.2)) when typical linear regression assumptions are violated.