• Title/Abstract/Keywords: Regression models

3,638 search results (processing time: 0.027 s)

Assessment of slope stability using multiple regression analysis

  • Marrapu, Balendra M.;Jakka, Ravi S.
    • Geomechanics and Engineering / Vol. 13, No. 2 / pp. 237-254 / 2017
  • Estimating slope stability is an important task in geotechnical engineering, but both conventional and soft computing methods have drawbacks. Conventional limit equilibrium methods are tedious and time consuming, while soft computing approaches such as Artificial Neural Networks and Fuzzy Logic are black boxes. Multiple Regression (MR) analysis provides an alternative to both. MR models provide a simplified equation that can be used to calculate the critical factor of safety of a slope without any iterative procedure, thereby reducing the time and complexity involved in evaluating slope stability. In the present study, a multiple regression model has been developed and its accuracy in estimating slope stability tested using real field data. Two separate multiple regression models have been developed, one for dry slopes and one for wet slopes. The accuracy of the developed models has been compared and validated against conventional limit equilibrium methods in terms of Mean Square Error (MSE) and coefficient of determination ($R^2$). Because the developed MR models are not based on any region-specific data and cover a wide range of parametric variations, they can be applied directly to any real slope.

고차원 선형 및 로지스틱 회귀모형에 대한 변분 베이즈 방법 소개 (Introduction to variational Bayes for high-dimensional linear and logistic regression models)

  • 장인송;이경재
    • 응용통계연구 / Vol. 35, No. 3 / pp. 445-455 / 2022
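  • This paper introduces existing Bayesian methods for high-dimensional sparse regression and compares their performance across a range of simulation settings. It focuses in particular on the variational Bayes method (Ray and Szabó, 2021), which enables scalable and accurate Bayesian inference. Sparse high-dimensional linear regression is carried out on simulated data, and the performance of the variational Bayes method is compared with that of other Bayesian and frequentist methods. To assess the practical performance of the variational Bayes method in logistic regression, a real-data analysis is performed using leukemia gene expression data.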

토빗모형을 이용한 교차로 보행자 사고모형 개발 (Developing the Pedestrian Accident Models of Intersections using Tobit Model)

  • 이승주;임진강;박병호
    • 한국안전학회지 / Vol. 29, No. 5 / pp. 154-159 / 2014
  • This study deals with pedestrian accidents at intersections in Cheongju. The objective is to develop pedestrian accident models using the Tobit regression model. To this end, pedestrian accident data from 2007 to 2011 were collected from the TAAS data set of the Road Traffic Authority. Poisson, negative binomial, and Tobit regression models were used to analyze the accidents. The dependent variable was the number of accidents per intersection. The independent variables were traffic volume, intersection geometric structure, and transportation facilities. The main results are as follows. First, the Tobit model was judged to be more appropriate than the other models, and the models were found to be statistically significant. Second, the main accident-related variables adopted in the model were traffic volume, pedestrian volume, number of traffic islands, crossing length, and pedestrian countdown signal systems.
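
The abstract above does not state the Tobit specification used, so the following is only a minimal sketch of a standard Tobit log-likelihood, assuming left-censoring at zero and normally distributed errors. The function names, the `lower` argument, and the toy data are all hypothetical and chosen for illustration.

```python
import math

def normal_logpdf(z):
    """Log density of the standard normal distribution at z."""
    return -0.5 * z * z - 0.5 * math.log(2.0 * math.pi)

def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def tobit_loglik(y, X, beta, sigma, lower=0.0):
    """Log-likelihood of a Tobit model, assumed left-censored at `lower`."""
    total = 0.0
    for yi, xi in zip(y, X):
        mu = sum(b * x for b, x in zip(beta, xi))  # linear predictor
        if yi <= lower:
            # Censored observation: probability the latent value is <= lower.
            total += math.log(normal_cdf((lower - mu) / sigma))
        else:
            # Uncensored observation: normal density of the residual.
            total += normal_logpdf((yi - mu) / sigma) - math.log(sigma)
    return total

# Hypothetical toy data: an intercept column plus one covariate.
X = [[1.0, 0.0], [1.0, 1.0]]
y = [0.0, 1.0]
ll = tobit_loglik(y, X, beta=[0.0, 1.0], sigma=1.0)
```

In practice this function would be passed to a numerical optimizer to estimate `beta` and `sigma`; the sketch only evaluates the likelihood at given values.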

영과잉 회귀모형에 대한 베이지안 분석 (Bayesian Analysis for the Zero-inflated Regression Models)

  • 장학진;강윤회;이수범;김성욱
    • 응용통계연구 / Vol. 21, No. 4 / pp. 603-613 / 2008
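  • Some discrete count data contain far more zeros than a standard model would predict. In such cases, analysis with ordinary regression models such as Poisson or negative binomial regression is not appropriate. This paper presents a Bayesian analysis of the zero-inflated Poisson and zero-inflated negative binomial regression models. Model selection is carried out using Bayes factors computed by Markov chain Monte Carlo methods. An analysis of real traffic accident data supports the theoretical results.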
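
To make the idea of zero inflation concrete, here is a minimal sketch of the zero-inflated Poisson probability mass function in its common mixture form. The parameter names `pi` (probability of a structural zero) and `lam` (Poisson mean) are assumptions for illustration; the abstract does not give the paper's parameterization or priors.

```python
import math

def poisson_pmf(k, lam):
    """Ordinary Poisson probability of observing count k."""
    return math.exp(-lam) * lam**k / math.factorial(k)

def zip_pmf(k, pi, lam):
    """Zero-inflated Poisson: a point mass at zero mixed with a Poisson.

    With probability pi the count is a structural zero; otherwise it is
    drawn from Poisson(lam), which can itself produce a zero.
    """
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base

# Hypothetical parameter values, chosen only for illustration.
pi, lam = 0.3, 2.0
p_zero_plain = poisson_pmf(0, lam)   # zero probability under plain Poisson
p_zero_zip = zip_pmf(0, pi, lam)     # zero probability with inflation
mean_zip = sum(k * zip_pmf(k, pi, lam) for k in range(60))  # approx (1 - pi) * lam
```

Comparing `p_zero_plain` with `p_zero_zip` shows how the extra point mass raises the share of zeros while lowering the mean to `(1 - pi) * lam`.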

Tree-Structured Nonlinear Regression

  • Chang, Young-Jae;Kim, Hyeon-Soo
    • 응용통계연구 / Vol. 24, No. 5 / pp. 759-768 / 2011
  • Tree algorithms have been widely developed for regression problems. One advantage of a regression tree is its flexibility of fit, which allows it to capture nonlinearity in the data. In particular, data with sudden structural breaks, such as oil prices and exchange rates, can be fitted well with a simple mixture of a few piecewise linear regression models. Because split points are determined by chi-squared statistics based on the residuals from fitting piecewise linear models, and the split variable is chosen by an objective criterion, the resulting fit is reasonable and agrees with a visual interpretation of the data. Piecewise linear regression by a regression tree can serve as a good fitting method and can be applied to data sets with large fluctuations.

경쟁 위험 회귀 모형의 이해와 추정 방법 (Estimation methods and interpretation of competing risk regression models)

  • 김미정
    • 응용통계연구 / Vol. 29, No. 7 / pp. 1231-1246 / 2016
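  • The methods most commonly used in competing risks research are the cause-specific hazard model and the proportional hazards model based on the subdistribution. Many other models have been proposed since, but they are rarely used because their estimation methods lack interpretability or are difficult to implement as algorithms. This paper introduces the cause-specific hazard model, the subdistribution proportional hazards model, and the more recently proposed direct binomial regression model, absolute risk regression model, and proportional odds model of Eriksson et al. (2015), and briefly explains their estimation methods. For each model, usage with SAS and R is presented, and a data set with two competing risks is analyzed using R.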

근적외 스펙트럼을 이용한 정량분석용 최적 주성분회귀모델을 얻기 위한 알고리듬 (Algorithm for Finding the Best Principal Component Regression Models for Quantitative Analysis using NIR Spectra)

  • 조정환
    • Journal of Pharmaceutical Investigation / Vol. 37, No. 6 / pp. 377-395 / 2007
  • Near-infrared (NIR) spectral data have been used for the noninvasive analysis of various biological samples. However, absorption bands in the NIR region overlap extensively, so it is very difficult to select the wavelengths that give the best principal component regression (PCR) models for analyzing the constituents of biological samples. The NIR data were used after polynomial smoothing and first-order differentiation with Savitzky-Golay filters. To find the best PCR models, all possible combinations of the available principal components from the given NIR spectral data were generated by in-house programs written in MATLAB. All of the generated PCR models were compared in terms of SEC (standard error of calibration), $R^2$, SEP (standard error of prediction), and SECP (standard error of calibration and prediction) to find the best combination of principal components of the initial PCR models. The initial PCR models were found by SEC or Malinowski's indicator function, and a priori selection of spectral points was examined in terms of the correlation coefficients between the NIR data at each wavelength and the corresponding concentrations. To test the developed program, aqueous solutions of BSA (bovine serum albumin) and glucose were prepared and analyzed. The best PCR models were found using a priori selection of spectral points and final model selection by SEP or SECP.

Design models for predicting shear resistance of studs in solid concrete slabs based on symbolic regression with genetic programming

  • Degtyarev, Vitaliy V.;Hicks, Stephen J.;Hajjar, Jerome F.
    • Steel and Composite Structures / Vol. 43, No. 3 / pp. 293-309 / 2022
  • Accurate design models for predicting the shear resistance of headed studs in solid concrete slabs are essential for obtaining economical and safe steel-concrete composite structures. In this study, symbolic regression with genetic programming (GPSR) was applied to experimental data to formulate new descriptive equations for predicting the shear resistance of studs in solid slabs using both normal and lightweight concrete. The obtained GPSR-based nominal resistance equations demonstrated good agreement with the test results. The equations indicate that the stud shear resistance is insensitive to the secant modulus of elasticity of concrete, which has been included in many international standards following the pioneering work of Ollgaard et al. In contrast, it increases when the stud height-to-diameter ratio increases, which is not reflected by the design models in the current international standards. The nominal resistance equations were subsequently refined for use in design from reliability analyses to ensure that the target reliability index required by the Eurocodes was achieved. Resistance factors for the developed equations were also determined following US design practice. The stud shear resistance predicted by the proposed models was compared with the predictions from 13 existing models. The accuracy of the developed models exceeds the accuracy of the existing equations. The proposed models produce predictions that can be used with confidence in design, while providing significantly higher stud resistances for certain combinations of variables than those computed with the existing equations given by many standards.

실시간 수위 예측을 위한 다중선형회귀 모형의 비교 (Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting)

  • 최승용;한건연;김병현
    • 대한토목학회논문집 / Vol. 32, No. 1B / pp. 9-20 / 2012
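  • To overcome the shortcomings of conceptual, hydrological, and physically based models for stage forecasting, multiple linear regression models, a type of data-driven model, have recently been widely adopted for flood forecasting. The purpose of this study is to compare the flood forecasting performance of multiple linear regression models under different methods of estimating the regression coefficients, and thereby to build an appropriate multiple regression flood forecasting model. To this end, the time scale of the independent variables was determined by autocorrelation analysis of the input data; flood forecasting models were then built using three coefficient estimation methods (ordinary least squares, weighted least squares, and stepwise selection) and applied to various flood events in the Jungnangcheon basin. Four statistical indices were used to evaluate the models: root mean square error, Nash-Sutcliffe efficiency coefficient, mean absolute error, and adjusted coefficient of determination. In the simulations, the multiple linear regression model using stepwise selection gave the most accurate forecasts, and the model using ordinary least squares gave somewhat better forecasts than the model using weighted least squares.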
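
The four evaluation indices named in the abstract above can be sketched as follows, using their textbook definitions. The function names and the toy stage values are hypothetical, and the adjusted coefficient of determination is computed here from 1 - SSE/SST, which is an assumption about how the paper defines it.

```python
import math

def rmse(obs, sim):
    """Root mean square error between observed and simulated values."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))

def mae(obs, sim):
    """Mean absolute error."""
    return sum(abs(o - s) for o, s in zip(obs, sim)) / len(obs)

def nash_sutcliffe(obs, sim):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the mean."""
    mean_obs = sum(obs) / len(obs)
    sse = sum((o - s) ** 2 for o, s in zip(obs, sim))
    sst = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - sse / sst

def adjusted_r2(obs, sim, n_predictors):
    """Adjusted R^2, penalizing the number of predictors (assumed form)."""
    n = len(obs)
    r2 = nash_sutcliffe(obs, sim)  # 1 - SSE/SST
    return 1.0 - (1.0 - r2) * (n - 1) / (n - n_predictors - 1)

# Hypothetical observed and forecast water stages (m), for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
forecast = [1.1, 1.9, 3.2, 3.8]
scores = {
    "rmse": rmse(observed, forecast),
    "mae": mae(observed, forecast),
    "nse": nash_sutcliffe(observed, forecast),
    "adj_r2": adjusted_r2(observed, forecast, n_predictors=1),
}
```

Lower RMSE and MAE and higher efficiency and adjusted $R^2$ indicate a better forecast, which is how competing coefficient estimation methods could be ranked.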

LACTATION CURVE OF HOLSTEIN FRIESIAN COWS IN THE KINGDOM OF SAUDI ARABIA

  • Ali, A.K.A.;Al-Jumaah, R.S.;Hayes, E.
    • Asian-Australasian Journal of Animal Sciences / Vol. 9, No. 4 / pp. 439-447 / 1996
  • Monthly test-day production data, comprising 12,020 records, were collected from six of the largest specialized dairy farms in the central region of the Kingdom of Saudi Arabia. The records described lactating cows in four parities and two seasons of calving. Monthly test-day records were fitted using Wood's model $A t^{b} e^{-ct}$ with multiplicative and additive error terms. Linear and non-linear regression models were used to estimate the parameters needed to draw the lactation curves. The lactation curves of the different parities showed that the third lactation had the highest peak (43.08 kg for the linear regression model and 42.08 kg for the non-linear regression model). The fourth lactation had the lowest peak (24.00 kg for the linear regression model and 25.64 kg for the non-linear regression model). Cows in their second and third lactations reached the peak at day 58 under both the linear and non-linear regression models. First-lactation cows were more persistent and peaked late, at days 68 and 67 for the two models, respectively, whereas third-lactation cows were less persistent and peaked early, at day 58 under both models. Cows that calved in winter months had higher starting values (A), a higher ascending slope (b), and a higher descending slope (c). Least-squares means of milk yield for the first four parities and for the overall data were 6,653, 7,659, 7,482, 6,988, and 7,614 kg, respectively. The corresponding lactation periods were 358, 367, 350, 363, and 364 days, respectively.
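
As a rough illustration of the linear-regression route for Wood's model, the sketch below assumes the error enters multiplicatively, so that taking logarithms gives ln y = ln A + b ln t - c t, which can be fitted by ordinary least squares. The peak day b/c follows from setting the derivative of the curve to zero. The parameter values and test days are hypothetical and are not taken from the study.

```python
import math

def wood(t, A, b, c):
    """Wood's lactation curve: milk yield at day t."""
    return A * t**b * math.exp(-c * t)

def peak_day(b, c):
    """Day of peak yield, from d/dt [A t^b e^(-ct)] = 0."""
    return b / c

def solve3(M, v):
    """Solve a 3x3 linear system by Gauss-Jordan elimination with pivoting."""
    n = 3
    aug = [row[:] + [v[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def fit_wood_loglinear(days, yields):
    """Least squares on ln y = ln A + b ln t - c t via the normal equations."""
    X = [[1.0, math.log(t), -t] for t in days]
    z = [math.log(y) for y in yields]
    XtX = [[sum(r[i] * r[j] for r in X) for j in range(3)] for i in range(3)]
    Xtz = [sum(r[i] * zi for r, zi in zip(X, z)) for i in range(3)]
    lnA, b, c = solve3(XtX, Xtz)
    return math.exp(lnA), b, c

# Hypothetical parameters, used only to generate noise-free test-day yields.
days = [15, 45, 75, 105, 135, 165, 195, 225, 255, 285]
true_A, true_b, true_c = 20.0, 0.2, 0.004
yields = [wood(t, true_A, true_b, true_c) for t in days]

A, b, c = fit_wood_loglinear(days, yields)
peak = peak_day(b, c)
```

With an additive error term the model is no longer linear after taking logs, so the parameters would instead be estimated by iterative non-linear least squares, which this sketch does not cover.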