• Title/Summary/Keyword: 확률적 회귀모형

Search Result 184, Processing Time 0.026 seconds

Derivation of Probability Plot Correlation Coefficient Test Statistics and Regression Equation for the GEV Model based on L-moments (L-모멘트 법 기반의 GEV 모형을 위한 확률도시 상관계수 검정 통계량 유도 및 회귀식 산정)

  • Ahn, Hyunjun;Jeong, Changsam;Heo, Jun-Haeng
    • Journal of Korean Society of Disaster and Security
    • /
    • v.13 no.1
    • /
    • pp.1-11
    • /
    • 2020
  • One of the important problem in statistical hydrology is to estimate the appropriated probability distribution for a given sample data. For the problem, a goodness-of-fit test is conducted based on the similarity between estimated probability distribution and assumed theoretical probability distribution. Probability plot correlation coefficient test (PPCC) is one of the goodness-of-fit test method. PPCC has high rejection power and its application is simple. In this study, test statistics of PPCC were derived for generalized extreme value distribution (GEV) models based on L-moments and these statistics were suggested by the multiple and nonlinear regression equations for its usability. To review the rejection power of the newly proposed method in this study, Monte Carlo simulation was performed with other goodness-of-fit tests including the existing PPCC test. The results showed that PPCC-A test which is proposed in this study demonstrated better rejection power than other methods, including the existing PPCC test. It is expected that the new method will be helpful to estimate the appropriate probability distribution model.

Functional regression approach to traffic analysis (함수회귀분석을 통한 교통량 예측)

  • Lee, Injoo;Lee, Young K.
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.773-794
    • /
    • 2021
  • Prediction of vehicle traffic volume is very important in planning municipal administration. It may help promote social and economic interests and also prevent traffic congestion costs. Traffic volume as a time-varying trajectory is considered as functional data. In this paper we study three functional regression models that can be used to predict an unseen trajectory of traffic volume based on already observed trajectories. We apply the methods to highway tollgate traffic volume data collected at some tollgates in Seoul, Chuncheon and Gangneung. We compare the prediction errors of the three models to find the best one for each of the three tollgate traffic volumes.

Development of Calibration Equation Considering Uncertainty of Rainfall-Runoff Model (강우-유출 모형의 불확실성을 고려한 확률홍수량의 보정식 개발)

  • Chae, Byung-Seok;Park, Dong-Hyeok;Lee, Jin-Young;Kim, Tae-Woong
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.396-396
    • /
    • 2017
  • 최근 기후변화에 효과적으로 대응하기 위해서 신뢰성 높은 설계홍수량을 산정할 필요성이 커지고 있다. 설계홍수량 산정법은 홍수빈도해석법과 설계강우법으로 대별된다. 홍수빈도해석법은 홍수량 자료에 대한 통계학적 빈도분석을 실시하여 확률홍수량을 산정하는 방법이다. 홍수빈도해석법은 관측된 자료를 활용하기 때문에 이론적으로 불확실성이 상대적으로 작은 장점을 가지고 있지만, 자료의 수가 적거나 시간에 따라 변하는 유역특성에 대한 불확실성을 함께 고려해야 한다. 관측 유량 자료가 없거나 적은 유역에서는 설계강우법이 주로 사용되고 있다. 설계강우법은 강우자료에 대해서 빈도분석을 실시하여 확률강우량을 산정한 후, 이를 강우-유출 모형에 적용하여 확률 홍수 수문곡선을 작성하고 첨두치를 확률홍수량으로 선정하는 방법이다. 그러나, 설계강우법도 강우-유출 모형에서 유역특성을 나타내는 매개변수 추정과정에서 불확실성을 내포하고 있기 때문에 추정된 홍수량 결과에 대한 불확실성을 감안해야 한다. 또한, 강우량과 홍수량의 발생빈도가 같다는 가정의 명확한 근거가 없다. 더욱이 두 가지 설계홍수량 산정법을 같은 유역에 적용하는 경우라도 종종 매우 다른 결과값을 나타낸다. 따라서, 본 연구에서는 국내 유역의 현실을 고려하여 설계강우법으로 산정된 확률홍수량을 홍수빈도해석법으로 산정된 확률홍수량을 변환할 수 있는 보정식을 개발하였다. 국내 9개의 댐 유역에서 확보된 일 단위 강우량 및 유출량 자료를 홍수빈도해석법과 설계강우법을 적용하여 대상 유역의 설계홍수량을 산정하였다. 그리고, 홍수빈도해석법으로 산정된 설계홍수량을 참값이라 가정한 후, 산정된 설계홍수량의 대상 유역별 오차율을 산정하였다. 이를 바탕으로 홍수빈도해석법과 설계강우법으로 산정된 설계홍수량 간의 관계를 회귀분석을 통하여 설계강우법으로 산정된 확률홍수량을 보정하는 관계식을 제시하였다.

  • PDF

Analysis of Bus Accidents Influential Factors on Bus Exclusive Lane in Seoul (Bus Median Lane and Bus Curb Lane Defined) (서울시 버스전용차로구간의 버스사고 영향요인 분석 연구 (중앙전용차로 및 가로변전용차로 구분))

  • Lim, Jun-Beom;Hong, Ji-Yeon;Chang, Il-Jun;Park, Jun-Tae
    • International Journal of Highway Engineering
    • /
    • v.14 no.2
    • /
    • pp.145-155
    • /
    • 2012
  • At present, Seoul City is putting the bus exclusive lane system into practice according to mass transit revitalization policy. Starting with the installation of roadside bus exclusive lane in the past, at present, even the road sections for central- lane bus exclusive lane system are on the increase. The purpose of this research is to analyze the factors giving impacts on bus accident on central bus exclusive lane and roadside bus exclusive lane. In case of the central bus exclusive lane, the 6 variables, such as the number of bus routes, number of access & entrance to central lanes patterns, whether the stop line of central lanes retreats or not, separated distance between the stop line of central lanes and crosswalks, traffic volume, and number of bus routes stopping at bus stops on reversible lanes, were found to have a significant influence on bus accidents. In case of roadside bus exclusive lane sections, the four variables such as the number of right-turn bus routes, whether to be chronic illegal parking & stopping, time for the walk signal, and forms of land use, etc. were found to have a significant influence on bus accident.

Bayesian inference of longitudinal Markov binary regression models with t-link function (t-링크를 갖는 마코프 이항 회귀 모형을 이용한 인도네시아 어린이 종단 자료에 대한 베이지안 분석)

  • Sim, Bohyun;Chung, Younshik
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.47-59
    • /
    • 2020
  • In this paper, we present the longitudinal Markov binary regression model with t-link function when its transition order is known or unknown. It is assumed that logit or probit models are considered in binary regression models. Here, t-link function can be used for more flexibility instead of the probit model since the t distribution approaches to normal distribution as the degree of freedom goes to infinity. A Markov regression model is considered because of the longitudinal data of each individual data set. We propose Bayesian method to determine the transition order of Markov regression model. In particular, we use the deviance information criterion (DIC) (Spiegelhalter et al., 2002) of possible models in order to determine the transition order of the Markov binary regression model if the transition order is known; however, we compute and compare their posterior probabilities if unknown. In order to overcome the complicated Bayesian computation, our proposed model is reconstructed by the ideas of Albert and Chib (1993), Kuo and Mallick (1998), and Erkanli et al. (2001). Our proposed method is applied to the simulated data and real data examined by Sommer et al. (1984). Markov chain Monte Carlo methods to determine the optimal model are used assuming that the transition order of the Markov regression model are known or unknown. Gelman and Rubin's method (1992) is also employed to check the convergence of the Metropolis Hastings algorithm.

The wage determinants applying sample selection bias (표본선택 편의를 반영한 임금결정요인 분석)

  • Park, Sungik;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1317-1325
    • /
    • 2016
  • The purpose of this paper is to explain the factors affecting the wage of the vocational high school graduates. We particularly examine the effectiveness of controlling sample selection bias by employing the Tobit model and Heckman sample selection model. The major results are as follows. First it is shown that the Tobit model and Heckman sample selection model controlling sample selection bias is statistically significant. Hence all the independent variables seem to be statistically consistent with the theoretical model. Second, gender was statistically significant, both in the probability of employment and the wage. Third, the employment probability and wage of Maester high school graduates were shown to be high compared to all other graduates. Fourth, the higher parent's income, the higher are both the employment probability and the wage. Finally, parents education level, high school grade, satisfaction, and a number of licenses were found to be statistically significant, both in the probability of employment and wages.

Onion yield estimation using spatial panel regression model (공간 패널 회귀모형을 이용한 양파 생산량 추정)

  • Choi, Sungchun;Baek, Jangsun
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.5
    • /
    • pp.873-885
    • /
    • 2016
  • Onions are grown in a few specific regions of Korea that depend on the climate and the regional characteristic of the production area. Therefore, when onion yields are to be estimated, it is reasonable to use a statistical model in which both the climate and the region are considered simultaneously. In this paper, using a spatial panel regression model, we predicted onion yields with the different weather conditions of the regions. We used the spatial auto regressive (SAR) model that reflects the spatial lag, and panel data of several climate variables for 13 main onion production areas from 2006 to 2015. The spatial weight matrix was considered for the model by the threshold value method and the nearest neighbor method, respectively. Autocorrelation was detected to be significant for the best fitted model using the nearest neighbor method. The random effects model was chosen by the Hausman test, and the significant climate variables of the model were the cumulative duration time of sunshine (January), the average relative humidity (April), the average minimum temperature (June), and the cumulative precipitation (November).

Estimation of Advertising Exposure Distribution by Zero-inflation Regression Models (영과잉 회귀모형을 이용한 광고노출분포 추정)

  • Lee, Dong-Hee
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2841-2852
    • /
    • 2018
  • This study examines regression modeling method using zero-inflated distribution in relation to estimation of exposure distribution required in advertisement media planning. Exposure distribution is the percentage of audiences that are exposed each time the ad is repeated. Such an exposure distribution plays a very important role in providing basic information necessary for calculating various indicators for quantitatively measuring the advertising effect. Especially, due to the decrease of advertising price and the spread of various media, the frequency of the advertisement or the broadcasting of specific advertisements has been greatly increased compared to the past. As a result, the frequency of exposure is relatively decreasing. In this situation, the number of individuals who are not exposed to the media, that is, are not exposed to advertising structurally is increasing. This research proposes advertising exposure distribution models using a zero-inflated regression model, and conducts a comparative study using actual cases.

Developing the Traffic Accident Prediction Model using Classification And Regression Tree Analysis (CART분석을 이용한 교통사고예측모형의 개발)

  • Lee, Jae-Myung;Kim, Tae-Ho;Lee, Yong-Taeck;Won, Jai-Mu
    • International Journal of Highway Engineering
    • /
    • v.10 no.1
    • /
    • pp.31-39
    • /
    • 2008
  • Preventing the traffic accident by accurately predicting it in advance can greatly improve road traffic safety. The accurate traffic accident prediction model requires not only understanding of the factors that cause the accident but also having the transferability of the model. So, this paper suggest the traffic accident diagram using CART(Classification And Regression Tree) analysis, developed Model is compared with the existing accident prediction models in order to test the goodness of fit. The results of this study are summarized below. First, traffic accident prediction model using CART analysis is developed. Second, distance(D), pedestrian shoulder(m) and traffic volume among the geometrical factors are the most influential to the traffic accident. Third. CART analysis model show high predictability in comparative analysis between models. This study suggest the basic ideas to evaluate the investment priority for the road design and improvement projects of the traffic accident blackspots.

  • PDF

Marginal Effect Analysis of Travel Behavior by Count Data Model (가산자료모형을 기초로 한 통행행태의 한계효과분석)

  • 장태연
    • Journal of Korean Society of Transportation
    • /
    • v.21 no.3
    • /
    • pp.15-22
    • /
    • 2003
  • In general, the linear regression model has been used to estimate trip generation in the travel demand forecasting procedure. However, the model suffers from several methodological limitations. First, trips as a dependent variable with non-negative integer show discrete distribution but the model assumes that the dependent variable is continuously distributed between -$\infty$ and +$\infty$. Second, the model may produce negative estimates. Third, even if estimated trips are within the valid range, the model offers only forecasted trips without discrete probability distribution of them. To overcome these limitations, a poisson model with a assumption of equidispersion has frequently been used to analyze count data such as trip frequencies. However, if the variance of data is greater than the mean. the poisson model tends to underestimate errors, resulting in unreliable estimates. Using overdispersion test, this study proved that the poisson model is not appropriate and by using Vuong test, zero inflated negative binomial model is optimal. Model reliability was checked by likelihood test and the accuracy of model by Theil inequality coefficient as well. Finally, marginal effect of the change of socio-demographic characteristics of households on trips was analyzed.