• Title/Summary/Keyword: multiple linear regression model (다중선형 회귀모형)


Statistical review and explanation for Lanchester model (란체스터 모형에 대한 통계적 고찰과 해석)

  • Yoo, Byung Joo
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.335-345
    • /
    • 2020
  • This paper addresses the estimation of a log-transformed linear regression model that fits actual battle data from the Ardennes Campaign of World War II to the Lanchester model. By examining the results of previous studies on these data, it identifies and corrects two problems: determining a global solution for the parameters, and multicollinearity. The least squares method requires care because imposing a specific constraint or restricting the search to a limited set of candidates can yield a local solution rather than a global one. Multicollinearity can be diagnosed with a statistic known as the variance inflation factor. To avoid these problems, the Lanchester model is simplified, and a combat power attrition rate model is proposed that is statistically significant and easy to interpret. When the model was fitted, autocorrelation introduced dependence among the observations. The resulting risk of underestimation or overestimation was resolved with the Cochrane-Orcutt method, which also ensured independence and normality.
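
The Cochrane-Orcutt step named in the abstract can be sketched for a single-predictor regression as below. This is a minimal illustration under stated assumptions, not the paper's implementation: the data are simulated with AR(1) errors rather than taken from the Ardennes Campaign, and the helper names `ols`, `cochrane_orcutt`, and `simulate` are hypothetical.

```python
import random


def ols(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return mean_y - slope * mean_x, slope


def cochrane_orcutt(xs, ys, tol=1e-6, max_iter=50):
    """Iteratively estimate (intercept, slope, rho) under AR(1) errors."""
    intercept, slope = ols(xs, ys)
    rho = 0.0
    for _ in range(max_iter):
        # Residuals on the original scale with the current coefficients.
        resid = [y - intercept - slope * x for x, y in zip(xs, ys)]
        # Lag-1 autocorrelation estimate of the residuals.
        num = sum(resid[t] * resid[t - 1] for t in range(1, len(resid)))
        den = sum(e * e for e in resid[:-1])
        new_rho = num / den
        # Quasi-difference the data and refit by OLS.
        x_star = [xs[t] - new_rho * xs[t - 1] for t in range(1, len(xs))]
        y_star = [ys[t] - new_rho * ys[t - 1] for t in range(1, len(ys))]
        a_star, slope = ols(x_star, y_star)
        intercept = a_star / (1.0 - new_rho)
        converged = abs(new_rho - rho) < tol
        rho = new_rho
        if converged:
            break
    return intercept, slope, rho


def simulate(n=200, rho=0.7, seed=0):
    """Made-up data: y = 2 + 3x + AR(1) noise (illustration only)."""
    rng = random.Random(seed)
    xs = [t / 10 for t in range(n)]
    ys = []
    err = 0.0
    for x in xs:
        err = rho * err + rng.gauss(0.0, 1.0)
        ys.append(2.0 + 3.0 * x + err)
    return xs, ys


xs, ys = simulate()
intercept, slope, rho_hat = cochrane_orcutt(xs, ys)
print(f"slope={slope:.3f}, rho={rho_hat:.3f}")
```

The quasi-differencing step drops the first observation, which is the classical form of the procedure; variants that keep it (Prais-Winsten) are a separate technique.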

Pedestrian Accident Rate Models of Circular Intersection Near Schools (학교와 인접한 원형교차로의 보행자 사고율 모형)

  • SON, Seul Ki;LEE, Min Yeong;PARK, Byung Ho
    • Journal of Korean Society of Transportation
    • /
    • v.35 no.4
    • /
    • pp.321-331
    • /
    • 2017
  • The objective of this study is to analyze the factors affecting pedestrian accidents at roundabouts near schools. To this end, the study focuses on a comparative analysis of pedestrian accidents across different school areas. Traffic accident data from 2007 to 2014 were collected from the TAAS data set of the Road Traffic Authority. A linear regression model was used to develop the pedestrian accident rate models, with 28 explanatory variables such as geometry and traffic volume factors. The main results are as follows. First, the null hypotheses that the numbers of pedestrian accidents are the same are rejected. Second, 5 multiple linear regression accident models with high statistical significance (adjusted $R^2$ of 0.651~0.788) were developed. Third, the common variables of the 3 models related to school location (models I~III) are pedestrian island, crosswalk, type of roundabout, elementary school, and bus stop. Fourth, the common variables of the 3 models related to whether the site is near a school (models III~V) are pedestrian island, type of roundabout, sidewalk, elementary school, speed hump, speed limit sign, and number of entry lanes. As a result, installing pedestrian islands and crosswalks may be expected to decrease the number of pedestrian accidents near schools.
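
For readers unfamiliar with the adjusted $R^2$ reported above, the following sketch shows how the statistic penalizes the number of explanatory variables. The sample size and raw $R^2$ used in the example are hypothetical; the abstract does not report them.

```python
def adjusted_r2(r2, n_obs, n_predictors):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    dof = n_obs - n_predictors - 1
    if dof <= 0:
        raise ValueError("need more observations than predictors + 1")
    return 1.0 - (1.0 - r2) * (n_obs - 1) / dof


# Hypothetical numbers: the same raw R^2 of 0.8 from 50 observations
# is penalized far more with 28 predictors than with 5.
few = adjusted_r2(0.8, 50, 5)
many = adjusted_r2(0.8, 50, 28)
print(round(few, 4), round(many, 4))
```

With many candidate variables and few sites, the penalty grows quickly, which is one reason to compare models on the adjusted rather than the raw statistic.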

A Study on Regionalization of Parameters for Sacramento Continuous Rainfall-Runoff Model Using Watershed Characteristics (유역특성인자를 활용한 Sacramento 장기유출모형의 매개변수 지역화 기법 연구)

  • Kim, Tae-Jeong;Jeong, Ga-In;Kim, Ki-Young;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association
    • /
    • v.48 no.10
    • /
    • pp.793-806
    • /
    • 2015
  • Simulating natural streamflow in ungauged basins is one of the fundamental challenges in the hydrology community. The key to runoff simulation in ungauged basins is generally reliable parameter estimation for a rainfall-runoff model. However, parameter estimation for such a model is complex because hydrologic data are insufficient. This study aims to regionalize the parameters of a continuous rainfall-runoff model in conjunction with a Bayesian statistical technique, in order to account more precisely for the uncertainty associated with the parameters. First, the study employed a Bayesian Markov Chain Monte Carlo scheme to estimate the parameters of the Sacramento rainfall-runoff model. The Sacramento model is calibrated against observed daily runoff data, and the posterior density function of the parameters is then derived. Second, we applied a multiple linear regression model relating the set of parameters to watershed characteristics, to obtain a functional relationship between pairs of variables. The proposed model was also validated on gauged watersheds using efficiency criteria such as the Nash-Sutcliffe efficiency, the index of agreement, and the coefficient of correlation.
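
Two of the efficiency criteria mentioned above can be written out as follows. This is a sketch: the index of agreement is assumed to be Willmott's original form, which the abstract does not specify, and the runoff values are made up.

```python
def nash_sutcliffe(observed, simulated):
    """NSE = 1 - sum((O - S)^2) / sum((O - mean(O))^2)."""
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - sse / sst


def index_of_agreement(observed, simulated):
    """Willmott's d = 1 - sum((O - S)^2) / sum((|S - Ob| + |O - Ob|)^2)."""
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    pot = sum((abs(s - mean_obs) + abs(o - mean_obs)) ** 2
              for o, s in zip(observed, simulated))
    return 1.0 - sse / pot


# Made-up daily runoff values, for illustration only.
obs = [1.0, 2.0, 3.0, 4.0, 5.0]
perfect = list(obs)
mean_only = [3.0] * 5
print(nash_sutcliffe(obs, perfect), nash_sutcliffe(obs, mean_only))
```

An NSE of 1 means a perfect match, while 0 means the simulation is no better than predicting the observed mean. Both functions divide by zero if the observations are constant.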

A Multivariate Analysis of Korean Professional Players Salary (한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석)

  • Song, Jong-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.441-453
    • /
    • 2008
  • We analyzed the salaries of Korean professional basketball and baseball players under the assumption that salary depends on personal records and contribution to the team in the previous year. We made extensive use of data visualization tools to check the relationships among the variables, to find outliers, and to perform model diagnostics. We fitted multiple linear regression and regression tree models and used cross-validation to find an optimal model. We examined the relationships between variables carefully and chose a subset of variables for the stepwise regression instead of using all variables. We found that points per game, number of assists, number of free throws made, and career are important variables for basketball players. For baseball pitchers, career, number of strikeouts per 9 innings, ERA, and number of home runs are important variables. For baseball hitters, career, number of hits, and FA are important variables.
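
The use of cross-validation to choose between candidate models can be sketched as below. This is a simplified stand-in, not the paper's procedure: it compares only an intercept-only model with a one-predictor line, the fold assignment is a plain round-robin, and the data are made up.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = a + b*x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return mean_y - slope * mean_x, slope


def kfold_mse(xs, ys, k, use_predictor):
    """Mean squared prediction error over k round-robin folds."""
    n = len(xs)
    total = 0.0
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        train_x = [xs[i] for i in train]
        train_y = [ys[i] for i in train]
        if use_predictor:
            a, b = fit_line(train_x, train_y)
        else:
            # Intercept-only model: predict the training mean.
            a, b = sum(train_y) / len(train_y), 0.0
        total += sum((ys[i] - (a + b * xs[i])) ** 2 for i in test)
    return total / n


# Made-up data: a linear trend plus a small alternating disturbance.
xs = [float(i) for i in range(20)]
ys = [1.0 + 0.5 * x + (0.2 if i % 2 == 0 else -0.2) for i, x in enumerate(xs)]
mse_line = kfold_mse(xs, ys, 5, use_predictor=True)
mse_mean = kfold_mse(xs, ys, 5, use_predictor=False)
print(mse_line < mse_mean)
```

The model with the lower held-out error is preferred; the same loop extends to any pair of fitting routines.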

Soil moisture estimation of YongdamDam watershed using vegetation index from Sentinel-1 and -2 satellite images (Sentinel-1 및 Sentinel-2 위성영상기반 식생지수를 활용한 용담댐 유역의 토양수분 산정)

  • Son, Moobeen;Chung, Jeehun;Lee, Yonggwan;Woo, Soyoung;Kim, Seongjoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.161-161
    • /
    • 2021
  • This study estimated soil moisture for the Yongdam Dam watershed (930.0 km²) in the upper Geum River using Sentinel-1 SAR (Synthetic Aperture Radar) and Sentinel-2 MultiSpectral Instrument (MSI) satellite images. The data comprised VV (Vertical transmit-Vertical receive) and VH (Vertical transmit-Horizontal receive) polarization data from the 10 m resolution Sentinel-1 IW (Interferometric Wide swath) mode GRD (Ground Range Detected) product and Sentinel-2 Level-2A Bottom of Atmosphere (BOA) reflectance data, assembled for 2019 at 6-day and 5-day intervals, respectively. The satellite images were processed with SNAP (SentiNel Application Platform) to generate the backscattering coefficients for each Sentinel-1 polarization (VV, VH) and the Sentinel-2 red (Band-4) and near-infrared (Band-8) images. A multiple linear regression model was used to estimate soil moisture, and a separate model was built for each soil property corresponding to each site. The model inputs were the Sentinel-1 backscattering coefficients for each polarization, together with the RVI (Radar Vegetation Index) derived from Sentinel-1 and the NDVI (Normalized Difference Vegetation Index) derived from Sentinel-2, so as to reflect the influence of vegetation. To validate the simulated soil moisture, measured soil moisture data based on TDR (Time Domain Reflectometry) will be collected at 6 sites, and validation will be carried out for the whole period and by season using the correlation coefficient (R), root mean square error (RMSE), and IOA (Index of Agreement).
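
The two vegetation indices used as model inputs can be computed per pixel as sketched below. The abstract does not state the formulas: NDVI follows its standard definition from red and near-infrared reflectance, while the dual-polarization RVI form 4·VH / (VV + VH) is an assumption here (one commonly used formulation for Sentinel-1), and it expects backscatter in linear power units rather than dB. The function names and sample values are hypothetical.

```python
def db_to_linear(value_db):
    """Convert a backscattering coefficient from dB to linear power."""
    return 10.0 ** (value_db / 10.0)


def ndvi(nir, red):
    """NDVI = (NIR - Red) / (NIR + Red), from surface reflectance."""
    return (nir - red) / (nir + red)


def rvi_dual_pol(vv_linear, vh_linear):
    """Assumed dual-pol RVI = 4 * VH / (VV + VH), linear power inputs."""
    return 4.0 * vh_linear / (vv_linear + vh_linear)


# Made-up pixel values: Band-8 (NIR) and Band-4 (red) reflectance,
# and VV/VH backscatter given in dB.
pixel_ndvi = ndvi(nir=0.5, red=0.1)
pixel_rvi = rvi_dual_pol(db_to_linear(-8.0), db_to_linear(-15.0))
print(round(pixel_ndvi, 3), round(pixel_rvi, 3))
```

Both indices would then enter the regression as additional columns next to the VV and VH backscattering coefficients.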


Estimation of AADT Using Multiple Linear Regression in Isolated Area (다중선형 회귀분석을 이용한 고립지역에서의 AADT 추정방안 연구)

  • Kim, Tae-woon;Oh, Ju-sam
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.4
    • /
    • pp.887-896
    • /
    • 2015
  • This study estimates future AADT in isolated areas using historical AADT and socio-economic factors. Multiple linear regression on socio-economic factors yields a lower MAPE and a higher R-square than estimation from historical AADT. An analysis of which socio-economic factors influence AADT shows that a variety of socio-economic factors affect AADT in typical isolated areas, whereas the oil price affects AADT in isolated coastal areas. Judged by $R^2$ and MAPE, the AADT forecasting model for isolated areas performs very well. The estimation of AADT in isolated areas by multiple linear regression is assumed to be accurate because these areas have little through traffic and little fluctuation in traffic volume.
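
The MAPE criterion used to compare the forecasting approaches can be computed as in the sketch below. The traffic counts are made up for illustration.

```python
def mape(actual, forecast):
    """Mean absolute percentage error, in percent."""
    if any(a == 0 for a in actual):
        raise ValueError("MAPE is undefined when an actual value is zero")
    ratios = [abs(a - f) / abs(a) for a, f in zip(actual, forecast)]
    return 100.0 * sum(ratios) / len(ratios)


# Made-up AADT values (vehicles per day) for two years.
observed_aadt = [100.0, 200.0]
predicted_aadt = [110.0, 180.0]
error_pct = mape(observed_aadt, predicted_aadt)
print(round(error_pct, 2))
```

Because each error is scaled by the observed value, MAPE lets low-volume and high-volume road sections be compared on the same footing.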

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project are analyzed using the SMEs youth-intern DB and the employment insurance DB. Since the number of job changes is count data that takes only non-negative integer values, ordinary linear regression analysis is inappropriate. Therefore, four models, namely the Poisson regression model, the zero-inflated Poisson regression model, the negative binomial regression model, and the zero-inflated negative binomial regression model, were fitted to the count data. The zero-inflated negative binomial regression model was selected as the best model. The major results are as follows. First, the number of job changes is significantly smaller in the treatment group than in the control group. Second, the number of job changes is significantly smaller in the young-age group than in the old-age group. Third, the number of job changes is significantly greater for men than for women. Lastly, the number of job changes is significantly smaller in larger firms than in smaller firms.
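
To show what distinguishes the selected model from a plain negative binomial, the sketch below writes out the zero-inflated negative binomial probability mass function without covariates. The parameter values are hypothetical and the mean/size parameterization is an assumption; the paper's fitted regression is not reproduced here.

```python
import math


def nb_pmf(k, mu, size):
    """Negative binomial pmf with mean mu and dispersion parameter size."""
    log_p = (math.lgamma(k + size) - math.lgamma(size) - math.lgamma(k + 1)
             + size * math.log(size / (size + mu))
             + k * math.log(mu / (size + mu)))
    return math.exp(log_p)


def zinb_pmf(k, mu, size, pi_zero):
    """Zero-inflated NB: extra mass pi_zero at zero, NB otherwise."""
    base = (1.0 - pi_zero) * nb_pmf(k, mu, size)
    return pi_zero + base if k == 0 else base


# Hypothetical parameters: mean 2 job changes, size 1.5, 30% structural zeros.
p_zero_nb = nb_pmf(0, 2.0, 1.5)
p_zero_zinb = zinb_pmf(0, 2.0, 1.5, 0.3)
print(round(p_zero_nb, 4), round(p_zero_zinb, 4))
```

The zero-inflated form assigns more probability to zero counts than the negative binomial alone, which suits data where many interns never change jobs.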

Probabilistic Runoff Analysis using Ensemble Technique with Localization Method (앙상블 기반 지역화 기법을 이용한 확률론적 유출량 분석)

  • Lee, Han-Yong;Jang, Suk-Hwan;Lee, Jae-Kyoung;Jo, Jun-Won
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.207-207
    • /
    • 2019
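  • In Korea, regional characteristics and climate change have recently increased the variability of hydrological factors, and runoff is regarded as an important issue in the sustainable management of water resources. Ungauged basins in particular, such as some small streams and border areas, lack data on hydrological factors, and long-term runoff analysis is difficult because the initial values of the hydrological model must be set and optimized parameters must be determined from past runoff data. To estimate runoff for the upper Imjin River basin, the ungauged basin to which this study is applied, runoff was computed and verified using multiple linear regression analysis and spatial proximity analysis, which are regionalization techniques that estimate model parameters from gauged-basin data. In addition, runoff was predicted by applying an ensemble technique capable of probabilistic prediction, and its efficiency was examined with prediction accuracy metrics, yielding a probabilistic prediction of runoff for the ungauged basin. To examine the applicability of the representative regionalization techniques, multiple linear regression analysis and spatial proximity analysis based on gauged basins were applied to the abcd model. Simulated runoff was computed and compared with observed runoff, and the simulation accuracy was found to be high. The runoff of the ungauged basin was estimated on the basis of this verification. The regionalization techniques were then applied to the ensemble technique to examine the efficiency of probabilistic runoff prediction. This was carried out for the lower Imjin River basin, which contains the same tributaries as the study basin. Past rainfall and evaporation data (1988~2012) were used to generate monthly forecast runoff ensembles for the verification period (2013~2017), using the abcd model with the regionalization techniques applied. The accuracy of the predicted runoff was evaluated and found to be relatively high. The probabilistic runoff of the ungauged basin was predicted on the basis of these results. Thus, applying representative regionalization techniques to the ensemble technique allows more accurate runoff prediction when predicting probabilistic runoff.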


Estimation of Snow Damages using Multiple Regression Model - The Case of Gangwon Province - (대설피해액 추정을 위한 다중회귀 모형의 적용성 평가 - 강원도 지역을 중심으로 -)

  • Kwon, Soon Ho;Chung, Gunhui
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.37 no.1
    • /
    • pp.61-72
    • /
    • 2017
  • Due to climate change, loss of life and property damage caused by natural disasters have recently been increasing steadily. In South Korea, total damage by natural disasters over the 20 years from 1994 to 2013 is about 1.0 million dollars, and 13% of that total was caused by heavy snow. This is a smaller amount than the damage caused by heavy rainfall or typhoons, but it can still cause severe damage to society. In this study, snow damage in the Gangwon region was estimated using climate variables (daily maximum snow depth, relative humidity, minimum temperature) and socio-economic variables (farm population density, GRDP). Multiple regression analysis with the enter method was applied to estimate snow damage. The adjusted R-square is above 0.7 in some sub-regions, which indicates good applicability, although extreme values are not predicted well. The developed model might be applied to prompt disaster response.

The wage determinants of the vocational high school graduates using mixed effects model (혼합모형을 이용한 특성화고 졸업생의 임금결정요인 분석)

  • Ryu, Jangsoo;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.4
    • /
    • pp.935-946
    • /
    • 2016
  • In this paper, we analyzed the wage determinants of vocational high school graduates using both individual-level and work region-level variables. We formulated the models so that wage determination has a multi-level structure, in the sense that an individual's wage is influenced by individual-level (level-1) variables and work region-level (level-2) variables. To incorporate the dependency between individual wages into the model, we used a hierarchical linear model (HLM). The major results are as follows. First, the HLM is shown to be better than OLS regression models, which do not take level-1 and level-2 variables into account simultaneously. Second, the random effects of the sex, maester dummy, and engineering dummy variables are statistically significant. Third, among the level-2 variables, the fixed effects of business hours and mean wage of regular jobs have a statistically significant effect on individual-level wages. Finally, parental education level, parental income, number of licenses, and high school grade are statistically significant for higher individual-level wages.