• Title/Summary/Keyword: Multiple regression models

Wage Determinants Analysis by Quantile Regression Tree

  • Chang, Young-Jae
    • Communications for Statistical Applications and Methods / v.19 no.2 / pp.293-301 / 2012
  • Quantile regression, proposed by Koenker and Bassett (1978), is a statistical technique that estimates conditional quantiles. Its advantage over ordinary least squares (OLS) regression is robustness to large outliers. Regression trees have been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm, which has negligible selection bias and relatively low computational cost. Because quantile regression can be regarded as an analogue of OLS, it can also be used within the GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study (KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data at the quantiles 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, the multiple linear piecewise quantile regression model forms the shortest tree, while the piecewise constant quantile regression model generally has a deeper tree with more terminal nodes. Age, gender, marital status, and education appear to be determinants of the wage level across the quantiles; in addition, education experience appears as an important determinant of the wage level in the highly paid group.
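
The robustness claim in the abstract above can be illustrated with the Koenker-Bassett check loss. The following is a minimal sketch, not the GUIDE procedure: it fits a single constant (as a piecewise constant model would inside one terminal node) by searching over the observed values, and the wage figures and function names are made up for illustration.

```python
import numpy as np

def check_loss(residuals, tau):
    """Koenker-Bassett check (pinball) loss, summed over all residuals."""
    residuals = np.asarray(residuals, dtype=float)
    # Positive residuals are weighted by tau, negative ones by (1 - tau).
    return float(np.sum(residuals * (tau - (residuals < 0))))

def constant_quantile_fit(y, tau):
    """Best constant under check loss; a minimizer always lies on a data point."""
    y = np.asarray(y, dtype=float)
    losses = [check_loss(y - c, tau) for c in y]
    return float(y[int(np.argmin(losses))])

# Hypothetical wages with one large outlier (not KLIPS data).
wages = np.array([1.0, 2.0, 3.0, 4.0, 100.0])
median_fit = constant_quantile_fit(wages, 0.5)  # quantile fit at tau = 0.5
mean_fit = float(wages.mean())                  # the OLS fit of a constant
```

In this toy sample the tau = 0.5 fit stays at the middle observation, while the mean is pulled toward the outlier.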

Weekly Maximum Electric Load Forecasting Method for 104 Weeks Using Multiple Regression Models (다중회귀모형을 이용한 104주 주 최대 전력수요예측)

  • Jung, Hyun-Woo;Kim, Si-Yeon;Song, Kyung-Bin
    • The Transactions of The Korean Institute of Electrical Engineers / v.63 no.9 / pp.1186-1191 / 2014
  • Weekly and monthly electric load forecasts are essential for generator maintenance planning and the systematic operation of the electric power reserve. This paper proposes a weekly maximum electric load forecasting model for 104 weeks based on multiple regression. The input variables are temperature and GDP, which are highly correlated with electric load. A weekly variable is added as an input to improve forecasting accuracy. Test results show that the proposed algorithm forecasts more accurately than the seasonal autoregressive integrated moving average model. We expect that the proposed algorithm can contribute to the systematic operation of the power system by improving the accuracy of electric load forecasting.
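
A multiple regression of this general shape can be sketched as below. This is only an illustration under stated assumptions: the data are synthetic, the coefficients are invented, and the paper's "weekly variable" is assumed here to be an annual sine/cosine pair, which may differ from the authors' actual encoding.

```python
import numpy as np

# All numbers below are synthetic; nothing here comes from the paper's data.
rng = np.random.default_rng(0)
weeks = np.arange(104)
temperature = 15 + 10 * np.sin(2 * np.pi * weeks / 52) + rng.normal(0, 2, 104)
gdp = 300 + 0.8 * weeks

# Assumed encoding of the "weekly variable": annual sine/cosine terms.
season_sin = np.sin(2 * np.pi * weeks / 52)
season_cos = np.cos(2 * np.pi * weeks / 52)

# Design matrix: intercept, temperature, GDP, and the two seasonal terms.
X = np.column_stack([np.ones(104), temperature, gdp, season_sin, season_cos])
true_coef = np.array([40.0, -0.6, 0.1, 3.0, -2.0])  # hypothetical coefficients
load = X @ true_coef  # noise-free so the fit can be checked exactly

# Ordinary least squares fit and in-sample mean absolute percentage error.
coef, *_ = np.linalg.lstsq(X, load, rcond=None)
fitted = X @ coef
mape = float(np.mean(np.abs((load - fitted) / load)) * 100)
```

With real data the response would carry noise, and accuracy would be judged on held-out weeks rather than in-sample.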

Study on the Critical Storm Duration Decision of the Rivers Basin (중소하천유역의 임계지속시간 결정에 관한 연구)

  • Ahn, Seung-Seop;Lee, Hyeo-Jung;Jung, Do-June
    • Journal of Environmental Science International / v.16 no.11 / pp.1301-1312 / 2007
  • The objective of this study is to propose a model for forecasting the critical storm duration of storm runoff in small river basins. Critical storm duration data for 582 sub-basins, drawn from the disaster impact assessment reports of the National Emergency Management Agency for 2004 to 2007, were collected and analyzed. Stepwise multiple regression was used to establish critical storm duration forecasting models of linear and exponential type. The regression results favored the linear type over the exponential type. Multiple linear regression between the critical storm duration and five basin characteristic parameters (basin area, main stream length, average slope of the main stream, shape factor, and CN) yielded a multiple correlation coefficient above 0.75.
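
The comparison of linear and exponential model types by multiple correlation coefficient can be sketched as follows. This is an illustration only: the data are synthetic, the helper names are hypothetical, and the "exponential type" is assumed to mean a model that is linear in the logarithm of the response, which may not match the form used in the paper.

```python
import numpy as np

def ols_fitted(X, y):
    """Fitted values of an OLS regression of y on X with an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return A @ coef

def multiple_r(y, fitted):
    """Multiple correlation coefficient: correlation of observed and fitted."""
    return float(np.corrcoef(y, fitted)[0, 1])

# Synthetic stand-ins for five basin characteristics (not the study's data).
rng = np.random.default_rng(1)
n = 60
X = rng.uniform(1.0, 10.0, size=(n, 5))
duration = 5.0 + X @ np.array([0.5, 1.2, -0.3, 0.8, 0.1]) + rng.normal(0, 0.2, n)

# Linear type: regress the duration directly on the predictors.
r_linear = multiple_r(duration, ols_fitted(X, duration))

# Assumed exponential type: regress log(duration), then back-transform.
r_exponential = multiple_r(duration, np.exp(ols_fitted(X, np.log(duration))))
```

Both coefficients are computed on the original scale so that they are comparable; because the synthetic response is linear by construction, the linear type scores higher here.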

Optimize OTDOA-based Positioning Accuracy by Utilizing Multiple Linear Regression Model under NB-IoT Technology (NB-IoT 기술에서 Multiple Linear Regression Model을 활용하여 OTDOA 기반 포지셔닝 정확도 최적화)

  • Pan, Yichen;Kim, Jaesoo
    • Proceedings of the Korean Society of Computer Information Conference / 2020.07a / pp.139-142 / 2020
  • NB-IoT (Narrow Band Internet of Things) is an emerging LPWAN (Low Power Wide Area Network) radio technology. NB-IoT has many advantages, such as low power, low cost, and high coverage. However, its low bandwidth and low sampling rate also lead to poor positioning accuracy. This paper proposes a solution that optimizes positioning accuracy under the OTDOA (Observed Time Difference of Arrival) approach by using MLR (Multiple Linear Regression) models. The MLR model predicts how strongly weather (temperature, humidity, light intensity, and air pressure) influences the arrival time of the transmitted signal, which improves measurement accuracy. Better measurement accuracy can greatly improve IoT applications based on NB-IoT.

Analysis of Accident Characteristics and Development of Accident Models in the Signalized Intersections of Cheongju and Cheongwon (지방부 신호교차로 사고특성분석 및 모형개발 (청주.청원을 중심으로))

  • Park, Byung-Ho;Yoo, Doo-Seon;Yang, Jeong-Mo;Lee, Young-Min
    • Journal of Korean Society of Transportation / v.26 no.2 / pp.35-46 / 2008
  • The purposes of this study are to analyze the characteristics of traffic accidents and to develop accident models. To this end, the study focuses on developing models (multiple linear, Poisson, and negative binomial regression) using data from signalized intersections in Cheongju and Cheongwon. The main results are as follows. First, the accident characteristics of the rural area were defined by factor. Second, four accident models, all statistically significant, were developed. Finally, variables such as $X_2$ and $X_{11}$ were found to be specific variables that reflect the characteristics of the rural area.

Comparison of tree-based ensemble models for regression

  • Park, Sangho;Kim, Chanmin
    • Communications for Statistical Applications and Methods / v.29 no.5 / pp.561-589 / 2022
  • Combining multiple classification and regression trees produces tree-based ensemble models such as random forest (RF) and Bayesian additive regression trees (BART). In this study, we compare the model structures and performances of various ensemble models in regression settings. RF learns from bootstrapped samples and selects a splitting variable from the predictors gathered at each node. The BART model is specified as a sum of trees and is fitted using the Bayesian backfitting algorithm. Through extensive simulation studies, the strengths and drawbacks of the two methods in the presence of missing data, high-dimensional data, or highly correlated data are investigated. In the presence of missing data, BART performs well in general, whereas RF provides adequate coverage. BART performs better on high-dimensional, highly correlated data. However, in all of the scenarios considered, RF has a shorter computation time. The performance of the two methods is also compared using two real data sets that represent the aforementioned situations, and the same conclusion is reached.

Short-term Peak Load Forecasting using Regression Models and Neural Networks (회귀모형과 신경회로망 모형을 이용한 단기 최대전력수요예측)

  • Koh, Hee-Seog;Ji, Bong-Ho;Lee, Hyun-Moo;Lee, Chung-Sik;Lee, Chul-Woo
    • Proceedings of the KIEE Conference / 2000.07a / pp.295-297 / 2000
  • In power demand forecasting, the most important problem is handling the load on special days. Accordingly, this paper presents a method for forecasting special-day load with regression models and neural networks. Special-day load in the summer season was forecast by multiple regression models using the weekday change ratio; the neural network models use the pattern conversion ratio; and the orthogonal polynomial models forecast directly from past special-day load data. The forecasts achieve a percentage forecast error of about $1{\sim}2\%$. Therefore, it is possible to forecast the load for both long and short special days.

Use of big data for estimation of impacts of meteorological variables on environmental radiation dose on Ulleung Island, Republic of Korea

  • Joo, Han Young;Kim, Jae Wook;Jeong, So Yun;Kim, Young Seo;Moon, Joo Hyun
    • Nuclear Engineering and Technology / v.53 no.12 / pp.4189-4200 / 2021
  • In this study, the relationship between the environmental radiation dose rate and meteorological variables was investigated using multiple regression analysis on big data for those variables. The environmental radiation dose rate and 36 different meteorological variables were measured on Ulleung Island, Republic of Korea, from 2011 to 2015. Not all meteorological variables were used in the regression analysis, because different meteorological variables significantly affect the environmental radiation dose rate during different periods, and the degree of influence changes with time. By applying Pearson correlation analysis and stepwise selection to the big dataset, the major meteorological variables influencing the environmental radiation dose rate were identified and then used as the independent variables of the regression model. Subsequently, multiple regression models were developed for the monthly datasets and for the dataset of the entire period.
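
The two-stage variable selection described above (Pearson screening followed by stepwise selection) can be sketched roughly as follows. This is a simplified forward-only variant on synthetic data with six made-up predictors; the thresholds `min_abs_corr` and `min_gain` and all function names are hypothetical and are not taken from the paper.

```python
import numpy as np

def rss(X, y, cols):
    """Residual sum of squares of an OLS fit on the given columns plus intercept."""
    A = np.column_stack([np.ones(len(y))] + [X[:, c] for c in cols])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return float(np.sum((y - A @ coef) ** 2))

def pearson_screen(X, y, min_abs_corr=0.1):
    """Keep the columns whose absolute Pearson correlation with y passes the bar."""
    return [j for j in range(X.shape[1])
            if abs(np.corrcoef(X[:, j], y)[0, 1]) >= min_abs_corr]

def forward_stepwise(X, y, candidates, min_gain=0.05):
    """Greedily add the predictor that most reduces RSS; stop when the
    relative reduction drops to min_gain or below (an assumed stopping rule)."""
    selected, remaining = [], list(candidates)
    current = rss(X, y, selected)
    while remaining:
        best = min(remaining, key=lambda c: rss(X, y, selected + [c]))
        new = rss(X, y, selected + [best])
        if current - new <= min_gain * current:
            break
        selected.append(best)
        remaining.remove(best)
        current = new
    return selected

# Synthetic data: only columns 0 and 2 drive the response.
rng = np.random.default_rng(2)
X = rng.normal(size=(300, 6))
y = 3.0 * X[:, 0] - 2.0 * X[:, 2] + rng.normal(0, 0.5, 300)

screened = pearson_screen(X, y)
selected = forward_stepwise(X, y, screened)
```

A full stepwise procedure would also test whether previously added variables should be dropped; that backward step is omitted here for brevity.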

A Study on the Emotional Evaluation of fabric Color Patterns

  • Koo, Hyun-Jin;Kang, Bok-Choon;Um, Jin-Sup;Lee, Joon-Whan
    • Science of Emotion and Sensibility / v.5 no.3 / pp.11-20 / 2002
  • Two new models are developed for the objective evaluation of fabric color patterns by applying multiple regression analysis and an adaptive fuzzy-rule-based system. The physical features of fabric color patterns are extracted through digital image processing, and the emotional features are collected based on the psychological experiments of Soen[3, 4]. The principal physical features are hue, saturation, intensity, and the texture of the color patterns. The emotional features are represented by thirteen pairs of opposite adjectives. The multiple regression analyses and the adaptive fuzzy system are used as tools to analyze the relations between physical and emotional features. As a result, both proposed models show competent approximation performance and a linguistic interpretation similar to that of Soen's psychological experiments.

Development of the Wind Power Forecasting System, KIER Forecaster (풍력발전 예보시스템 KIER Forecaster의 개발)

  • Kim Hyun-Goo;Lee Yung-Seop;Jang Mun-Seok;Kyong Nam-Ho
    • New & Renewable Energy / v.2 no.2 s.6 / pp.37-43 / 2006
  • In this paper, the first forecasting system for wind power generation, KIER Forecaster, is presented. KIER Forecaster is built on statistical models and was trained with wind speed data observed at Gosan Weather Station near the Walryong Site. Because the measurement period at the Walryong Site was too short for training the model, Gosan wind data were substituted and transplanted to the Walryong Site using the Measure-Correlate-Predict (MCP) technique. The results of the one- to three-hour-ahead forecasting models are consistent with the measurements at the Walryong Site. In particular, the multiple regression model with classification of wind speed patterns, developed in this work, shows the best performance compared with the neural network and auto-regressive models.
