• Title/Summary/Keyword: multiple linear regression analysis

Search Result 1,113, Processing Time 0.029 seconds

Construction of Urban Crime Prediction Model based on Census Using GWR (GWR을 이용한 센서스 기반 도시범죄 특성 분석 및 예측모델 구축)

  • YOO, Young-Woo;BAEK, Tae-Kyung
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.20 no.4
    • /
    • pp.65-76
    • /
    • 2017
  • The purpose of this study was to present a prediction model that reflects crime risk area analysis, including factors and spatial characteristics, as a precursor to preparing an alternative plan for crime prevention and design. This analysis of criminal cases in high-risk areas revealed clusters in which approximately 25% of the cases within the study area occurred, distributed evenly throughout the region. This means that using a multiple linear regression model might overestimate the crime rate in some regions and underestimate in others. It also suggests that the number of deserted houses in an analyzed region has a negative relationship with the dependent variable, based on the multiple linear regression model results, and can also have different influences depending on the region. These results reveal that closure signs in a study area affect the dependent variable differently, depending on the region, rather than a simple or direct relationship with the dependent variable, as indicated by the results of the multiple linear regression model.

A Multivariate Analysis of Korean Professional Players Salary (한국 프로스포츠 선수들의 연봉에 대한 다변량적 분석)

  • Song, Jong-Woo
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.3
    • /
    • pp.441-453
    • /
    • 2008
  • We analyzed Korean professional basketball and baseball players salary under the assumption that it depends on the personal records and contribution to the team in the previous year. We extensively used data visualization tools to check the relationship among the variables, to find outliers and to do model diagnostics. We used multiple linear regression and regression tree to fit the model and used cross-validation to find an optimal model. We check the relationship between variables carefully and chose a set of variables for the stepwise regression instead of using all variables. We found that points per game, number of assists, number of free throw successes, career are important variables for the basketball players. For the baseball pitchers, career, number of strike-outs per 9 innings, ERA, number of homeruns are important variables. For the baseball hitters, career, number of hits, FA are important variables.

Correlation Analysis of Water Quality According to Land Use Types of Reservoir Watershed (유역 토지이용과 저수지 수질의 상관관계 분석)

  • Youn, Dong-Koun;Chung, Sang-Ok
    • Proceedings of the Korean Society of Agricultural Engineers Conference
    • /
    • 2005.10a
    • /
    • pp.614-619
    • /
    • 2005
  • The object of this study was to presented regression equations for obtaining simply and quickly values of water quality items, BOD, COD, T-N, and T-P. Regression equations obtained to analyze relationships for water quality items to land use types in agricultural reservoir watersheds. In order to derive regression equations, a multiple linear regression analysis was used in this studying reservoirs. In this regression analysis, a independent values used land used types and dependent values used BOD, COD, T-N, T-P values in water quality items. The results showed that numbers of regression equation ranging above 0.90 in a multiple correlation coefficient (MCC) was not found, ranging from 0.70 to 0.90 in the MCC was 6, ranging from 0.40 to 0.70 in the MCC was 20, and ranging from 0.20 to 0.40 in the MCC was 4. The results of this study can be used as a basic information for evaluating simply and quickly water quality for proposing and designing steps in water quality policy.

  • PDF

Impact of Maintenance Time of Anti-Ship Missile Harpoon on Operational Availability with Field Data (야전데이터 기반 하푼 유도탄 정비 소요시간이 가동률에 미치는 영향 연구)

  • Choi, Youngjae;Ma, Jungmok
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.23 no.4
    • /
    • pp.426-434
    • /
    • 2020
  • This paper studies the impact of the maintenance time of anti-ship missile Harpoon on operational availability with real field data. The Harpoon maintenance simulation model is developed as a testbed for identifying the optimal inventory levels on operational availability. Using multiple linear regression analysis and integer programming, the optimal inventory levels of essential assemblies are suggested. Finally, the result of sensitivity analysis shows the quantitative impact of maintenance time on operational availability and inventory costs. The authors believe that this quantitative analysis can support policy decisions to decrease maintenance time of missiles.

Price Monitoring Automation with Marketing Forecasting Methods

  • Oksana Penkova;Oleksandr Zakharchuk;Ivan Blahun;Alina Berher;Veronika Nechytailo;Andrii Kharenko
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.37-46
    • /
    • 2023
  • The main aim of the article is to solve the problem of automating price monitoring using marketing forecasting methods and Excel functionality under martial law. The study used the method of algorithms, trend analysis, correlation and regression analysis, ANOVA, extrapolation, index method, etc. The importance of monitoring consumer price developments in market pricing at the macro and micro levels is proved. The introduction of a Dummy variable to account for the influence of martial law in market pricing is proposed, both in linear multiple regression modelling and in forecasting the components of the Consumer Price Index. Experimentally, the high reliability of forecasting based on a five-factor linear regression model with a Dummy variable was proved in comparison with a linear trend equation and a four-factor linear regression model. Pessimistic, realistic and optimistic scenarios were developed for forecasting the Consumer Price Index for the situation of the end of the Russian-Ukrainian war until the end of 2023 and separately until the end of 2024.

The Analysis of User's Degree on Landscape Satisfaction Factors for Pedestrian Road -Case Study of Bun-Dang New Town- (보행자 전용도로의 이용자 경관만족 요인분석 -분당 신도시를 중심으로-)

  • Kim, Dae-Hyun
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.4 no.2
    • /
    • pp.1-10
    • /
    • 2001
  • The purpose of this study was to investigate factors and variables which have significant effects on landscape satisfaction of urban pedestrian road in Bun-dang new town and to suggest basic information for urban pedestrian road design. These works consist of two phase. First, we tested the Hye-Cheon college students' degree of landscape satisfaction for 37 spots of urban pedestrian road and then selected 10 sports slide by the Sturges' formula. Second, we analysed factors and variables on landscape satisfaction of urban pedestrian road using the semantic differential scale method and then processed using descriptive analysis, factor analysis and multiple linear regression analysis. The major findings of this study can be summarized as follows; 1) The difference of landscape adjectives between the highest score of landscape satisfaction slide and the lowest score landscape satisfaction slide were diversity of vegetation, plenty of the shade of a tree, naturalness and cleanness. 2) Diversity of vegetation, width of road, freedom of danger and diversity of environment can be significant variables of major effects on landscape satisfaction of urban pedestrian road by using the multiple linear regression analysis. 3) Factors covering the landscape satisfaction of urban pedestrian road have been found to be Environment of urban pedestrian road and Constitution of urban pedestrian road. By using the Varimaxs' rotation factor analysis for the number of factors' cumulative percentage has been obtained as 64%. 4) Environment of urban pedestrian road and Constitution of urban pedestrian road can be significant factors of major effects on landscape satisfaction of urban pedestrian road by using the multiple linear regression analysis. In conclusion, the landscape satisfaction factors and variables of urban pedestrian road need to be considered in plan or design the urban pedestrian road.

  • PDF

Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting (실시간 수위 예측을 위한 다중선형회귀 모형의 비교)

  • Choi, Seung Yong;Han, Kun Yeun;Kim, Byung Hyun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.32 no.1B
    • /
    • pp.9-20
    • /
    • 2012
  • Recently to overcome limitations of conceptual, hydrological and physics based models for flood stage forecasting, multiple linear regression model as one of data-driven models have been widely adopted for forecasting flood streamflow(stage). The objectives of this study are to compare performance of different multiple linear regression models according to regression coefficient estimation methods and determine most effective multiple linear regression flood stage forecasting models. To do this, the time scale was determined through the autocorrelation analysis of input data and different flood stage forecasting models developed using regression coefficient estimation methods such as LS(least square), WLS(weighted least square), SPW(stepwise) was applied to flood events in Jungrang stream. To evaluate performance of established models, fours statistical indices were used, namely; Root mean square error(RMSE), Nash Sutcliffe efficiency coefficient (NSEC), mean absolute error (MAE), adjusted coefficient of determination($R^{*2}$). The results show that the flood stage forecasting model using SPW(stepwise) parameter estimation can carry out the river flood stage prediction better in comparison with others, and the flood stage forecasting model using LS(least square) parameter estimation is also found to be slightly better than the flood stage forecasting model using WLS(weighted least square) parameter estimation.

Development of the Algorithm for Optimizing Wavelength Selection in Multiple Linear Regression

  • Hoeil Chung
    • Near Infrared Analysis
    • /
    • v.1 no.1
    • /
    • pp.1-7
    • /
    • 2000
  • A convenient algorithm for optimizing wavelength selection in multiple linear regression (MLR) has been developed. MOP (MLP Optimization Program) has been developed to test all possible MLR calibration models in a given spectral range and finally find an optimal MLR model with external validation capability. MOP generates all calibration models from all possible combinations of wavelength, and simultaneously calculates SEC (Standard Error of Calibration) and SEV (Standard Error of Validation) by predicting samples in a validation data set. Finally, with determined SEC and SEV, it calculates another parameter called SAD (Sum of SEC, SEV, and Absolute Difference between SEC and SEV: sum(SEC+SEV+Abs(SEC-SEV)). SAD is an useful parameter to find an optimal calibration model without over-fitting by simultaneously evaluating SEC, SEV, and difference of error between calibration and validation. The calibration model corresponding to the smallest SAD value is chosen as an optimum because the errors in both calibration and validation are minimal as well as similar in scale. To evaluate the capability of MOP, the determination of benzene content in unleaded gasoline has been examined. MOP successfully found the optimal calibration model and showed the better calibration and independent prediction performance compared to conventional MLR calibration.

Estimation of AADT Using Multiple Linear Regression in Isolated Area (다중선형 회귀분석을 이용한 고립지역에서의 AADT 추정방안 연구)

  • Kim, Tae-woon;Oh, Ju-sam
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.4
    • /
    • pp.887-896
    • /
    • 2015
  • This study estimates future AADT using historical AADT and socio-economic factors in isolated area. Multiple linear regression method by socio-economic factors are lower MAPE and higher R-square than using historical AADT. Analysis of socio-economic factors influence AADT in isolated typical areas, varied socio-economic factors influence on AADT. In isolated coastal areas, oil price influence on AADT. AADT forecasting model in isolated area is excellent when analysising $R^2$ and MAPE. It is assume that estimation of AADT in isolated area using multiple linear regression is accurate because of a little passed traffic volume and traffic volume fluctuation.

Evaluation of applicability of pan coefficient estimation method by multiple linear regression analysis (다변량 선형회귀분석을 이용한 증발접시계수 산정방법 적용성 검토)

  • Rim, Chang-Soo
    • Journal of Korea Water Resources Association
    • /
    • v.55 no.3
    • /
    • pp.229-243
    • /
    • 2022
  • The effects of monthly meteorological data measured at 11 stations in South Korea on pan coefficient were analyzed to develop the four types of multiple linear regression models for estimating pan coefficients. To evaluate the applicability of developed models, the models were compared with six previous models. Pan coefficients were most affected by air temperature for January, February, March, July, November and December, and by solar radiation for other months. On the whole, for 12 months of the year, the effects of wind speed and relative humidity on pan coefficient were less significant, compared with those of air temperature and solar radiation. For all meteorological stations and months, the model developed by applying 5 independent variables (wind speed, relative humidity, air temperature, ratio of sunshine duration and daylight duration, and solar radiation) for each station was the most effective for evaporation estimation. The model validation results indicate that the multiple linear regression models can be applied to some particular stations and months.