• Title/Summary/Keyword: Regression Study

Search Result 28,750, Processing Time 0.048 seconds

Factors Related to Regional Variation in the High-risk Drinking Rate in Korea: Using Quantile Regression

  • Kim, Eun-Su;Nam, Hae-Sung
    • Journal of Preventive Medicine and Public Health
    • /
    • v.54 no.2
    • /
    • pp.145-152
    • /
    • 2021
  • Objectives: This study aimed to identify regional differences in the high-risk drinking rate among yearly alcohol users in Korea and to identify relevant regional factors for each quintile using quantile regression. Methods: Data from 227 counties surveyed by the 2017 Korean Community Health Survey (KCHS) were analyzed. The analysis dataset included secondary data extracted from the Korean Statistical Information Service and data from the KCHS. To identify regional factors related to the high-risk drinking rate among yearly alcohol users, quantile regression was conducted by dividing the data into 10%, 30%, 50%, 70%, and 90% quantiles, and multiple linear regression was also performed. Results: The current smoking rate, perceived stress rate, crude divorce rate, and financial independence rate, as well as one's social network, were related to the high-risk drinking rate among yearly alcohol users. The quantile regression revealed that the perceived stress rate was related to all quantiles except for the 90% quantile, and the financial independence rate was related to the 50% to 90% quantiles. The crude divorce rate was related to the high-risk drinking rate among yearly alcohol users in all quantiles. Conclusions: The findings of this study suggest that local health programs for high-risk drinking are needed in areas with high local stress and high crude divorce rates.

Testing the Equality of Two Linear Regression Models : Comparison between Chow Test and a Permutation Test

  • Um, Yonghwan
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.8
    • /
    • pp.157-164
    • /
    • 2021
  • Regression analysis is a well-known statistical technique useful to explain the relationship between response variable and predictor variables. In particular, Researchers are interested in comparing the regression coefficients(intercepts and slopes) of the models in two independent populations. The Chow test, proposed by Gregory Chow, is one of the most commonly used methods for comparing regression models and for testing the presence of a structural break in linear models. In this study, we propose the use of permutation method and compare it with Chow test analysis for testing the equality of two independent linear regression models. Then simulation study is conducted to examine the powers of permutation test and Chow test.

A Study on Combustion Characteristics of Non-Circular Grain in Hybrid Rocket for RATO (Rocket-Assisted Take Off) System (RATO(Rocket-Assisted Take Off) 시스템 적용을 위한 하이브리드 로켓 비단공형 연료 그레인 기초 연소특성 연구)

  • Su Jin Kim;Su Han Ko;Sul Hee Kim;Gyeong Mo Kim;Seong Geun Lee;Ye Chan Han;Hee Jang Moon
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.30 no.4
    • /
    • pp.184-190
    • /
    • 2022
  • In an attempt to apply hybrid rocket to the RATO (Rocket-Assisted Take Off) system, combustion characteristics of the non-circular grain were figured out in this study. Having larger combustion area, it was reconfirmed that the non-circular grain has advantages over regression rate, characteristic velocity and chamber pressure in which all gave higher values. Experiments were performed to understand the effect of the non-circular grain geometry over time where local regression rates depending on grain location were analyzed. It was found that the regression rate of five distinct locations were different. Partial conclusion driven was that these differences are due to the heat transfer caused by dissimilar distances from the flame layer. Besides, as combustion duration increased, the fuel port became circular, and the regression rate converged to a single value over the whole grain.

Predicting the Young's modulus of frozen sand using machine learning approaches: State-of-the-art review

  • Reza Sarkhani Benemaran;Mahzad Esmaeili-Falak
    • Geomechanics and Engineering
    • /
    • v.34 no.5
    • /
    • pp.507-527
    • /
    • 2023
  • Accurately estimation of the geo-mechanical parameters in Artificial Ground Freezing (AGF) is a most important scientific topic in soil improvement and geotechnical engineering. In order for this, one way is using classical and conventional constitutive models based on different theories like critical state theory, Hooke's law, and so on, which are time-consuming, costly, and troublous. The others are the application of artificial intelligence (AI) techniques to predict considered parameters and behaviors accurately. This study presents a comprehensive data-mining-based model for predicting the Young's Modulus of frozen sand under the triaxial test. For this aim, several single and hybrid models were considered including additive regression, bagging, M5-Rules, M5P, random forests (RF), support vector regression (SVR), locally weighted linear (LWL), gaussian process regression (GPR), and multi-layered perceptron neural network (MLP). In the present study, cell pressure, strain rate, temperature, time, and strain were considered as the input variables, where the Young's Modulus was recognized as target. The results showed that all selected single and hybrid predicting models have acceptable agreement with measured experimental results. Especially, hybrid Additive Regression-Gaussian Process Regression and Bagging-Gaussian Process Regression have the best accuracy based on Model performance assessment criteria.

Effect of zero imputation methods for log-transformation of independent variables in logistic regression

  • Seo Young Park
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.4
    • /
    • pp.409-425
    • /
    • 2024
  • Logistic regression models are commonly used to explain binary health outcome variable using independent variables such as patient characteristics in medical science and public health research. Although there is no distributional assumption required for independent variables in logistic regression, variables with severely right-skewed distribution such as lab values are often log-transformed to achieve symmetry or approximate normality. However, lab values often have zeros due to limit of detection which makes it impossible to apply log-transformation. Therefore, preprocessing to handle zeros in the observation before log-transformation is necessary. In this study, five methods that remove zeros (shift by 1, shift by half of the smallest nonzero, shift by square root of the smallest nonzero, replace zeros with half of the smallest nonzero, replace zeros with the square root of the smallest nonzero) are investigated in logistic regression setting. To evaluate performances of these methods, we performed a simulation study based on randomly generated data from log-normal distribution and logistic regression model. Shift by 1 method has the worst performance, and overall shift by half of the smallest nonzero method, replace zeros with half of the smallest nonzero method, and replace zeros with the square root of the smallest nonzero method showed comparable and stable performances.

A Study for Influence of Sun Glare Effect on Traffic Safety at Tunnel Hood (직광에 의한 눈부심 현상이 터널 출구부 안전성에 미치는 영향 연구)

  • Kim, Youngrok;Kim, Sangyoup;Choi, Jaisung;Lee, Daesung
    • International Journal of Highway Engineering
    • /
    • v.14 no.6
    • /
    • pp.103-110
    • /
    • 2012
  • PURPOSES : In Korea, over 70 percent of the land consists of mountainous and rolling area. Thus, tunnels continue its upward trend as road network are extended. In these circumstances, the importance of tunnel has been increased nowadays and then its safety investigation and research should be performed. This study is focus on confirming and improving the safety of tunnel. On tunnel hood, sunglare effect can irritate driver's behavior instantly and this can result in incident. METHODS : The study of this phenomenon is rarely conducted in domestic and foreign papers, so there is no proper measure for this. This study analyzes the driving environment of the effect of sunglare effect on tunnel hood. RESULTS : Traffic accidents stem from complex set of factors. This study build the Traffic Accident Prediction Models to find out the effect of sunglare effect on tunnel's hood. The independent variables are traffic volume, geometric design of road, length of tunnel and road side environment. Using these variables, this model estimates accident frequency on tunnel hood by Poisson regression model and Negative binomial regression model. Although Poisson regression model have more proper goodness of fit than Negative binomial regression model, Poisson regression model has overdipersion problem. So the Negative binomial regression model is used in this analysis. CONCLUSIONS : Consequently, the model shows that sunglare effect can play a role in driving safety on tunnel hood. As a result, the information of sunglare effect should be noticed ahead of tunnel hood so this can prevent drivers from being in hazard situation.

Analysis of health-related quality of life using Beta regression (베타회귀분석 방법을 이용한 건강 관련 삶의 질 자료 분석)

  • Jang, Eun Jin
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.3
    • /
    • pp.547-557
    • /
    • 2017
  • The health-related quality of life data are commonly skewed and bounded with spike at the perfect health status, and the variance tended to be heteroscedastic. In this study, we have developed a prediction model for EQ-5D using linear regression model, beta regression model, and extended beta regression model with mean and precision submodel, and also compared the predictive accuracy. The extended beta regression model allows to model skewness and differences in dispersion related to covariates. Although the extended beta regression model has higher prediction accuracy than the linear regression model, the overlapped confidence intervals suggested that the extended beta regression model was superior to the linear regression model. However, the expended beta regression model could explain the heteroscedasticity and predict within the bounded range. Therefore, the expended beta regression model are appropriate for fitting the health-related quality of life data such as EQ-5D.

Evaluation of the heat island in transition zone of three cities in Kyungpook, Korea (추이대(推移帶)를 중심으로 한 경상북도 3개 도시의 열섬 평가)

  • Park, In Hwan;Jang, Gab Sue;Kim, Jong Yong
    • Journal of Environmental Impact Assessment
    • /
    • v.8 no.2
    • /
    • pp.73-82
    • /
    • 1999
  • This study analyzed the relationship between NDVI(Normalized Difference Vegetation Index) and urban heat island in three cities: Daegu, Kyungju, and Pohang for understanding the degree of nature conservation concentrating in the transition zone of them. Daegu city is the third city in Korea which has a dense population. Kyungju is a traditional city which has good nature. Pohang is an industrial city which has those of characters of Daegu and Kyungju. Landsat 1M data in May 17, 1997 were used for the analysis of heat island. There were about four theoretical models to estimate the surface temperature from TM data: Two-point linear model, Linear regression model, Quadratic regression model, and Cubic regression model. In this study, Linear regression model had been utilized to analyze the urban heat island. On the resultant images, the transition zone of Daegu was urbanized more extremely than those of other two cities. It is thought that the analysis of relationship between NDVI and surface temperature, used in this study, is regarded as one of effective methodologies for urban-environmental detection from satellite imageries.

  • PDF

A Comparative Study on the Genetic Algorithm and Regression Analysis in Urban Population Surface Modeling (도시인구분포모형 개발을 위한 GA모형과 회귀모형의 적합성 비교연구)

  • Choei, Nae-Young
    • Spatial Information Research
    • /
    • v.18 no.5
    • /
    • pp.107-117
    • /
    • 2010
  • Taking the East-Hwasung area as the case, this study first builds gridded population data based on the municipal population survey raw data, and then measures, by way of GIS tools, the major urban spatial variables that are thought to influence the composition of the regional population. For the purpose of comparison, the urban models based on the Genetic Algorithm technique and the regression technique are constructed using the same input variables. The findings indicate that the GA output performed better in differentiating the effective variables among the pilot model variables, and predicted as much consistent and meaningful coefficient estimates for the explanatory variables as the regression models. The study results indicate that GA technique could be a very useful and supplementary research tool in understanding the urban phenomena.

The Determinants of Listed Commercial Banks' Profitability in Vietnam

  • PHAN, Hai Thanh;HOANG, Tien Ngoc;DINH, Linh Viet;HOANG, Dat Ngoc
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.11
    • /
    • pp.219-229
    • /
    • 2020
  • The study investigates the factors affecting the profitability of listed commercial banks in Vietnam. Survey data for this research were collected from 10 Vietnamese listed commercial banks for the period from 2008 to 2018. In the study, we have built a model of econometric regression with the dependent variable being listed commercial banks' profitability results measured through ROA. The research methods used include descriptive statistics, IV regression and OLS regression analysis, and the authors carried out the model verification with Stata 14 software. The results showed that operating efficiency, loans size, retail loans ratio, state ownership, inflation rate, and GDP growth are factors that have a positive impact on profitability On the other hand, variables such as capital size, credit risk, liquidity risk, bank size, and revenue diversification are statistically insignificant; hence, these variables are not statistically adequate to indicate the influence of those independent variables to banks' profitability. The findings of this study suggest that the quality of assets should be considered in the context that bad debt risks come from lending heavily to the real estate sector. Meeting Basel II's capital compliance requirements is relatively difficult for small listed commercial banks compared to bigger listed commercial banks in Vietnam.