• Title/Abstract/Keywords: regional regression analysis

Search results: 662 items (processing time 0.027 s)

Evaluation of Regression Models in LOADEST to Estimate Suspended Solid Load in Hangang Waterbody

  • 박윤식;이지민;정영훈;신민환;박지형;황하선;류지철;박장호;김기성
    • 한국농공학회논문집 / Vol. 57, No. 2 / pp.37-45 / 2015
  • Typically, water quality sampling takes place intermittently, since sample collection and the subsequent analysis require substantial cost and effort. Regression models (or rating curves) are therefore often used to interpolate water quality data. LOADEST has nine regression models to estimate water quality data, and one regression model needs to be selected automatically or manually. The nine regression models in LOADEST and the auto-selection by LOADEST were evaluated in the study. Suspended solids data were collected from forty-nine stations from the Water Information System of the Ministry of Environment. Suspended solids data from each station were divided into two groups for calibration and validation. Nash-Sutcliffe efficiency (NSE) and the coefficient of determination ($R^2$) were used to evaluate the estimated suspended solid loads. The regression models numbered 1 and 3 in LOADEST provided higher NSE and $R^2$ than the other regression models. The regression models numbered 2, 5, 6, 8, and 9 in LOADEST provided low NSE. In addition, the regression model selected by LOADEST did not necessarily provide better suspended solid estimates than the other regression models did.
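
The two evaluation metrics named in the abstract above can be sketched in a few lines. This is a minimal illustration, assuming the standard definitions of NSE and of $R^2$ as the squared Pearson correlation; the function names and the load values are hypothetical and are not taken from the study.

```python
def nash_sutcliffe(observed, estimated):
    """NSE = 1 - sum((obs - est)^2) / sum((obs - mean(obs))^2)."""
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - e) ** 2 for o, e in zip(observed, estimated))
    sst = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - sse / sst


def r_squared(observed, estimated):
    """Squared Pearson correlation between observed and estimated values."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_est = sum(estimated) / n
    cov = sum((o - mean_obs) * (e - mean_est) for o, e in zip(observed, estimated))
    var_obs = sum((o - mean_obs) ** 2 for o in observed)
    var_est = sum((e - mean_est) ** 2 for e in estimated)
    return cov ** 2 / (var_obs * var_est)


# Hypothetical loads (illustrative values only, not data from the study).
observed = [10.0, 20.0, 30.0, 40.0]
biased = [2.0 * x for x in observed]  # perfectly correlated but twice too large

print(nash_sutcliffe(observed, biased))  # strongly negative: the bias is penalized
print(r_squared(observed, biased))       # 1.0: correlation ignores the bias
```

The example suggests why both metrics are reported together: an estimate can track the observations perfectly in the correlation sense while still being badly biased, which only NSE reveals.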

Factors Related to Regional Variation in the High-risk Drinking Rate in Korea: Using Quantile Regression

  • Kim, Eun-Su;Nam, Hae-Sung
    • Journal of Preventive Medicine and Public Health / Vol. 54, No. 2 / pp.145-152 / 2021
  • Objectives: This study aimed to identify regional differences in the high-risk drinking rate among yearly alcohol users in Korea and to identify relevant regional factors for each quintile using quantile regression. Methods: Data from 227 counties surveyed by the 2017 Korean Community Health Survey (KCHS) were analyzed. The analysis dataset included secondary data extracted from the Korean Statistical Information Service and data from the KCHS. To identify regional factors related to the high-risk drinking rate among yearly alcohol users, quantile regression was conducted by dividing the data into 10%, 30%, 50%, 70%, and 90% quantiles, and multiple linear regression was also performed. Results: The current smoking rate, perceived stress rate, crude divorce rate, and financial independence rate, as well as one's social network, were related to the high-risk drinking rate among yearly alcohol users. The quantile regression revealed that the perceived stress rate was related to all quantiles except for the 90% quantile, and the financial independence rate was related to the 50% to 90% quantiles. The crude divorce rate was related to the high-risk drinking rate among yearly alcohol users in all quantiles. Conclusions: The findings of this study suggest that local health programs for high-risk drinking are needed in areas with high local stress and high crude divorce rates.
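
Quantile regression, as used in the abstract above, fits each quantile by minimizing the check (pinball) loss rather than squared error. The sketch below shows only the intercept-only special case, in which the minimizer is the sample quantile itself; it is not the study's model with regional covariates, and the rates and function names are hypothetical.

```python
def pinball_loss(y, prediction, tau):
    """Average check loss: tau * r for residuals r >= 0, (tau - 1) * r otherwise."""
    total = 0.0
    for value in y:
        residual = value - prediction
        total += tau * residual if residual >= 0 else (tau - 1.0) * residual
    return total / len(y)


def best_constant(y, tau):
    """Among the observed values, return the one with the smallest pinball loss."""
    return min(sorted(y), key=lambda c: pinball_loss(y, c, tau))


# Hypothetical regional rates in percent (illustrative only).
rates = [11.0, 13.0, 14.0, 15.0, 17.0, 18.0, 19.0, 21.0, 24.0]

for tau in (0.1, 0.5, 0.9):
    print(tau, best_constant(rates, tau))
```

Changing `tau` shifts which part of the distribution the fit targets, which is how a covariate can appear related to some quantiles of the outcome and not to others.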

Analysis on the Regional Variation of the Rate of Inpatient Medical Costs in Local-Out: Geographically Weighted Regression Approach

  • 조은경;이광수
    • 보건의료산업학회지 / Vol. 8, No. 2 / pp.11-22 / 2014
  • This study aimed to analyze the regional variation in the local-out rates of inpatient services. Multiple data sources collected from the National Health Insurance Corporation and Statistics Korea were merged to produce the analysis data set. The unit of analysis was the city, Gun, and Gu, and all of them were included in the analysis. The dependent variable measured the local-out rate of inpatient cost in the study regions. Local environments were measured by variables in three dimensions: provider factors, socio-demographic factors, and health status. Along with the traditional ordinary least squares (OLS) regression model, a geographically weighted regression (GWR) model was applied to test their effects. SPSS v21 and ArcMap v10.2 were used for the statistical analysis. Results from the OLS regression showed that most variables had significant relationships with the local-out rate of inpatient services. However, in GWR the regression coefficients of some variables differed in direction depending on the region. This implies that the study variables might not have consistent effects and that their effects may vary by location.
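
The contrast between a single OLS coefficient and location-specific GWR coefficients can be sketched as follows. This is a simplified illustration, assuming one predictor, a Gaussian distance kernel, and a fixed bandwidth; the coordinates, values, and function names are hypothetical, and the study itself used ArcMap rather than this code.

```python
from math import exp, hypot


def weighted_line(x, y, w):
    """Weighted least-squares fit of y = intercept + slope * x."""
    sw = sum(w)
    x_mean = sum(wi * xi for wi, xi in zip(w, x)) / sw
    y_mean = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - x_mean) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - x_mean) * (yi - y_mean) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    return y_mean - slope * x_mean, slope


def gwr_slopes(coords, x, y, bandwidth):
    """Fit one local regression per location, weighting neighbours by distance."""
    slopes = []
    for cx, cy in coords:
        weights = [exp(-0.5 * (hypot(cx - px, cy - py) / bandwidth) ** 2)
                   for px, py in coords]
        slopes.append(weighted_line(x, y, weights)[1])
    return slopes


# Hypothetical regions: three in the west (positive effect), three in the east (negative).
coords = [(0, 0), (1, 0), (2, 0), (10, 0), (11, 0), (12, 0)]
x = [1.0, 2.0, 3.0, 1.0, 2.0, 3.0]
y = [2.0, 4.0, 6.0, 6.0, 4.0, 2.0]

global_slope = weighted_line(x, y, [1.0] * len(x))[1]  # OLS: equal weights
local_slopes = gwr_slopes(coords, x, y, bandwidth=1.0)
print(global_slope)   # close to zero: opposite regional effects cancel out
print(local_slopes)   # positive in the west, negative in the east
```

In this constructed case the global fit reports no relationship at all, while the local fits recover effects of opposite sign, which is the kind of regional variation in coefficient direction that the abstract describes.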

Regional Stem Curve and Volume Function Model of Pinus densiflora in Kangwon-Province

  • 김준순;이우균;변우혁
    • 한국산림과학회지 / Vol. 83, No. 4 / pp.521-530 / 1994
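  • Volume equations are usually expressed as functions of diameter at breast height and tree height, and the equation with the highest accuracy is generally selected through regression analysis. In Korea as well, general volume equations for each species have so far been derived using the power-form equation ($V=aD^bH^c$), with diameter at breast height (D) and height (H) as independent variables. In this study, stem curve equations were derived separately for the Hongcheon, Jeongseon, Myeongju, Wonju, and Yeongwol regions of Kangwon Province, and a stem curve and volume function model was developed that can estimate regional volume directly by integrating the solid of revolution of the stem curve equation. Volumes estimated by region with the developed model were more accurate than volumes estimated with the existing Pinus densiflora volume table for Kangwon Province. In addition, the shape of the stem curve derived from the regional stem curve equations differed by region; in particular, stems in the Yeongwol and Wonju regions were found to be thinner in the upper stem than those in the other regions. This diversity in stem curve form also led to regional differences in volume estimation.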
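
The idea of obtaining volume by integrating the solid of revolution of a stem curve, $V=\pi\int_0^H r(h)^2\,dh$, can be sketched numerically. The taper function below is a hypothetical placeholder, not the stem curve equation derived in the paper; `shape`, the tree dimensions, and the function names are assumptions for illustration only.

```python
from math import pi


def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with an even number of intervals n."""
    if n % 2:
        n += 1
    step = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * step)
    return total * step / 3.0


def stem_volume(dbh, height, shape=1.0):
    """Volume (m^3) of the solid of revolution of a hypothetical stem curve.

    Assumed taper: r(h) = (dbh / 2) * (1 - h / height) ** shape, which is
    NOT the paper's equation. A larger `shape` gives a thinner upper stem.
    """
    def radius(h):
        return (dbh / 2.0) * max(0.0, 1.0 - h / height) ** shape

    return simpson(lambda h: pi * radius(h) ** 2, 0.0, height)


# Two hypothetical trees with the same diameter (0.30 m) and height (15 m).
cone_like = stem_volume(0.30, 15.0, shape=1.0)
slender_top = stem_volume(0.30, 15.0, shape=1.5)
print(cone_like, slender_top)
```

Two trees with identical D and H but different stem forms yield different volumes here, which a single $V=aD^bH^c$ equation cannot distinguish; this is consistent with the regional differences the abstract reports.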


Analysis of Factors Affecting the Smoking Rates Gap between Regions and Evaluation of Relative Efficiency of Smoking Cessation Projects: Using Clustering Analysis and Data Envelopment Analysis

  • 김희년;이다호;정지윤;구여정;정형선
    • 보건행정학회지 / Vol. 30, No. 2 / pp.199-210 / 2020
  • Background: Given the importance of smoking cessation programs in controlling regional disparities in smoking behavior in Korea, this study aims to reveal the variation in the smoking rate and its determinants across 229 provinces. It then evaluates the relative efficiency of smoking cessation programs while taking regional characteristics into account. Methods: The main sources of data are the Korean Statistical Information Service and a national survey on the expenditure of public health centers. Multivariate regression was performed to identify the determinants of regional variation in the smoking rate. Based on the results of the regression model, clustering analysis was conducted to group the 229 regions by their characteristics. Three clusters were generated. Relative efficiency scores were calculated using data envelopment analysis (DEA). Results from the pooled model, which scored the relative efficiency of all 229 provinces in one model, were compared with those from the cluster-separated models. Results: First, the maximum variation in the smoking rate was 16.9%p. Second, the sex ratio, the proportion of the elderly, and high-risk drinking behavior play a significant role in the regional variation of smoking. Third, the population and the proportion of the elderly are the main variables for clustering. Fourth, the relative efficiency results differed between the pooled model and the cluster-separated models, especially for cluster 2. Conclusion: This study identified regional variation in the smoking rate and its determinants at the regional level. The discrepancy in DEA results between the models implies that regional features matter when regional evaluations are performed, especially for public health center programs.

Economic Effects of Establishing a Logistic Free Zone in the Port of Busan

  • 손애휘
    • 한국항해항만학회: Conference Proceedings / Proceedings of the 한국항해항만학회 2000 Autumn Conference / pp.33.2-42 / 2000
  • This study probes the necessity of establishing a logistic free zone in the Port of Busan. It considers the economic effects of establishing the logistic free zone at Busan Port and suggests policy prescriptions for introducing the free zone system and improving the logistics functions of Busan Port. Using input-output table data, the regression analysis provided a quantitative prediction of the effects of making Busan Port a tariff-free zone. Regarding the influence of the free zone system on the regional economy, this research found that strong positive effects should be expected on the Busan regional economy once the logistic free zone is set up at the Port of Busan. The positive economic effects on Busan regional industries might be further strengthened if the value-added logistics function of Busan Port were supplemented by linking it to the hinterland of Busan Port.


Analysis of Regional Fertility Gap Factors Using Explainable Artificial Intelligence

  • 이동우;김미경;윤정윤;류동원;송재욱
    • 산업경영시스템학회지 / Vol. 47, No. 1 / pp.41-50 / 2024
  • Korea is facing a significant problem with historically low fertility rates, which is becoming a major social issue affecting the economy, labor force, and national security. This study analyzes the factors contributing to the regional gap in fertility rates and derives policy implications. The government and local authorities are implementing a range of policies to address the issue of low fertility. To establish an effective strategy, it is essential to identify the primary factors that contribute to regional disparities. This study identifies these factors and explores policy implications through machine learning and explainable artificial intelligence. The study also examines the influence of media and public opinion on childbirth in Korea by incorporating news and online community sentiment, as well as sentiment fear indices, as independent variables. To establish the relationship between regional fertility rates and factors, the study employs four machine learning models: multiple linear regression, XGBoost, Random Forest, and Support Vector Regression. Support Vector Regression, XGBoost, and Random Forest significantly outperform linear regression, highlighting the importance of machine learning models in explaining non-linear relationships with numerous variables. A factor analysis using SHAP is then conducted. The unemployment rate, Regional Gross Domestic Product per Capita, Women's Participation in Economic Activities, Number of Crimes Committed, Average Age of First Marriage, and Private Education Expenses significantly impact regional fertility rates. However, the degree of impact of the factors affecting fertility may vary by region, suggesting the need for policies tailored to the characteristics of each region, not just an overall ranking of factors.

Regional Low Flow Frequency Analysis Using Bayesian Multiple Regression

  • 김상욱;이길성
    • 한국수자원학회논문집 / Vol. 41, No. 3 / pp.325-340 / 2008
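  • This study applied Bayesian multiple regression using the ordinary least squares method to perform regional low flow frequency analysis. To explore its effect in terms of uncertainty, the estimates from Bayesian multiple regression were compared with the confidence intervals of the estimates from ordinary multiple regression computed using the t distribution. The comparison for each return period showed that the mean estimates computed using the t distribution and the mean estimates from Bayesian multiple regression did not differ greatly. In terms of uncertainty, however, the difference between the upper and lower limits of the confidence interval was much smaller with Bayesian multiple regression than with the conventional method. From this it was concluded that Bayesian multiple regression is superior to ordinary regression in representing uncertainty when performing regional low flow frequency analysis. In addition, two ungauged basins in the Nakdong River basin were selected, and the constructed Bayesian multiple regression model was applied to estimate low flow, including its uncertainty, in these ungauged basins, demonstrating that this approach can be effective in characterizing low flow in ungauged basins.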

The Effect of CSR Activities of the Citizens Professional Football Club on Regional Attachment and Expansion of Fans: Focused on Seongnam Football Club, Korea

  • JUNG, Sam Kwon;KWON, Ki Hyun;LEE, Hyuk Jin
    • Journal of Sport and Applied Science / Vol. 5, No. 4 / pp.9-14 / 2021
  • Purpose: This study aims to empirically analyze fans' responses to the types of Corporate Social Responsibility (CSR) activities implemented by the citizens professional football club and seek strategic measures for the continuous growth of the club and the formation of long-term relationships with fans. The purpose of this study is to investigate the relationship between regional attachment and expansion of fans according to the type of CSR activities of the club, and to examine influencing relationships among the types of CSR activities, regional attachment, and expansion of fans. Research design, data, and methodology: To achieve the purpose of the study, the survey was conducted on 150 home spectators of Seongnam Football Club, and the analysis of the data was conducted using SPSS Window Version 21.0. Correlation analysis, simple regression analysis, and multiple regression analysis were conducted to analyze the relationship between regional attachment and expansion of fans according to the types of CSR activities performed by the Seongnam Citizens Football Club. Results: As a result of the analysis, it was found that CSR activities had a statistically significant effect on regional attachment. In addition, CSR activities were found to have a statistically significant effect on expansion of fans. Finally, it was found that regional attachment had a statistically significant effect on the expansion of fans. Conclusions: Based on these results, CSR activities of the professional football club are considered an opportunity to build regional attachment. In addition, it is thought that the expansion of fans can be achieved through CSR activities.

A Study on the Regional Factors Affecting the Death Rates of Cardio-Cerebrovascular Disease Using the Spatial Analysis

  • 박영용;박주현;박유현;이광수
    • 보건행정학회지 / Vol. 30, No. 1 / pp.26-36 / 2020
  • Background: The purpose of this study was to analyze the relationship between regional characteristics and the age-adjusted cardio-cerebrovascular disease mortality rates (SCDMR) in 229 si·gun·gu administrative regions. Methods: The SCDMR of men and women, taken from the 2017 cause-of-death statistics, was used as the dependent variable. As representative indices of regional characteristics, health behavior factors, socio-demographic and economic factors, physical environment factors, and health care factors were selected as independent variables. Ordinary least squares (OLS) regression and geographically weighted regression (GWR) were performed to identify their relationship. Results: OLS analysis identified the following significant factors affecting the mortality rates of cardio-cerebrovascular disease: high-risk drinking rates, the ratio of elderly people living alone, financial independence, and walking practice rates. GWR analysis showed that the regression coefficients varied by region and that the directions of influence of the independent variables on the dependent variable were mixed. GWR showed higher adjusted R2 and Akaike information criterion values than those of OLS. Conclusion: Where spatial heterogeneity is present, as in Korea, it is appropriate to use the GWR model to estimate the influence of regional characteristics. The results of the GWR model therefore suggest that customized health policies and projects need to be established for each region, taking its socio-economic characteristics into account.