• 제목/요약/키워드: Bayesian linear regression model

검색결과 60건 처리시간 0.025초

연속 강우-유출모형의 매개변수 지역화에 관한 연구 (A Study on Regionalization of Parameters of Continuous Rainfall-Runoff Model)

  • 정가인;김태정;권현한
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2015년도 학술발표회
    • /
    • pp.182-182
    • /
    • 2015
  • 우리나라에서는 강우관측시스템의 지역적 불균형으로 상대적으로 소규모 저수지의 경우 미계측유역의 특성을 가지며, 신뢰성 있는 강우량, 유출량, 증발량 자료가 매우 부족한 실정이다. 다목적댐 유역과 같은 계측유역의 경우 상류유역의 유입량 자료의 확보가 용이하지만 대부분의 유역의 경우 계측장비가 부족하여 신뢰성이 확보된 유입량 자료를 얻는데 많은 어려움이 있다. 본 연구에서는 미계측유역의 유입량 산정을 위하여 계측유역을 대상으로 강우-유출 모형의 매개변수를 산정하였으며, 산정된 매개변수를 유역특성인자와의 상관성을 토대로 다중선형회귀분석기법(multiple linear regression, MLR)을 적용하여 지역화(regionalization)를 위한 회귀식을 도출하였다. 이를 위해 양질의 유량자료가 확보된 K-water 17개 댐 유역을 대상으로 매개변수를 산정하였으며 이 중 2개의 댐 유역을 미계측유역으로 간주하여 개발된 모형을 검증하였다. 대부분의 통계 지표에서 우수한 모의능력을 확인하였으며, 본 연구를 통하여 개발된 지역화 기법을 미계측유역에 활용한다면 보다 정량적이고 효율적인 수자원 계획이 가능할 것으로 판단된다. 향후 연구로는 불확실성을 고려한 Bayesian GLM 모형을 이용한 지역화기법을 개발하여 매개변수의 불확실성까지 고려할 수 있는 방안을 모색하고자 한다.

  • PDF

산림재적 추정을 위한 계층적 베이지안 분석 (Hierarchical Bayesian analysis for a forest stand volume)

  • 송세리;박주원;김용구
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권1호
    • /
    • pp.29-37
    • /
    • 2017
  • 산림경영 계획을 위한 필요한 산림재적을 보다 효율적으로 추정하기 위해서 다양한 연구가 요구되어져 왔는데, 이러한 산림구조에 관한 연구는 주로 현장조사와 위성영상을 이용하여 이루어진다. 현장조사를 통한 연구는 비교적 정확하나 시간과 비용이 많이 들 뿐 아니라 접근의 용이성이 떨어지는 지역이 있기 때문에, 넓은 지역의 조사가 어렵다는 단점이 있다. 최근에는 항공기에서 발사된 레이저 펄스가 반사되어 돌아오는 시간을 측정하여 대상의 3차원 좌표를 얻는 LiDAR (Light Detection and Ranging) 기술을 활용하여 획득한 정밀한 수치형자료를 이용한 산림의 구조에 관한 연구가 이루어지고 있다. 일반적으로 산림재적을 추정하기 위해서 LiDAR자료를 이용한 수고자료와 산림 재적에 대한 회귀모형의 중요성이 점차 높아지는데, 국내의 경우 수목의 종류와 그 분포가 다르기 때문에 회귀모형만으로 재적을 추정하는 데 한계가 있다. 따라서 본 논문에서는 산림의 수고와 흉고직경을 측정하여 재적값을 추정하고 산림의 공간효과를 고려한 계층적 베이지안 분석을 통해 관측되지 않은 전체 산림재적에 대한 추정을 하고자 한다.

Inclusion of bioclimatic variables in genetic evaluations of dairy cattle

  • Negri, Renata;Aguilar, Ignacio;Feltes, Giovani Luis;Machado, Juliana Dementshuk;Neto, Jose Braccini;Costa-Maia, Fabiana Martins;Cobuci, Jaime Araujo
    • Animal Bioscience
    • /
    • 제34권2호
    • /
    • pp.163-171
    • /
    • 2021
  • Objective: Considering the importance of dairy farming and the negative effects of heat stress, more tolerant genotypes need to be identified. The objective of this study was to investigate the effect of heat stress via temperature-humidity index (THI) and diurnal temperature variation (DTV) in the genetic evaluations for daily milk yield of Holstein dairy cattle, using random regression models. Methods: The data comprised 94,549 test-day records of 11,294 first parity Holstein cows from Brazil, collected from 1997 to 2013, and bioclimatic data (THI and DTV) from 18 weather stations. Least square linear regression models were used to determine the THI and DTV thresholds for milk yield losses caused by heat stress. In addition to the standard model (SM, without bioclimatic variables), THI and DTV were combined in various ways and tested for different days, totaling 41 models. Results: The THI and DTV thresholds for milk yield losses was THI = 74 (-0.106 kg/d/THI) and DTV = 13 (-0.045 kg/d/DTV). The model that included THI and DTV as fixed effects, considering the two-day average, presented better fit (-2logL, Akaike information criterion, and Bayesian information criterion). The estimated breeding values (EBVs) and the reliabilities of the EBVs improved when using this model. Conclusion: Sires are re-ranking when heat stress indicators are included in the model. Genetic evaluation using the mean of two days of THI and DTV as fixed effect, improved EBVs and EBVs reliability.

공간예측모형에 기반한 산사태 취약성 지도 작성과 품질 평가 (Mapping Landslide Susceptibility Based on Spatial Prediction Modeling Approach and Quality Assessment)

  • 알-마문;박현수;장동호
    • 한국지형학회지
    • /
    • 제26권3호
    • /
    • pp.53-67
    • /
    • 2019
  • The purpose of this study is to identify the quality of landslide susceptibility in a landslide-prone area (Jinbu-myeon, Gangwon-do, South Korea) by spatial prediction modeling approach and compare the results obtained. For this goal, a landslide inventory map was prepared mainly based on past historical information and aerial photographs analysis (Daum Map, 2008), as well as some field observation. Altogether, 550 landslides were counted at the whole study area. Among them, 182 landslides are debris flow and each group of landslides was constructed in the inventory map separately. Then, the landslide inventory was randomly selected through Excel; 50% landslide was used for model analysis and the remaining 50% was used for validation purpose. Total 12 contributing factors, such as slope, aspect, curvature, topographic wetness index (TWI), elevation, forest type, forest timber diameter, forest crown density, geology, landuse, soil depth, and soil drainage were used in the analysis. Moreover, to find out the co-relation between landslide causative factors and incidents landslide, pixels were divided into several classes and frequency ratio for individual class was extracted. Eventually, six landslide susceptibility maps were constructed using the Bayesian Predictive Discriminant (BPD), Empirical Likelihood Ratio (ELR), and Linear Regression Method (LRM) models based on different category dada. Finally, in the cross validation process, landslide susceptibility map was plotted with a receiver operating characteristic (ROC) curve and calculated the area under the curve (AUC) and tried to extract success rate curve. The result showed that Bayesian, likelihood and linear models were of 85.52%, 85.23%, and 83.49% accuracy respectively for total data. Subsequently, in the category of debris flow landslide, results are little better compare with total data and its contained 86.33%, 85.53% and 84.17% accuracy. It means all three models were reasonable methods for landslide susceptibility analysis. The models have proved to produce reliable predictions for regional spatial planning or land-use planning.

Application of deep learning with bivariate models for genomic prediction of sow lifetime productivity-related traits

  • Joon-Ki Hong;Yong-Min Kim;Eun-Seok Cho;Jae-Bong Lee;Young-Sin Kim;Hee-Bok Park
    • Animal Bioscience
    • /
    • 제37권4호
    • /
    • pp.622-630
    • /
    • 2024
  • Objective: Pig breeders cannot obtain phenotypic information at the time of selection for sow lifetime productivity (SLP). They would benefit from obtaining genetic information of candidate sows. Genomic data interpreted using deep learning (DL) techniques could contribute to the genetic improvement of SLP to maximize farm profitability because DL models capture nonlinear genetic effects such as dominance and epistasis more efficiently than conventional genomic prediction methods based on linear models. This study aimed to investigate the usefulness of DL for the genomic prediction of two SLP-related traits; lifetime number of litters (LNL) and lifetime pig production (LPP). Methods: Two bivariate DL models, convolutional neural network (CNN) and local convolutional neural network (LCNN), were compared with conventional bivariate linear models (i.e., genomic best linear unbiased prediction, Bayesian ridge regression, Bayes A, and Bayes B). Phenotype and pedigree data were collected from 40,011 sows that had husbandry records. Among these, 3,652 pigs were genotyped using the PorcineSNP60K BeadChip. Results: The best predictive correlation for LNL was obtained with CNN (0.28), followed by LCNN (0.26) and conventional linear models (approximately 0.21). For LPP, the best predictive correlation was also obtained with CNN (0.29), followed by LCNN (0.27) and conventional linear models (approximately 0.25). A similar trend was observed with the mean squared error of prediction for the SLP traits. Conclusion: This study provides an example of a CNN that can outperform against the linear model-based genomic prediction approaches when the nonlinear interaction components are important because LNL and LPP exhibited strong epistatic interaction components. Additionally, our results suggest that applying bivariate DL models could also contribute to the prediction accuracy by utilizing the genetic correlation between LNL and LPP.

Refractive-index Prediction for High-refractive-index Optical Glasses Based on the B2O3-La2O3-Ta2O5-SiO2 System Using Machine Learning

  • Seok Jin Hong;Jung Hee Lee;Devarajulu Gelija;Woon Jin Chung
    • Current Optics and Photonics
    • /
    • 제8권3호
    • /
    • pp.230-238
    • /
    • 2024
  • The refractive index is a key material-design parameter, especially for high-refractive-index glasses, which are used for precision optics and devices. Increased demand for high-precision optical lenses produced by the glass-mold-press (GMP) process has spurred extensive studies of proper glass materials. B2O3, SiO2, and multiple heavy-metal oxides such as Ta2O5, Nb2O5, La2O3, and Gd2O3 mostly compose the high-refractive-index glasses for GMP. However, due to many oxides including up to 10 components, it is hard to predict the refractivity solely from the composition of the glass. In this study, the refractive index of optical glasses based on the B2O3-La2O3-Ta2O5-SiO2 system is predicted using machine learning (ML) and compared to experimental data. A dataset comprising up to 271 glasses with 10 components is collected and used for training. Various ML algorithms (linear-regression, Bayesian-ridge-regression, nearest-neighbor, and random-forest models) are employed to train the data. Along with composition, the polarizability and density of the glasses are also considered independent parameters to predict the refractive index. After obtaining the best-fitting model by R2 value, the trained model is examined alongside the experimentally obtained refractive indices of B2O3-La2O3-Ta2O5-SiO2 quaternary glasses.

Improvement of inspection system for common crossings by track side monitoring and prognostics

  • Sysyn, Mykola;Nabochenko, Olga;Kovalchuk, Vitalii;Gruen, Dimitri;Pentsak, Andriy
    • Structural Monitoring and Maintenance
    • /
    • 제6권3호
    • /
    • pp.219-235
    • /
    • 2019
  • Scheduled inspections of common crossings are one of the main cost drivers of railway maintenance. Prognostics and health management (PHM) approach and modern monitoring means offer many possibilities in the optimization of inspections and maintenance. The present paper deals with data driven prognosis of the common crossing remaining useful life (RUL) that is based on an inertial monitoring system. The problem of scheduled inspections system for common crossings is outlined and analysed. The proposed analysis of inertial signals with the maximal overlap discrete wavelet packet transform (MODWPT) and Shannon entropy (SE) estimates enable to extract the spectral features. The relevant features for the acceleration components are selected with application of Lasso (Least absolute shrinkage and selection operator) regularization. The features are fused with time domain information about the longitudinal position of wheels impact and train velocities by multivariate regression. The fused structural health (SH) indicator has a significant correlation to the lifetime of crossing. The RUL prognosis is performed on the linear degradation stochastic model with recursive Bayesian update. Prognosis testing metrics show the promising results for common crossing inspection scheduling improvement.

공간적 연관구조를 고려한 총범죄 자료 분석 (Analysis of Total Crime Count Data Based on Spatial Association Structure)

  • 최정순;박만식;원유복;김학열;허태영
    • 응용통계연구
    • /
    • 제23권2호
    • /
    • pp.335-344
    • /
    • 2010
  • 공간자료분석에서 공간적 상관성을 배제한 일반적인 회귀모형을 통한 모수 추정값들은 신뢰성의 문제가 지적 되어 오고 있다. 본 연구에서는 공간자료의 상관성을 고려한 모형을 구축하기 위하여 일변량 조건부자기회귀모형을 이용하였으며 베이지안 기법을 통하여 모수를 추정하고 공간상관성이 고려된 공간 가산자료모형과 고려되지 않은 일반 가산자료모형을 비교하였다. 연구 대상으로는 서울시의 25개 행정자치구별 총범죄 자료를 이용하였으며 자료분석을 통하여 도시계획과 같은 국가 정책의 수립에 참고자료로 활용될 수 있으리라 판단된다.

Allometric equation for estimating aboveground biomass of Acacia-Commiphora forest, southern Ethiopia

  • Wondimagegn Amanuel;Chala Tadesse;Moges Molla;Desalegn Getinet;Zenebe Mekonnen
    • Journal of Ecology and Environment
    • /
    • 제48권2호
    • /
    • pp.196-206
    • /
    • 2024
  • Background: Most of the biomass equations were developed using sample trees collected mainly from pan-tropical and tropical regions that may over- or underestimate biomass. Site-specific models would improve the accuracy of the biomass estimates and enhance the country's measurement, reporting, and verification activities. The aim of the study is to develop site-specific biomass estimation models and validate and evaluate the existing generic models developed for pan-tropical forest and newly developed allometric models. Total of 140 trees was harvested from each diameter class biomass model development. Data was analyzed using SAS procedures. All relevant statistical tests (normality, multicollinearity, and heteroscedasticity) were performed. Data was transformed to logarithmic functions and multiple linear regression techniques were used to develop model to estimate aboveground biomass (AGB). The root mean square error (RMSE) was used for measuring model bias, precision, and accuracy. The coefficient of determination (R2 and adjusted [adj]-R2), the Akaike Information Criterion (AIC) and the Schwarz Bayesian information Criterion was employed to select most appropriate models. Results: For the general total AGB models, adj-R2 ranged from 0.71 to 0.85, and model 9 with diameter at stump height at 10 cm (DSH10), ρ and crown width (CW) as predictor variables, performed best according to RMSE and AIC. For the merchantable stem models, adj-R2 varied from 0.73 to 0.82, and model 8) with combination of ρ, diameter at breast height and height (H), CW and DSH10 as predictor variables, was best in terms of RMSE and AIC. The results showed that a best-fit model for above-ground biomass of tree components was developed. AGBStem = exp {-1.8296 + 0.4814 natural logarithm (Ln) (ρD2H) + 0.1751 Ln (CW) + 0.4059 Ln (DSH30)} AGBBranch = exp {-131.6 + 15.0013 Ln (ρD2H) + 13.176 Ln (CW) + 21.8506 Ln (DSH30)} AGBFoliage = exp {-0.9496 + 0.5282 Ln (DSH30) + 2.3492 Ln (ρ) + 0.4286 Ln (CW)} AGBTotal = exp {-1.8245 + 1.4358 Ln (DSH30) + 1.9921 Ln (ρ) + 0.6154 Ln (CW)} Conclusions: The results demonstrated that the development of local models derived from an appropriate sample of representative species can greatly improve the estimation of total AGB.

Genomic partitioning of growth traits using a high-density single nucleotide polymorphism array in Hanwoo (Korean cattle)

  • Park, Mi Na;Seo, Dongwon;Chung, Ki-Yong;Lee, Soo-Hyun;Chung, Yoon-Ji;Lee, Hyo-Jun;Lee, Jun-Heon;Park, Byoungho;Choi, Tae-Jeong;Lee, Seung-Hwan
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제33권10호
    • /
    • pp.1558-1565
    • /
    • 2020
  • Objective: The objective of this study was to characterize the number of loci affecting growth traits and the distribution of single nucleotide polymorphism (SNP) effects on growth traits, and to understand the genetic architecture for growth traits in Hanwoo (Korean cattle) using genome-wide association study (GWAS), genomic partitioning, and hierarchical Bayesian mixture models. Methods: GWAS: A single-marker regression-based mixed model was used to test the association between SNPs and causal variants. A genotype relationship matrix was fitted as a random effect in this linear mixed model to correct the genetic structure of a sire family. Genomic restricted maximum likelihood and BayesR: A priori information included setting the fixed additive genetic variance to a pre-specified value; the first mixture component was set to zero, the second to 0.0001×σ2g, the third 0.001×σ2g, and the fourth to 0.01×σ2g. BayesR fixed a priori information was not more than 1% of the genetic variance for each of the SNPs affecting the mixed distribution. Results: The GWAS revealed common genomic regions of 2 Mb on bovine chromosome 14 (BTA14) and 3 had a moderate effect that may contain causal variants for body weight at 6, 12, 18, and 24 months. This genomic region explained approximately 10% of the variance against total additive genetic variance and body weight heritability at 12, 18, and 24 months. BayesR identified the exact genomic region containing causal SNPs on BTA14, 3, and 22. However, the genetic variance explained by each chromosome or SNP was estimated to be very small compared to the total additive genetic variance. Causal SNPs for growth trait on BTA14 explained only 0.04% to 0.5% of the genetic variance Conclusion: Segregating mutations have a moderate effect on BTA14, 3, and 19; many other loci with small effects on growth traits at different ages were also identified.