• Title/Summary/Keyword: Weighted least squares method

Search Result 94, Processing Time 0.024 seconds

Genetic Contribution of Indigenous Yakutian Cattle to Two Hybrid Populations, Revealed by Microsatellite Variation

  • Li, M.H.;Nogovitsina, E.;Ivanova, Z.;Erhardt, G.;Vilkki, J.;Popov, R.;Ammosov, I.;Kiselyova, T.;Kantanen, J.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.18 no.5
    • /
    • pp.613-619
    • /
    • 2005
  • Indigenous Yakutian cattle' adaptation to the hardest subarctic conditions makes them a valuable genetic resource for cattle breeding in the Siberian area. Since early last century, crossbreeding between native Yakutian cattle and imported Simmental and Kholmogory breeds has been widely adopted. In this study, variations at 22 polymorphic microsatellite loci in 5 populations of Yakutian, Kholmogory, Simmental, Yakutian-Kholmogory and Yakutian-Simmental cattle were analysed to estimate the genetic contribution of Yakutian cattle to the two hybrid populations. Three statistical approaches were used: the weighted least-squares (WLS) method which considers all allele frequencies; a recently developed implementation of a Markov chain Monte Carlo (MCMC) method called likelihood-based estimation of admixture (LEA); and a model-based Bayesian admixture analysis method (STRUCTURE). At population-level admixture analyses, the estimate based on the LEA was consistent with that obtained by the WLS method. Both methods showed that the genetic contribution of the indigenous Yakutian cattle in Yakutian-Kholmogory was small (9.6% by the LEA and 14.2% by the WLS method). In the Yakutian-Simmental population, the genetic contribution of the indigenous Yakutian cattle was considerably higher (62.8% by the LEA and 56.9% by the WLS method). Individual-level admixture analyses using STRUCTURE proved to be more informative than the multidimensional scaling analysis (MDSA) based on individual-based genetic distances. Of the 9 Yakutian-Simmental animals studied, 8 showed admixed origin, whereas of the 14 studied Yakutian-Kholmogory animals only 2 showed Yakutian ancestry (>5%). The mean posterior distributions of individual admixture coefficient (q) varied greatly among the samples in both hybrid populations. This study revealed a minor existing contribution of the Yakutian cattle in the Yakutian-Kholmogory hybrid population, but in the Yakutian-Simmental hybrid population, a major genetic contribution of the Yakutian cattle was seen. The results reflect the different crossbreeding patterns used in the development of the two hybrid populations. Additionally, molecular evidence for differences among individual admixture proportions was seen in both hybrid populations, resulting from the stochastic process in crossing over generations.

Doubly-robust Q-estimation in observational studies with high-dimensional covariates (고차원 관측자료에서의 Q-학습 모형에 대한 이중강건성 연구)

  • Lee, Hyobeen;Kim, Yeji;Cho, Hyungjun;Choi, Sangbum
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.3
    • /
    • pp.309-327
    • /
    • 2021
  • Dynamic treatment regimes (DTRs) are decision-making rules designed to provide personalized treatment to individuals in multi-stage randomized trials. Unlike classical methods, in which all individuals are prescribed the same type of treatment, DTRs prescribe patient-tailored treatments which take into account individual characteristics that may change over time. The Q-learning method, one of regression-based algorithms to figure out optimal treatment rules, becomes more popular as it can be easily implemented. However, the performance of the Q-learning algorithm heavily relies on the correct specification of the Q-function for response, especially in observational studies. In this article, we examine a number of double-robust weighted least-squares estimating methods for Q-learning in high-dimensional settings, where treatment models for propensity score and penalization for sparse estimation are also investigated. We further consider flexible ensemble machine learning methods for the treatment model to achieve double-robustness, so that optimal decision rule can be correctly estimated as long as at least one of the outcome model or treatment model is correct. Extensive simulation studies show that the proposed methods work well with practical sample sizes. The practical utility of the proposed methods is proven with real data example.

Improvement of Rating Curve Fitting Considering Variance Function with Pseudo-likelihood Estimation (의사우도추정법에 의한 분산함수를 고려한 수위-유량 관계 곡선 산정법 개선)

  • Lee, Woo-Seok;Kim, Sang-Ug;Chung, Eun-Sung;Lee, Kil-Seong
    • Journal of Korea Water Resources Association
    • /
    • v.41 no.8
    • /
    • pp.807-823
    • /
    • 2008
  • This paper presents a technique for estimating discharge rating curve parameters. In typical practical applications, the original non-linear rating curve is transformed into a simple linear regression model by log-transforming the measurement without examining the effect of log transformation. The model of pseudo-likelihood estimation is developed in this study to deal with heteroscedasticity of residuals in the original non-linear model. The parameters of rating curves and variance functions of errors are simultaneously estimated by the pseudo-likelihood estimation(P-LE) method. Simulated annealing, a global optimization technique, is adapted to minimize the log likelihood of the weighted residuals. The P-LE model was then applied to a hypothetical site where stage-discharge data were generated by incorporating various errors. Results of the P-LE model show reduced error values and narrower confidence intervals than those of the common log-transform linear least squares(LT-LR) model. Also, the limit of water levels for segmentation of discharge rating curve is estimated in the process of P-LE using the Heaviside function. Finally, model performance of the conventional log-transformed linear regression and the developed model, P-LE are computed and compared. After statistical simulation, the developed method is then applied to the real data sets from 5 gauge stations in the Geum River basin. It can be suggested that this developed strategy is applied to real sites to successfully determine weights taking into account error distributions from the observed discharge data.

Ordinary Kriging of Daily Mean SST (Sea Surface Temperature) around South Korea and the Analysis of Interpolation Accuracy (정규크리깅을 이용한 우리나라 주변해역 일평균 해수면온도 격자지도화 및 내삽정확도 분석)

  • Ahn, Jihye;Lee, Yangwon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.40 no.1
    • /
    • pp.51-66
    • /
    • 2022
  • SST (Sea Surface Temperature) is based on the atmosphere-ocean interaction, one of the most important mechanisms for the Earth system. Because it is a crucial oceanic and meteorological factor for understanding climate change, gap-free grid data at a specific spatial and temporal resolution is beneficial in SST studies. This paper examined the production of daily SST grid maps from 137 stations in 2020 through the ordinary kriging with variogram optimization and their accuracy assessment. The variogram optimization was achieved by WLS (Weighted Least Squares) method, and the blind tests for the interpolation accuracy assessment were conducted by an objective and spatially unbiased sampling scheme. The four-round blind tests showed a pretty high accuracy: a root mean square error between 0.995 and 1.035℃ and a correlation coefficient between 0.981 and 0.982. In terms of season, the accuracy in summer was a bit lower, presumably because of the abrupt change in SST affected by the typhoon. The accuracy was better in the far seas than in the near seas. West Sea showed better accuracy than East or South Sea. It is because the semi-enclosed sea in the near seas can have different physical characteristics. The seasonal and regional factors should be considered for accuracy improvement in future work, and the improved SST can be a member of the SST ensemble around South Korea.