• Title/Summary/Keyword: rank regression

Search Result 289, Processing Time 0.023 seconds

Random Regression Models Are Suitable to Substitute the Traditional 305-Day Lactation Model in Genetic Evaluations of Holstein Cattle in Brazil

  • Padilha, Alessandro Haiduck;Cobuci, Jaime Araujo;Costa, Claudio Napolis;Neto, Jose Braccini
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.29 no.6
    • /
    • pp.759-767
    • /
    • 2016
  • The aim of this study was to compare two random regression models (RRM) fitted by fourth ($RRM_4$) and fifth-order Legendre polynomials ($RRM_5$) with a lactation model (LM) for evaluating Holstein cattle in Brazil. Two datasets with the same animals were prepared for this study. To apply test-day RRM and LMs, 262,426 test day records and 30,228 lactation records covering 305 days were prepared, respectively. The lowest values of Akaike's information criterion, Bayesian information criterion, and estimates of the maximum of the likelihood function (-2LogL) were for $RRM_4$. Heritability for 305-day milk yield (305MY) was 0.23 ($RRM_4$), 0.24 ($RRM_5$), and 0.21 (LM). Heritability, additive genetic and permanent environmental variances of test days on days in milk was from 0.16 to 0.27, from 3.76 to 6.88 and from 11.12 to 20.21, respectively. Additive genetic correlations between test days ranged from 0.20 to 0.99. Permanent environmental correlations between test days were between 0.07 and 0.99. Standard deviations of average estimated breeding values (EBVs) for 305MY from $RRM_4$ and $RRM_5$ were from 11% to 30% higher for bulls and around 28% higher for cows than that in LM. Rank correlations between RRM EBVs and LM EBVs were between 0.86 to 0.96 for bulls and 0.80 to 0.87 for cows. Average percentage of gain in reliability of EBVs for 305-day yield increased from 4% to 17% for bulls and from 23% to 24% for cows when reliability of EBVs from RRM models was compared to those from LM model. Random regression model fitted by fourth order Legendre polynomials is recommended for genetic evaluations of Brazilian Holstein cattle because of the higher reliability in the estimation of breeding values.

Expression Profiles of Loneliness-associated Genes for Survival Prediction in Cancer Patients

  • You, Liang-Fu;Yeh, Jia-Rong;Su, Mu-Chun
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.1
    • /
    • pp.185-190
    • /
    • 2014
  • Influence of loneliness on human survival has been established epidemiologically, but genomic research remains undeveloped. We identified 34 loneliness-associated genes which were statistically significant for high-lonely and low-lonely individuals. With the univariate Cox proportional hazards regression model, we obtained corresponding regression coefficients for loneliness-associated genes fo individual cancer patients. Furthermore, risk scores could be generated with the combination of gene expression level multiplied by corresponding regression coefficients of loneliness-associated genes. We verified that high-risk score cancer patients had shorter mean survival time than their low-risk score counterparts. Then we validated the loneliness-associated gene signature in three independent brain cancer cohorts with Kaplan-Meier survival curves (n=77, 85 and 191), significantly separable by log-rank test with hazard ratios (HR) >1 and p-values <0.0001 (HR=2.94, 3.82, and 1.78). Moreover, we validated the loneliness-associated gene signature in bone cancer (HR=5.10, p-value=4.69e-3), lung cancer (HR=2.86, p-value=4.71e-5), ovarian cancer (HR=1.97, p-value=3.11e-5), and leukemia (HR=2.06, p-value=1.79e-4) cohorts. The last lymphoma cohort proved to have an HR=3.50, p-value=1.15e-7. Loneliness-associated genes had good survival prediction for cancer patients, especially bone cancer patients. Our study provided the first indication that expression of loneliness-associated genes are related to survival time of cancer patients.

Development of Approximate Cost Estimate Model for Aqueduct Bridges Restoration - Focusing on Comparison between Regression Analysis and Case-Based Reasoning - (수로교 개보수를 위한 개략공사비 산정 모델 개발 - 회귀분석과 사례기반추론의 비교를 중심으로 -)

  • Jeon, Geon Yeong;Cho, Jae Yong;Huh, Young
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.33 no.4
    • /
    • pp.1693-1705
    • /
    • 2013
  • To restore old aqueduct in Korea which is a irrigation bridge to supply water in paddy field area, it is needed to estimate approximate costs of restoration because the basic design for estimation of construction costs is often ruled out in current system. In this paper, estimating models of construction costs were developed on the basis of performance data for restoration of RC aqueduct bridges since 2003. The regression analysis (RA) model and case-based reasoning (CBR) model for the estimation of construction costs were developed respectively. Error rate of simple RA model was lower than that of multiple RA model. CBR model using genetic algorithm (GA) has been applied in the estimation of construction costs. In the model three factors like attribute weight, attribute deviation and rank of case similarity were optimized. Especially, error rate of estimated construction costs decreased since limit ranges of the attribute weights were applied. The results showed that error rates between RA model and CBR models were inconsiderable statistically. It is expected that the proposed estimating method of approximate costs of aqueduct restoration will be utilized to support quick decision making in phased rehabilitation project.

The Effect of the Incomplete Lactation Records for Genetic Evaluations with Random Regression Test-Day Models (RRTDM) in Holstein Cattle (불완전 검정일 기록이 RRTDM을 이용한 홀스타인 젖소의 유전평가에 미치는 영향)

  • Cho, J.H.;Cho, K.H.;Lee, K.J.
    • Journal of Animal Science and Technology
    • /
    • v.47 no.2
    • /
    • pp.147-158
    • /
    • 2005
  • The purpose of this study was to find out the effects that daughters' incomplete lactation records affect sire's breeding values through genetic evaluation using RRTDM(random regression test-day model). First, we estimated genetic parameters and breeding values on sires having complete lactation records of daughter by RRTDM, second, we changed complete lactation records of specific sires into incomplete records by various methods. Third, the breeding values were compared between complete and incomplete records. Finally, this study aimed to find out the methods to minimize the estimation errors of young bulls' breeding values. Data used in this study were collected from the dairy herd improvement program, and a total of 97,562 records were composed of 10,929 first parity with both parents known, since 1999. Breeding values on the daughters from randomly chosen sires were calculated and compared with among 90 day, 150day, and 200 day's incomplete records. For milk yields, sire's ranks of breeding values used by complete lactation records were very different from sire's ranks of breeding values obtained by incomplete lactation records(Rank_90 cut, 150cut, 200 cut).The differences were also obtained between complete lactation records(per305_full) and incomplete lactation record (per_90 cut, 150cut, 200 cut) in breeding values regarding persistency. Especially, the differences between per_90 cut and per305_full were very large(from 1.8 kg to 145kg).

Survey of the use of statistical methods in Journal of the Korean Association of Oral and Maxillofacial Surgeons

  • Choi, Yong-Geun
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • v.44 no.1
    • /
    • pp.25-28
    • /
    • 2018
  • Objectives: This study aimed to describe recent patterns in the types of statistical test used in original articles that were published in Journal of the Korean Association of Oral and Maxillofacial Surgeons. Materials and Methods: Thirty-six original articles published in the Journal in 2015 and 2016 were ascertained. The type of statistical test was identified by one researcher. Descriptive statistics, such as frequency, rank, and proportion, were calculated. Graphical statistics, such as a histogram, were constructed to reveal the overall utilization pattern of statistical test types. Results: Twenty-two types of statistical test were used. Statistical test type was not reported in four original articles and classified as unclear in 5%. The four most frequently used statistical tests constituted 47% of the total tests and these were the chi-square test, Student's t-test, Fisher's exact test, and Mann-Whitney test in descending order. Regression models, such as the Cox proportional hazard model and multiple logistic regression to adjust for potential confounding variables, were used in only 6% of the studies. Normality tests, including the Kolmogorov-Smirnov test, Levene test, Shapiro-Wilk test, and $Scheff{\acute{e}}^{\prime}s$ test, were used diversely but in only 10% of the studies. Conclusion: A total of 22 statistical tests were identified, with four tests occupying almost half of the results. Adoption of a nonparametric test is recommended when the status of normality is vague. Adjustment for confounding variables should be pursued using a multiple regression model when the number of potential confounding variables is numerous.

An Efficiency Analysis of Public Enterprises Using Bootstrap DEA (부트스트랩 DEA를 이용한 공기업 효율성 분석)

  • Park, Man Hee
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.5
    • /
    • pp.475-487
    • /
    • 2015
  • This study measures the managerial efficiency of Korea's 14 public enterprises using bootstrap DEA in 2013. In addition, it examines the factors that affect on the bootstrap bias-corrected efficiency using truncated regression analysis. The results and implications of this study are as follows. First, using bootstrap DEA model analysis, the results showed that the mean technical efficiency was 0.3182, the mean pure technical efficiency was 0.4994 and the mean scale efficiency was 0.6585. The main cause of technical inefficiency was due to pure technical inefficiency. Second, rank test between technical efficiency of general DEA model and bootstrap DEA model was no significant difference under CRS and VRS assumption. Third, the main cause of the inefficiency in 11 DMUs among 14 DMUs were mainly due to the pure technology and three DMUs were because of the scale efficiency. Finally, in the truncated regression analysis, cost of labor, profit, sales, return of equity, and the number of employees appeared as factors affecting the scale efficiency at the 10% significance level.

An Effect of Personal Assistance Services for the Disabled Persons upon the Burdens of Raising a Family - Focusing on Family Resilience Control Effect - (활동보조서비스가 가족부양부담에 미치는 영향 -가족탄력성 조절효과-)

  • Shin, Jun Ok
    • 재활복지
    • /
    • v.18 no.4
    • /
    • pp.95-117
    • /
    • 2014
  • This study aims to examine the effects of personal assistance services(physical activity support, homemaking activity support, social activity support) on caregiver burden and determine whether family resilience(family belief system, family cohesion, interaction) has a moderating effect between personal assistance services and caregiver burden, thereby presenting a reference data which can be used to seek a practical measure for handicapped welfare. This study was conducted on 200 primary caregivers with disabled family members of rank 1 or 2 in east, west, south, and north Gyeonggi-do using personal assistance services. Data was collected in 2013 from April 1 to May 15, and was analyzed using the SPSS 19.0 statistics program in which a moderated multiple regression analysis based on exploratory factor analysis, confirmatory factor analysis, and hierarchical regression analysis was performed. The primary conclusions of this study were as follows; First, the use of physical activity support was showed to have a positive effect in reducing family burden related to disabled care. Second, personal assistance services exhibit significant moderator effects related to family burden in family belief systems and family cohesion.

Increased Argonaute 2 Expression in Gliomas and its Association with Tumor Progression and Poor Prognosis

  • Feng, Bo;Hu, Peng;Lu, Shu-Jun;Chen, Jin-Bo;Ge, Ru-Li
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.9
    • /
    • pp.4079-4083
    • /
    • 2014
  • Background: Previous studies have showed that argonaute 2 is a potential factor related to genesis of several cancers, however, there have been no reports concerning gliomas. Methods: Paraffin specimens of 129 brain glioma cases were collected from a hospital affiliated to Binzhou Medical University from January 2008 to July 2013. We examined both argonaute 2 mRNA and protein expression by real-time quantitative PCR (qRT-PCR), Western blot analysis, and immunohistochemistry (IHC). The survival curves of the patients were determined using the Kaplan-Meier method and Cox regression, and the log-rank test was used for statistical evaluations. Results: Both argonaute 2 mRNA and protein were upregulated in high-grade when compared to low-grade tumor tissues. Multivariate analysis revealed that argonaute 2 protein expression was independently associated with the overall survival (HR=4.587, 95% CI: 3.001-6.993; P=0.002), and that argonaute 2 protein expression and WHO grading were independent prognostic factors for progression-free survival (HR=4.792, 95% CI: 3.993-5.672; P<0.001, and HR=2.109, 95% CI: 1.278-8.229; P=0.039, respectively). Kaplan-Meier analysis with the log-rank test indicated that high argonaute 2 protein expression had a significant impact on overall survival (P=0.0169) and progression-free survival (P=0.0324). Conclusions: The present study showed that argonaute 2 expression is up-regulated in gliomas. Argonaute 2 might also serve as a novel prognostic marker.

A Method to Evaluate Rate of 'Soft-Hard' In a Drawing (그림의 '부드러운-딱딱한' 정도의 평가 방법)

  • Yoon, Seok-Hoon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.12
    • /
    • pp.3963-3970
    • /
    • 2009
  • This study proposes a method to evaluate the level of 'soft-hard' of color quantitatively by evaluating the shape with edge sharpness automatically and by evaluating color in the color image scale in a drawing in art therapy using a computer. The dependent variable is the rank for the color experts to rate the level of 'soft-hard'. The mean and standard deviation of Value(V), and Chroma(C), colors, main color, clusters, length of edge, and sharp line rate of edge are considered as the independent variable. The appropriate independent variables to explain the dependent variable are selected through the step wise regression analysis. The inter-rater reliability of two raters is checked and the validity of developed system is verified by the rank correlations coefficient between the ranks of rater's and system's. This system can be used to evaluate of the shape or color in a drawing objectively and quantitatively for art therapy assessment, and to give the useful information to the fashion, textile, interior industry as well as color psychology and art therapy.

Tightly Coupled Integration of Ranking SVM and RDBMS (랭킹 SVM과 RDBMS의 밀결합 통합)

  • Song, Jae-Hwan;Oh, Jin-Oh;Yang, Eun-Seok;Yu, Hwan-Jo
    • Journal of KIISE:Databases
    • /
    • v.36 no.4
    • /
    • pp.247-253
    • /
    • 2009
  • Rank learning and processing have gained much attention in the IR and data mining communities for the last decade. While other data mining techniques such as classification and regression have been actively researched to interoperate with RDBMS by using the tightly coupled or loose coupling approaches, ranking has been researched independently without integrating into RDBMS. This paper proposes a tightly coupled integration of the Ranking SVM into MySQL in order to perform the rank learning task efficiently within the RDBMS. We implemented new SQL commands for learning ranking functions and predicting ranking scores. We evaluated our tightly coupled integration of Ranking SVM by comparing it to a loose coupling implementation. The experiment results show that our approach has a performance improvement of $10{\sim}40%$ in the training phase and 60% in the prediction phase.