• 제목/요약/키워드: rank prediction

검색결과 93건 처리시간 0.023초

Accuracy of genomic-polygenic estimated breeding value for milk yield and fat yield in the Thai multibreed dairy population with five single nucleotide polymorphism sets

  • Wongpom, Bodin;Koonawootrittriron, Skorn;Elzo, Mauricio A.;Suwanasopee, Thanathip;Jattawa, Danai
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제32권9호
    • /
    • pp.1340-1348
    • /
    • 2019
  • Objective: The objectives were to compare variance components, genetic parameters, prediction accuracies, and genomic-polygenic estimated breeding value (EBV) rankings for milk yield (MY) and fat yield (FY) in the Thai multibreed dairy population using five single nucleotide polymorphism (SNP) sets from GeneSeek GGP80K chip. Methods: The dataset contained monthly MY and FY of 8,361 first-lactation cows from 810 farms. Variance components, genetic parameters, and EBV for five SNP sets from the GeneSeek GGP80K chip were obtained using a 2-trait single-step average-information restricted maximum likelihood procedure. The SNP sets were the complete SNP set (all available SNP; SNP100), top 75% set (SNP75), top 50% set (SNP50), top 25% set (SNP25), and top 5% set (SNP5). The 2-trait models included herd-year-season, heterozygosity and age at first calving as fixed effects, and animal additive genetic and residual as random effects. Results: The estimates of additive genetic variances for MY and FY from SNP subsets were mostly higher than those of the complete set. The SNP25 MY and FY heritability estimates (0.276 and 0.183) were higher than those from SNP75 (0.265 and 0.168), SNP50 (0.275 and 0.179), SNP5 (0.231 and 0.169), and SNP100 (0.251and 0.159). The SNP25 EBV accuracies for MY and FY (39.76% and 33.82%) were higher than for SNP75 (35.01% and 32.60%), SNP50 (39.64% and 33.38%), SNP5 (38.61% and 29.70%), and SNP100 (34.43% and 31.61%). All rank correlations between SNP100 and SNP subsets were above 0.98 for both traits, except for SNP100 and SNP5 (0.93 for MY; 0.92 for FY). Conclusion: The high SNP25 estimates of genetic variances, heritabilities, EBV accuracies, and rank correlations between SNP100 and SNP25 for MY and FY indicated that genotyping animals with SNP25 dedicated chip would be a suitable to maintain genotyping costs low while speeding up genetic progress for MY and FY in the Thai dairy population.

산지사면(山地斜面)의 붕괴위험도(崩壞危險度) 예측(豫測)모델의 개발(開發) 및 실용화(實用化) 방안(方案) (Studies on Development of Prediction Model of Landslide Hazard and Its Utilization)

  • 마호섭
    • 한국산림과학회지
    • /
    • 제83권2호
    • /
    • pp.175-190
    • /
    • 1994
  • 산지사면붕괴(山地斜面崩壞)에 의(依)한 피해(被害)를 예방(豫防) 또는 극소화(極小化)하기 위(爲)하여 산사태(山沙汰)가 자주 발생(發生)하는 지역(地域)을 중심(中心)으로 각(各) 조사(調査) 단위사면(單位斜面)에 대(對)하여 산지사면붕괴(山地斜面崩壞)에 영향(影響)하리라고 예상(豫想)되는 12개(個) 환경요인(環境要因)을 현지조사(現地調査)하고 붕괴(崩壞)와 환경인자(環境因子)와의 관계(關係)를 수량화(數量化) 이론(理論)에 의(依)한 방법(方法)으로 분석(分析)하여 산사태발생(山沙汰發生)의 위험도(危險度)를 평가(評價)할 수 있는 예측(豫測)모델을 도출(導出)하였으며, 또한 이를 기초(基礎)로 하여 위험도(危險度)를 각(各) 급별(級別)로 구분(區分)하고 예지(豫知)모델을 검토(檢討)하였던 바 그 결과(結果)을 요약(要約)하면 다음과 같다. 산지붕괴발생면적(山地崩壞發生面積)에 영향(影響)을 주는 인자(因子)는 강우(降雨), 령급(齡級), 표고(標高), 토성(土性), 경사(傾斜), 사면위치(斜面位置), 임상(林相), 곡차수(谷次數), 종단사면형(縱斷斜面形), 모암(母岩), 토심(土深), 방위(方位)의 순(順)이었으며, 편상관계수(偏相關係數)에 의(依)한 인자(因子)는 령급(令級), 강우(降雨), 토성(土性), 모암(母岩), 경사(傾斜), 사면위치(斜面位置), 표고(標高), 종단사면형(縱斷斜面形), 곡차수(谷次數), 임상(林相), 토심(土深), 방위(方位)의 순(順)으로 나타났다. 또한 산지붕괴발생빈도(山地崩壞發生頻度)에 의(依)한 인자(因子)의 순위(順位)는 령급(令級), 표고(標高), 토성(土性), 경사도(傾斜度), 식생(植生), 강우(降雨), 종단사면(縱斷斜面), 곡차수(谷次數), 모암(母岩), 토심(土深)이었으며 사면위치(斜面位置) 및 방위(方位)는 기여도(寄與度)가 낮게 나타났다. 산지사면붕괴위험(山地斜面崩壞危險) 예지(豫知)를 위(爲)하여 붕괴발생면적(崩壞發生面積)에 의(依)한 예측(豫測)모델에서 위험도(危險度) 예측점수표(豫測點數表)를 작성(作成)할 수 있었으며, 점수합계(點數合計)가 9.1636이면 붕괴발생위험(崩壞發生危險)이 높은 것으로 평가(評價)되었으며 산지(山地) 사면붕괴(斜面崩壞)가 발생(發生)한 사면(斜面)과 발생(發生)하지 않은 사면(斜面)에 의(依)한 예측(豫測)모델에서 우사면(雨斜面)에 대(對)한 사면판별(斜面判別) 구분치(區分値)는 -0.02였고, 그 적중율(適中率)은 73%로 높았다. 또한 판별구분치(判別區分値)를 기준(基準)으로 한 산지사면붕괴발생(山地斜面崩壞發生) 위험도별(危險度別) 점수(點數)는 A급(級)은 0.3132 이상(以上)이었고, B급(級)은 0.3132~-0.1051, C급(級)은 -0.1050~-0.4195, D급(級)은 -0.4195 이하(以下)였다. 그리고 산지사면붕괴발생(山地斜面崩壞發生)의 예지(豫知)는 판별구분치(判別區分値)를 기준(基準)으로 위험도(危險度)을 A, B, C, D의 4등급(等級)으로 구분(區分)할 수 있었으며, 총(總) 300개(個) 사면(斜面) 중(中) A급사면(級斜面) 68개(個), B급사면(級斜面) 115개(個), C급사면(級斜面) 65개, D급사면(級斜面) 52개(個)였다. 위험도(危險度) A, B급(級)에서의 산사태발생(山沙汰發生)은 150개(個) 붕괴지(崩壞地) 중 125개(個)로서 약(約) 83.3%의 높은 적중율(適中率)을 보여 예측(豫測)모델로서 응용(應用) 가능성(可能性)이 높게 나타났다. 따라서 이러한 예지방법(豫知方法)에 의(依)하여 선정(選定)한 위험(危險)한 지역(地域)에 대(對)하여 산지재해위험도(山地災害危險度) 지도(地圖)를 작성(作成)하여 토지이용(土地利用) 계획(計劃) 및 재해위험지(災害危險地) 선정기준(選定基準)의 행정지표(行政指標)로서 활용(活用)할 수 있을 것이다. 또한 산지재해(山地災害)에 대(對)한 종합(綜合) 대책(對策)에 유용(有用)하게 활용(活用)함으로써 막대(莫大)한 재산(財産) 피해(被害)와 인명(人命) 손실(損失)을 사전(事前)에 방지(防止)할 수 있을 것이다.

  • PDF

The Significance of Serum Carcinoembryonic Antigen in Lung Adenocarcinoma

  • Kim, Jae Jun;Hyun, Kwanyong;Park, Jae Kil;Moon, Seok Whan
    • Journal of Chest Surgery
    • /
    • 제48권5호
    • /
    • pp.335-344
    • /
    • 2015
  • Background: A raised carcinoembryonic antigen (CEA) may be associated with significant pathology during the postoperative follow-up of lung adenocarcinoma. Methods: We reviewed the medical records of 305 patients who underwent surgical resections for primary lung adenocarcinoma at a single institution between April 2006 and February 2013. Results: Preoperative CEA levels were significantly associated with age, smoking history, pathologic stage including pT (pathologic tumor stge), pN (pathologic nodal stage) and overall pathological stage, tumor size and differentiation, pathologically positive total lymph node, N1 and N2 lymph node, N2 nodal station (0/1/2=1.83/2.94/7.21 ng/mL, p=0.019), and 5-year disease-free survival (0.591 in group with normal preoperative CEA levels vs. 0.40 in group with high preoperative CEA levels, p=0.001). Preoperative CEA levels were significantly higher than postoperative CEA levels (p<0.001, Wilcoxon signed-rank test). Postoperative CEA level was also significantly associated with disease-free survival (p<0.001). A follow-up serum CEA value of >2.57 ng/mL was found to be the appropriate cutoff value for the prediction of cancer recurrence with sensitivity and specificity of 71.4% and 72.3%, respectively. Twenty percent of patients who had recurrence of disease had a CEA level elevated above this cutoff value prior to radiographic evidence of recurrence. Postoperative CEA, pathologic stage, differentiation, vascular invasion, and neoadjuvant therapy were identified as independent predictors of 5-year disease-free survival in a multivariate analysis. Conclusion: The follow-up CEA level can be a useful tool for detecting early recurrence undetected by postoperative imaging studies. The perioperative follow-up CEA levels may be helpful for providing personalized evaluation of lung adenocarcinoma.

머신러닝을 활용한 지역축제 방문객 수 예측모형 개발 (Development of a Model to Predict the Number of Visitors to Local Festivals Using Machine Learning)

  • 이인지;윤현식
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제29권3호
    • /
    • pp.35-52
    • /
    • 2020
  • Purpose Local governments in each region actively hold local festivals for the purpose of promoting the region and revitalizing the local economy. Existing studies related to local festivals have been actively conducted in tourism and related academic fields. Empirical studies to understand the effects of latent variables on local festivals and studies to analyze the regional economic impacts of festivals occupy a large proportion. Despite of practical need, since few researches have been conducted to predict the number of visitors, one of the criteria for evaluating the performance of local festivals, this study developed a model for predicting the number of visitors through various observed variables using a machine learning algorithm and derived its implications. Design/methodology/approach For a total of 593 festivals held in 2018, 6 variables related to the region considering population size, administrative division, and accessibility, and 15 variables related to the festival such as the degree of publicity and word of mouth, invitation singer, weather and budget were set for the training data in machine learning algorithm. Since the number of visitors is a continuous numerical data, random forest, Adaboost, and linear regression that can perform regression analysis among the machine learning algorithms were used. Findings This study confirmed that a prediction of the number of visitors to local festivals is possible using a machine learning algorithm, and the possibility of using machine learning in research in the tourism and related academic fields, including the study of local festivals, was captured. From a practical point of view, the model developed in this study is used to predict the number of visitors to the festival to be held in the future, so that the festival can be evaluated in advance and the demand for related facilities, etc. can be utilized. In addition, the RReliefF rank result can be used. Considering this, it will be possible to improve the existing local festivals or refer to the planning of a new festival.

Functional Prediction of Imprinted Genes in Chicken Based on a Mammalian Comparative Expression Network

  • Kim, Hyo-Young;Moon, Sun-Jin;Kim, Hee-Bal
    • Genomics & Informatics
    • /
    • 제6권1호
    • /
    • pp.32-35
    • /
    • 2008
  • Little evidence supports the existence of imprinted genes in chicken. Imprinted genes are thought to be intimately connected with the acquisition of parental resources in mammals; thus, the predicted lack of this type of gene in chicken is not surprising, given that they leave their offspring to their own heritance after conception. In this study, we identified several imprinted genes and their orthologs in human, mouse, and zebrafish, including 30 previously identified human and mouse imprinted genes. Next, using the HomoloGene database, we identified six orthologous genes in human, mouse, and chicken; however, no orthologs were identified for SLC22A18, and mouse Ppp1r9a was not included in the HomoloGene database. Thus, from our analysis, four candidate chicken imprinted genes (IGF2, UBE3A, PHLDA2, and GRB10) were identified. To expand our analysis, zebrafish was included, but no probe ID for UBE3A exists in this species. Thus, ultimately, three candidate imprinted genes (IGF2, PHLDA2, and GRB10) in chicken were identified. GRB10 was not significant in chicken and zebrafish based on the Wilcoxon-Mann-Whitney test, whereas a weak correlation between PHLDA2 in chicken and human was identified from the Spearman's rank correlation coefficient. Significant associations between human, mouse, chicken, and zebrafish were found for IGF2 and GRB10 using the Friedman's test. Based on our results, IGF2, PHLDA2, and GRB10 are candidate imprinted genes in chicken. Importantly, the strongest candidate was PHLDA2.

Bayesian Survival Analysis of High-Dimensional Microarray Data for Mantle Cell Lymphoma Patients

  • Moslemi, Azam;Mahjub, Hossein;Saidijam, Massoud;Poorolajal, Jalal;Soltanian, Ali Reza
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권1호
    • /
    • pp.95-100
    • /
    • 2016
  • Background: Survival time of lymphoma patients can be estimated with the help of microarray technology. In this study, with the use of iterative Bayesian Model Averaging (BMA) method, survival time of Mantle Cell Lymphoma patients (MCL) was estimated and in reference to the findings, patients were divided into two high-risk and low-risk groups. Materials and Methods: In this study, gene expression data of MCL patients were used in order to select a subset of genes for survival analysis with microarray data, using the iterative BMA method. To evaluate the performance of the method, patients were divided into high-risk and low-risk based on their scores. Performance prediction was investigated using the log-rank test. The bioconductor package "iterativeBMAsurv" was applied with R statistical software for classification and survival analysis. Results: In this study, 25 genes associated with survival for MCL patients were identified across 132 selected models. The maximum likelihood estimate coefficients of the selected genes and the posterior probabilities of the selected models were obtained from training data. Using this method, patients could be separated into high-risk and low-risk groups with high significance (p<0.001). Conclusions: The iterative BMA algorithm has high precision and ability for survival analysis. This method is capable of identifying a few predictive variables associated with survival, among many variables in a set of microarray data. Therefore, it can be used as a low-cost diagnostic tool in clinical research.

Changing patterns of Serum CEA and CA199 for Evaluating the Response to First-line Chemotherapy in Patients with Advanced Gastric Adenocarcinoma

  • He, Bo;Zhang, Hui-Qing;Xiong, Shu-Ping;Lu, Shan;Wan, Yi-Ye;Song, Rong-Feng
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제16권8호
    • /
    • pp.3111-3116
    • /
    • 2015
  • Background: This study was designed to investigate the value of CEA and CA199 in predicting the treatment response to palliative chemotherapy for advanced gastric cancer. Materials and Methods: We studied 189 patients with advanced gastric cancer who received first-line chemotherapy, measured the serum CEA and CA199 levels, used RECIST1.1 as the gold standard and analyzed the value of CEA and CA199 levels changes in predicting the treatment efficacy of chemotherapy. Results: Among the 189 patients, 80 and 94 cases had increases of baseline CEA (${\geq}5ng/ml$) and CA199 levels (${\geq}27U/ml$), respectively. After two cycles of chemotherapy, 42.9% patients showed partial remission, 33.3% stable disease, and 23.8% progressive disease. The area under the ROC curve (AUC) for CEA and CA199 reduction in predicting effective chemotherapy were 0.828 (95%CI 0.740-0.916) and 0.897 (95%CI 0.832-0.961). The AUCs for CEA and CA199 increase in predicting progression after chemotherapy were 0.923 (95%CI 0.865-0.980) and 0.896 (95%CI 0.834-0.959), respectively. Patients who exhibited a CEA decline ${\geq}24%$ and a CA199 decline ${\geq}29%$ had significantly longer PFS (log rank p=0.001, p<0.001). With the exception of patients who presented with abnormal levels after chemotherapy, changes of CEA and CA199 levels had limited value for evaluating the chemotherapy efficacy in patients with normal baseline tumor markers. Conclusions: Changes in serum CEA and CA199 levels can accurately predict the efficacy of first-line chemotherapy in advanced gastric cancer. Patients with levels decreasing beyond the optimal critical values after chemotherapy have longer PFS.

Micro PAVER의 국내 적용을 위한 적정화에 대한 연구 (The Study On Customization for Domestic Application of Micro PAVER)

  • 안덕순;권수안;서영찬
    • 한국도로학회논문집
    • /
    • 제5권3호
    • /
    • pp.21-29
    • /
    • 2003
  • 현재 공항포장은 기존 포장관련정보의 관리에 대한 어려움이 있고, 장기공용으로 인해 유지보수비용이 증가하고 있어 예산의 효율적 활용과 공항포장의 체계적 유지관리가 필요하다. 이를 위해서 포장관리시스템을 도입할 필요성이 있다. 본 연구에서는 외국의 여러 포장관리시스템 중 세계적으로 널리 활용되고 있는 Micro PAVER를 국내에 도입하기 위한 방안을 연구하였다. 외국의 시스템을 국내에 적용하기 위해서는 환경차이, 포장관리수준 차이 등으로 인해 발생하는 오류를 최소화하기 위해 국내에 맞도록 적정화(customization)하는 연구가 필요하다. 따라서 본 연구에서는 Micro PAVER를 국내에 도입하기 위해 필요한 적정화 연구를 수행하였다 Micro PAVER의 논리 및 구조를 분석하여 Micro PAVER 운영에 주요하게 작용하는 부분을 기존 포장평가자료와 포장 전문가의 설문조사 방법을 이용하여 적정화하였다. 주요 적정화 부분은 포장파손예측모형, critical PCI, PCI에 따른 유지보수비용, 유지보수공법 및 단가, PCI 등급이다.

  • PDF

Fuzzy Clustering with Genre Preference for Collaborative Filtering

  • Lee, Soojung
    • 한국컴퓨터정보학회논문지
    • /
    • 제25권5호
    • /
    • pp.99-106
    • /
    • 2020
  • 협력 필터링 기반의 추천 시스템에 내재된 확장성 문제는 지난 수십년간 관련 연구의 이슈가 되어 왔다. 클러스터링은 이 문제를 해결하는 유명한 기술인데 낮은 성능으로 인하여 활발히 연구되어 오진 않았다. 본 논문에서는 협력 필터링 시스템의 고질적인 단점인 확장성 문제를 극복하기 위하여 클러스터링 기법을 채택하였다. 또한 클러스터링을 적용함으로 인해 초래되는 성능저하 문제를 개선하기 위해, 두 가지 전략을 사용하였는데, 첫째는 퍼지 클러스터링이며, 둘째는 영화 장르에 대한 사용자 선호도에 기반한 유사도 측정 방법을 제안하고 이를 적용하였다. 본 연구에서의 제안 방법을 기존의 여러 관련 방법들과 비교 실험을 통해 다양한 주요 성능 척도에 의거하여 평가하였는데, 실험 결과 제안 방법은 예측과 순위 정확도 측면에서 더 우수한 성능을 보였고, 추천 정확도 측면에서는 실험 대상 중 최상의 방법과 대등한 성능을 나타냈다.

Matrix Metalloproteinase-13 - A Potential Biomarker for Detection and Prognostic Assessment of Patients with Esophageal Squamous Cell Carcinoma

  • Sedighi, Maryam;Aledavood, Seyed Amir;Abbaszadegan, MR;Memar, Bahram;Montazer, Mehdi;Rajabian, Majid;Gholamin, Mehran
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권6호
    • /
    • pp.2781-2785
    • /
    • 2016
  • Background: Matric metalloproteinase (MMP) 13 gene expression is increased in esophageal squamous cell carcinomas (ESCCs) and associated with increasing tumor invasion, lymph node involvement and decreased survival rates. Levels of the circulating enzyme may be elevated and used as a marker of tumor progression. In this study, clinical application of MMP-13 serum levels was evaluated for early detection, prediction of prognosis and survival time of ESCC patients. Materials and Methods: Serum levels of MMP13 were determined by ELISA in 66 ESCC patients prior of any treatment and 54 healthy controls for comparison with clinicopathological data through statistical analysis with Man Whitney U and Log-Rank tests. In addition, clinical value of MMP13 levels for diagnosis was evaluated by receiver operating characteristic (ROC) test. Results: The serum level of MMP-13 in patients (>250 pg/ml) was significantly higher than in the control group (<100 pg/ml) (p value=0.004). Also the results showed a significant correlation between MMP-13 serum levels with tumor stage (p value = 0.003), depth of tumor invasion (p value=0.008), involvement of lymph nodes (p value = 0.011), tumor size (p value = 0.018) and survival time. While there were no significant correlation with grade and location of tumors. ROC analysis showed that MMP-13 level is an accurate diagnostic marker especially to differentiate pre-invasive/ invasive lesions from normal controls (sensitivity and specificity: 100%). Conclusions: These findings indicate a potential clinical significance of serum MMP13 measurement for early detection and prognostic assessment in ESCC patients.