• Title/Summary/Keyword: Prediction of survival

Search Result 211, Processing Time 0.03 seconds

Accessing the Clustering of TNM Stages on Survival Analysis of Lung Cancer Patient (폐암환자 생존분석에 대한 TNM 병기 군집분석 평가)

  • Choi, Chulwoong;Kim, Kyungbaek
    • Smart Media Journal
    • /
    • v.9 no.4
    • /
    • pp.126-133
    • /
    • 2020
  • The treatment policy and prognosis are determined based on the final stage of lung cancer patients. The final stage of lung cancer patients is determined based on the T, N, and M stage classification table provided by the American Cancer Society (AJCC). However, the final stage of AJCC has limitations in its use for various fields such as patient treatment, prognosis and survival days prediction. In this paper, clustering algorithm which is one of non-supervised learning algorithms was assessed in order to check whether using only T, N, M stages with a data science method is effective for classifying the group of patients in the aspect of survival days. The final stage groups and T, N, M stage clustering groups of lung cancer patients were compared by using the cox proportional hazard model. It is confirmed that the accuracy of prediction of survival days with only T, N, M stages becomes higher than the accuracy with the final stages of patients. Especially, the accuracy of prediction of survival days with clustering of T, N, M stages improves when more or less clusters are analyzed than the seven clusters which is same to the number of final stage of AJCC.

Clinico-pathologic Factors and Machine Learning Algorithm for Survival Prediction in Parotid Gland Cancer (귀밑샘 암종에서 생존 예측을 위한 임상병리 인자 분석 및 머신러닝 모델의 구축)

  • Kwak, Seung Min;Kim, Se-Heon;Choi, Eun Chang;Lim, Jae-Yol;Koh, Yoon Woo;Park, Young Min
    • Korean Journal of Head & Neck Oncology
    • /
    • v.38 no.1
    • /
    • pp.17-24
    • /
    • 2022
  • Background/Objectives: This study analyzed the prognostic significance of clinico-pathologic factors including comprehensive nodal factors in parotid gland cancers (PGCs) patients and constructed a survival prediction model for PGCs patients using machine learning techniques. Materials & Methods: A total of 131 PGCs patients were enrolled in the study. Results: There were 19 cases (14.5%) of lymph nodes (LNs) at the lower neck level and 43 cases (32.8%) involved multiple level LNs metastases. There were 2 cases (1.5%) of metastases to the contralateral LNs. Intraparotid LNs metastasis was observed in 6 cases (4.6%) and extranodal extension (ENE) findings were observed in 35 cases (26.7%). Lymphovascular invasion (LVI) and perineural invasion findings were observed in 42 cases (32.1%) and 49 cases (37.4%), respectively. Machine learning prediction models were constructed using clinico-pathologic factors including comprehensive nodal factors and Decision Tree and Stacking model showed the highest accuracy at 74% and 70% for predicting patient's survival. Conclusion: Lower level LNs metastasis and LNR have important prognostic significance for predicting disease recurrence and survival in PGCs patients. These two factors were used as important features for constructing machine learning prediction model. Our machine learning model could predict PGCs patient's survival with a considerable level of accuracy.

Prediction Model on Delivery Time in Display FAB Using Survival Analysis (생존분석을 이용한 디스플레이 FAB의 반송시간 예측모형)

  • Han, Paul;Baek, Jun Geol
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.40 no.3
    • /
    • pp.283-290
    • /
    • 2014
  • In the flat panel display industry, to meet production target quantities and the deadline of production, the scheduler and dispatching systems are major production management systems which control the order of facility production and the distribution of WIP (Work In Process). Especially the delivery time is a key factor of the dispatching system for the time when a lot can be supplied to the facility. In this paper, we use survival analysis methods to identify main factors of the delivery time and to build the delivery time forecasting model. To select important explanatory variables, the cox proportional hazard model is used to. To make a prediction model, the accelerated failure time (AFT) model was used. Performance comparisons were conducted with two other models, which are the technical statistics model based on transfer history and the linear regression model using same explanatory variables with AFT model. As a result, the mean square error (MSE) criteria, the AFT model decreased by 33.8% compared to the statistics prediction model, decreased by 5.3% compared to the linear regression model. This survival analysis approach is applicable to implementing the delivery time estimator in display manufacturing. And it can contribute to improve the productivity and reliability of production management system.

Comparison of the Performance of Log-logistic Regression and Artificial Neural Networks for Predicting Breast Cancer Relapse

  • Faradmal, Javad;Soltanian, Ali Reza;Roshanaei, Ghodratollah;Khodabakhshi, Reza;Kasaeian, Amir
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.14
    • /
    • pp.5883-5888
    • /
    • 2014
  • Background: Breast cancer is the most common cancers in female populations. The exact cause is not known, but is most likely to be a combination of genetic and environmental factors. Log-logistic model (LLM) is applied as a statistical method for predicting survival and it influencing factors. In recent decades, artificial neural network (ANN) models have been increasingly applied to predict survival data. The present research was conducted to compare log-logistic regression and artificial neural network models in prediction of breast cancer (BC) survival. Materials and Methods: A historical cohort study was established with 104 patients suffering from BC from 1997 to 2005. To compare the ANN and LLM in our setting, we used the estimated areas under the receiver-operating characteristic (ROC) curve (AUC) and integrated AUC (iAUC). The data were analyzed using R statistical software. Results: The AUC for the first, second and third years after diagnosis are 0.918, 0.780 and 0.800 in ANN, and 0.834, 0.733 and 0.616 in LLM, respectively. The mean AUC for ANN was statistically higher than that of the LLM (0.845 vs. 0.744). Hence, this study showed a significant difference between the performance in terms of prediction by ANN and LLM. Conclusions: This study demonstrated that the ability of prediction with ANN was higher than with the LLM model. Thus, the use of ANN method for prediction of survival in field of breast cancer is suggested.

Credit Prediction Based on Kohonen Network and Survival Analysis (코호넨네트워크와 생존분석을 활용한 신용 예측)

  • Ha, Sung-Ho;Yang, Jeong-Won;Min, Ji-Hong
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.34 no.2
    • /
    • pp.35-54
    • /
    • 2009
  • The recent economic crisis not only reduces the profit of department stores but also incurs the significance losses caused by the increasing late-payment rate of credit cards. Under this pressure, the scope of credit prediction needs to be broadened from the simple prediction of whether this customer has a good credit or not to the accurate prediction of how much profit can be gained from this customer. This study classifies the delinquent customers of credit card in a Korean department store into homogeneous clusters. Using this information, this study analyzes the repayment patterns for each cluster and develops the credit prediction system to manage the delinquent customers. The model presented by this study uses Kohonen network, which is one of artificial neural networks of data mining technique, to cluster the credit delinquent customers into clusters. Cox proportional hazard model is also used, which is one of survival analysis used in medical statistics, to analyze the repayment patterns of the delinquent customers in each cluster. The presented model estimates the repayment period of delinquent customers for each cluster and introduces the influencing variables on the repayment pattern prediction. Although there are some differences among clusters, the variables about the purchasing frequency in a month and the average number of installment repayment are the most predictive variables for the repayment pattern. The accuracy of the presented system leaches 97.5%.

Optimization of Predictors of Ewing Sarcoma Cause-specific Survival: A Population Study

  • Cheung, Min Rex
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.10
    • /
    • pp.4143-4145
    • /
    • 2014
  • Background: This study used receiver operating characteristic curve to analyze Surveillance, Epidemiology and End Results (SEER) Ewing sarcoma (ES) outcome data. The aim of this study was to identify and optimize ES-specific survival prediction models and sources of survival disparities. Materials and Methods: This study analyzed socio-economic, staging and treatment factors available in the SEER database for ES. 1844 patients diagnosed between 1973-2009 were used for this study. For the risk modeling, each factor was fitted by a Generalized Linear Model to predict the outcome (bone and joint specific death, yes/no). The area under the receiver operating characteristic curve (ROC) was computed. Similar strata were combined to construct the most parsimonious models. Results: The mean follow up time (S.D.) was 74.48 (89.66) months. 36% of the patients were female. The mean (S.D.) age was 18.7 (12) years. The SEER staging has the highest ROC (S.D.) area of 0.616 (0.032) among the factors tested. We simplified the 4-layered risk levels (local, regional, distant, un-staged) to a simpler non-metastatic (I and II) versus metastatic (III) versus un-staged model. The ROC area (S.D.) of the 3-tiered model was 0.612 (0.008). Several other biologic factors were also predictive of ES-specific survival, but not the socio-economic factors tested here. Conclusions: ROC analysis measured and optimized the performance of ES survival prediction models. Optimized models will provide a more efficient way to stratify patients for clinical trials.

A Survival Prediction Model of Rats in Uncontrolled Acute Hemorrhagic Shock Using the Random Forest Classifier (랜덤 포리스트를 이용한 비제어 급성 출혈성 쇼크의 흰쥐에서의 생존 예측)

  • Choi, J.Y.;Kim, S.K.;Koo, J.M.;Kim, D.W.
    • Journal of Biomedical Engineering Research
    • /
    • v.33 no.3
    • /
    • pp.148-154
    • /
    • 2012
  • Hemorrhagic shock is a primary cause of deaths resulting from injury in the world. Although many studies have tried to diagnose accurately hemorrhagic shock in the early stage, such attempts were not successful due to compensatory mechanisms of humans. The objective of this study was to construct a survival prediction model of rats in acute hemorrhagic shock using a random forest (RF) model. Heart rate (HR), mean arterial pressure (MAP), respiration rate (RR), lactate concentration (LC), and peripheral perfusion (PP) measured in rats were used as input variables for the RF model and its performance was compared with that of a logistic regression (LR) model. Before constructing the models, we performed 5-fold cross validation for RF variable selection, and forward stepwise variable selection for the LR model to examine which variables were important for the models. For the LR model, sensitivity, specificity, accuracy, and area under the receiver operating characteristic curve (ROC-AUC) were 0.83, 0.95, 0.88, and 0.96, respectively. For the RF models, sensitivity, specificity, accuracy, and AUC were 0.97, 0.95, 0.96, and 0.99, respectively. In conclusion, the RF model was superior to the LR model for survival prediction in the rat model.

Reliability Prediction Based on Field Failure Data of Guided Missile (필드데이터 기반의 유도탄 신뢰도 예측)

  • Seo, Yangwoo;Lee, Kyeshin;Lee, Younho;Kim, Jeyong
    • Journal of Applied Reliability
    • /
    • v.18 no.3
    • /
    • pp.250-259
    • /
    • 2018
  • Purpose: Previously, missile reliability prediction is based on theoretical failure prediction model. It has shown that the predicted reliability is inadequate to real field data. Although an MTTF based reliability prediction method using real field data has recently been studied to overcome this issue. In this paper, we present a more realistic method, considering MTBF concept, to predict missile reliability. Methods: In this paper we proposed a modified survival model. This model is considering MTBF as its core concept, and failed missiles in the model are to be repaired and redeployed. We compared the modified model (MTBF) and the previous model (MTTF) in terms of fitness against the real failure data. Results: The reliability prediction result of MTBF based model is closer to fields failure data set than that of MTTF based model. Conclusion: The proposed MTBF concept is more fitted to real failure data of missile than MTTF concept. The methodology of this study can be applied to analyze field failure data of other similar missiles.

Determination of Survival of Gastric Cancer Patients With Distant Lymph Node Metastasis Using Prealbumin Level and Prothrombin Time: Contour Plots Based on Random Survival Forest Algorithm on High-Dimensionality Clinical and Laboratory Datasets

  • Zhang, Cheng;Xie, Minmin;Zhang, Yi;Zhang, Xiaopeng;Feng, Chong;Wu, Zhijun;Feng, Ying;Yang, Yahui;Xu, Hui;Ma, Tai
    • Journal of Gastric Cancer
    • /
    • v.22 no.2
    • /
    • pp.120-134
    • /
    • 2022
  • Purpose: This study aimed to identify prognostic factors for patients with distant lymph node-involved gastric cancer (GC) using a machine learning algorithm, a method that offers considerable advantages and new prospects for high-dimensional biomedical data exploration. Materials and Methods: This study employed 79 features of clinical pathology, laboratory tests, and therapeutic details from 289 GC patients whose distant lymphadenopathy was presented as the first episode of recurrence or metastasis. Outcomes were measured as any-cause death events and survival months after distant lymph node metastasis. A prediction model was built based on possible outcome predictors using a random survival forest algorithm and confirmed by 5×5 nested cross-validation. The effects of single variables were interpreted using partial dependence plots. A contour plot was used to visually represent survival prediction based on 2 predictive features. Results: The median survival time of patients with GC with distant nodal metastasis was 9.2 months. The optimal model incorporated the prealbumin level and the prothrombin time (PT), and yielded a prediction error of 0.353. The inclusion of other variables resulted in poorer model performance. Patients with higher serum prealbumin levels or shorter PTs had a significantly better prognosis. The predicted one-year survival rate was stratified and illustrated as a contour plot based on the combined effect the prealbumin level and the PT. Conclusions: Machine learning is useful for identifying the important determinants of cancer survival using high-dimensional datasets. The prealbumin level and the PT on distant lymph node metastasis are the 2 most crucial factors in predicting the subsequent survival time of advanced GC.

The Changes of Confidence, Accuracy and Knowledge of Medical Professionals after the Education for Survival Prediction in Terminally Ill Cancer Patients (말기암환자에 대한 여명 예측교육 후의 의료인의 자신감과 정확도 및 지식의 변화)

  • Park, Jun-Seok;Baek, Na-Young;Suh, Sang-Yeon;Kim, Yu-Il;Jeong, Hwee-Soo;Oh, Sang-Woo;Sung, Nak-Jin;Ahn, Hong-Yup;Seo, Ah-Ram;Lee, Yong-Joo
    • Journal of Hospice and Palliative Care
    • /
    • v.15 no.3
    • /
    • pp.155-161
    • /
    • 2012
  • Purpose: In this study, we evaluated the effects of training for survival prediction of terminally ill patients in terms of medical professionals' confidence, accuracy and knowledge of survival prediction. Methods: Twenty-nine participants completed a self-administered questionnaire where they scored their confidence, accuracy and knowledge of survival prediction before and after the training session. The training was provided in July 2009 at a university hospital located in Gyeonggi province, Republic of Korea. The participants were instructed by a professor of family medicine specialized in hospice palliative medicine to predict survival of a case using the palliative prognostic score and objective prognostic score. The training was provided in the form of a PowerPoint presentation for 40 minutes. Results: Participants' confidence in survival prediction significantly increased from $4.00{\pm}1.73$ ($mean{\pm}SD$) (0~10, visual analogue scale) to $5.83{\pm}1.71$ after the training (P<0.001). Before training, participant's level of confidence significantly correlated with their age (P=0.04). The training significantly improved the correlation between the confidence level and the number of terminal cancer patients whom they have experienced (P=0.005 before training, P=0.017 after training). Participant's accuracy in survival prediction also significantly improved from 14 of 29 (48%) to 27 of 29 (93.1%) (P<0.001). The change in knowledge of survival prediction was too small to be statistically analyzed. Conclusion: After training, the confidence and accuracy scores significantly improved. Further study with a greater number of participants is needed to generalize this finding.