• 제목/요약/키워드: univariate method

검색결과 268건 처리시간 0.024초

Survival Analysis and Prognostic Factors for Colorectal Cancer Patients in Malaysia

  • Hassan, Muhammad Radzi Abu;Suan, Mohd Azri Mohd;Soelar, Shahrul Aiman;Mohammed, Noor Syahireen;Ismail, Ibtisam;Ahmad, Faizah
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제17권7호
    • /
    • pp.3575-3581
    • /
    • 2016
  • Background: Cancer survival analysis is an essential indicator for effective early detection and improvements in cancer treatment. This study was undertaken to document colorectal cancer survival and associated prognostic factors in Malaysians. Materials and Methods: All data were retrieved from the National Cancer Patient Registry-Colorectal Cancer. Only cases with confirmed diagnosis through histology between the year 2008 and 2009 were included. Retrieved data include socio-demographic information, pathological features and treatment received. Survival curves were plotted using the Kaplan-Meier method. Univariate analysis of all variables was then made using the Log-rank test. All significant factors that influenced survival of patients were further analysed in a multivariate analysis using Cox' regression. Results: Total of 1,214 patients were included in the study. The overall 3- and 5-year survival rates were 59.1% and 48.7%, respectively. Patients with localized tumours had better prognosis compared to those with advanced stage cancer. In univariate analysis, staging at diagnosis (p<0.001), primary tumour size (p<0.001), involvement of lymph nodes (p<0.001) and treatment modalities (p=0.001) were found to be predictors of survival. None of the socio-demographic characteristics were found to exert any influence. In Cox regression analysis, staging at diagnosis (p<0.001), primary tumour size (p<0.001), involvement of lymph nodes (p<0.001) and treatment modalities (p<0.001) were determined as independent prognostic factors of survival after adjusted for age, gender and ethnicity. Conclusions: The overall survival rate for colorectal cancer patients in Malaysia is similar to those in other Asian countries, with staging at diagnosis, primary tumor size, involvement of lymph node and treatment modalities having significant effects. More efforts are needed to improve national survival rates in future.

수산기업의 부실화 요인 및 예측에 관한 연구 (A Study on the Distress Prediction in the Fishery Industry)

  • 이윤원;장창익;홍재범
    • 한국수산경영학회:학술대회논문집
    • /
    • 한국수산경영학회 2007년도 추계학술발표회 및 심포지엄
    • /
    • pp.167-184
    • /
    • 2007
  • The objectives of this paper are to identify the causes of the corporate distress and to develop a distress prediction model with the financial information in fishery industry. In this study, the corporate distress is defined as economic failure and technical insolvency. Economic failure occurs by reduction, shut-down, or change of the business and technical insolvency results from failure to pay the financial debt of companies. The 33 distressed firms from 1991 to 2003 were composed by 14 economic failure companies, 15 technical insolvency companies. 4 companies applied to the both cases. The analysis of distress prediction of fishery companies were accomplished according to the distress definition. The analysis was carried out as two steps. The first step was the univariate analysis, which was used for checking the prediction power of individual financial variable. The t-test is used to identify the differences in financial variables between the distressed group and the non-distressed group. The second step was to develop distress prediction model with logistic regression. The variables showed the significant difference in univariate analysis were selected as the prediction variables. The financial ratios, used in the logistic regression model, were selected by backward elimination method. To test stability of the distress prediction model, the whole sample was divided as three sub-samples, period 1(1990$\sim$1993), period 2(1994$\sim$1997), period 3(1998$\sim$2002). The final model built from whole sample appled each three sub-samples. The results of the logistic analysis were as follows. the growth, profitability, stability ratios showed the significant effect on the distress. the some different result was found in the sub-sample (economic failure and technical insolvency). The growth and the profitability were important to predict the economic failure. The profitability and the activity were important to predict technical insolvency. It means that profitability is the really important factor to the fishery companies.

  • PDF

수산기업의 부실화 요인과 그 예측에 관한 연구 (A Study on the Distress Prediction in the Fishery Industry)

  • 장창익;이윤원;홍재범
    • 수산경영론집
    • /
    • 제39권2호
    • /
    • pp.61-79
    • /
    • 2008
  • The objectives of this paper are to identify the causes of the corporate distress and to develop a distress prediction model with the financial information in fishery industry. In this study, the corporate distress is defined as economic failure and technical insolvency. Economic failure occurs by reduction, shut - down, or change of the business and technical insolvency results from failure to pay the financial debt of companies. The 33 distressed firms from 1991 to 2003 were composed by 14 economic failure companies, 15 technical insolvency companies. 4 companies applied to the both cases. The analysis of distress prediction of fishery companies were accomplished according to the distress definition. The analysis was carried out as two steps. The first step was the univariate analysis, which was used for checking the prediction power of individual financial variable. The t - test is used to identify the differences in financial variables between the distressed group and the non - distressed group. The second step was to develop distress prediction model with logistic regression. The variables showed the significant difference in univariate analysis were selected as the prediction variables. The financial ratios, used in the logistic regression model, were selected by backward elimination method. To test stability of the distress prediction model, the whole sample was divided as three sub-samples, period 1(1990 - 1993), period 2(1994 - 1997), period 3(1998 - 2002). The final model built from whole sample appled each three sub - samples. The results of the logistic analysis were as follows. the growth, profitability, stability ratios showed the significant effect on the distress. the some different result was found in the sub - sample (economic failure and technical insolvency). The growth and the profitability were important to predict the economic failure. The profitability and the activity were important to predict technical insolvency. It means that profitability is the really important factor to the fishery companies.

  • PDF

단변량 및 다변량 함수 데이터에 대한 분산분석의 활용 (Application of functional ANOVA and functional MANOVA)

  • 김미정
    • 응용통계연구
    • /
    • 제35권5호
    • /
    • pp.579-591
    • /
    • 2022
  • 함수 데이터는 다양한 분야에서 수집되고 있으며, 집단 간의 함수 데이터를 비교해야하는 경우가 종종 발생한다. 이럴 경우 점별 분산분석 방법을 이용하여 설명하기에는 무리가 있으며, 통합된 결과를 제시할 필요가 있다. 이에 대한 다양한 연구가 제안되었으며, 최근에 R 패키지 fdANOVA로 구현되었다. 이 논문에서 우선 분산분석 및 다변량 분산분석을 설명하고, 최근에 제안된 다양한 단변량 및 다변량 함수 데이터 분산분석을 설명하고자 한다. 또한 R 패키지 fdANOVA의 사용 방법을 설명하고, 이 패키지를 이용하여 서울과 부산 지역의 주별 기온을 단변량 함수 데이터 분산분석을 통해 비교하고, 손글씨 이미지를 다변량 함수 데이터로 변환하여 다변량 함수 데이터 분산분석을 이용하여 비교하고자 한다.

Bayesian analysis of longitudinal traits in the Korea Association Resource (KARE) cohort

  • Chung, Wonil;Hwang, Hyunji;Park, Taesung
    • Genomics & Informatics
    • /
    • 제20권2호
    • /
    • pp.16.1-16.12
    • /
    • 2022
  • Various methodologies for the genetic analysis of longitudinal data have been proposed and applied to data from large-scale genome-wide association studies (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with traits of interest and to detect SNP-time interactions. We recently proposed a grid-based Bayesian mixed model for longitudinal genetic data and showed that our Bayesian method increased the statistical power compared to the corresponding univariate method and well detected SNP-time interactions. In this paper, we further analyze longitudinal obesity-related traits such as body mass index, hip circumference, waist circumference, and waist-hip ratio from Korea Association Resource data to evaluate the proposed Bayesian method. We first conducted GWAS analyses of cross-sectional traits and combined the results of GWAS analyses through a meta-analysis based on a trajectory model and a random-effects model. We then applied our Bayesian method to a subset of SNPs selected by meta-analysis to further discover SNPs associated with traits of interest and SNP-time interactions. The proposed Bayesian method identified several novel SNPs associated with longitudinal obesity-related traits, and almost 25% of the identified SNPs had significant p-values for SNP-time interactions.

PERFORMANCE OF THE AUTOREGRESSIVE METHOD IN LONG-TERM PREDICTION OF SUNSPOT NUMBER

  • Chae, Jongchul;Kim, Yeon Han
    • 천문학회지
    • /
    • 제50권2호
    • /
    • pp.21-27
    • /
    • 2017
  • The autoregressive method provides a univariate procedure to predict the future sunspot number (SSN) based on past record. The strength of this method lies in the possibility that from past data it yields the SSN in the future as a function of time. On the other hand, its major limitation comes from the intrinsic complexity of solar magnetic activity that may deviate from the linear stationary process assumption that is the basis of the autoregressive model. By analyzing the residual errors produced by the method, we have obtained the following conclusions: (1) the optimal duration of the past time for the forecast is found to be 8.5 years; (2) the standard error increases with prediction horizon and the errors are mostly systematic ones resulting from the incompleteness of the autoregressive model; (3) there is a tendency that the predicted value is underestimated in the activity rising phase, while it is overestimated in the declining phase; (5) the model prediction of a new Solar Cycle is fairly good when it is similar to the previous one, but is bad when the new cycle is much different from the previous one; (6) a reasonably good prediction of a new cycle can be made using the AR model 1.5 years after the start of the cycle. In addition, we predict the next cycle (Solar Cycle 25) will reach the peak in 2024 at the activity level similar to the current cycle.

성능 모멘트 적분법을 이용한 제작공차에 의해 발생하는 스피커 성능함수의 확률분포 특성 예측 (Prediction of Probabilistic Distribution of a Loudspeaker's Performance Due to Manufacturing Tolerances by Performance Moment Integration Method)

  • 강병수;백종현;김동훈
    • 한국자기학회지
    • /
    • 제26권3호
    • /
    • pp.81-85
    • /
    • 2016
  • 본 논문에서는 제작공차에 의해 전기기기 및 소자 관련 제품에서 발생하는 성능함수의 변동특성을 예측하기 위해서 성능 모멘트 적분법을 도입하였다. 성능함수의 확률론적 분포특성을 판단할 수 있는 평균과 분산을 효율적으로 계산하기 위해서 정규분포로 변환된 성능함수 공간과 혼합형 평균치 기법을 채용하였다. 제안된 기법의 수치적인 효율성과 정밀도를 검증하기 위해서 간단한 수학예제와 스피커 모델에 적용하여 예측된 성능함수의 확률분포 특성을 차원감소법과 몬테카를로 수치모사법의 결과와 비교하였다.

붓스트랩을 활용한 최적 절사공간중위수 추정량 (A Trimmed Spatial Median Estimator Using Bootstrap Method)

  • 이동희;정병철
    • 응용통계연구
    • /
    • 제23권2호
    • /
    • pp.375-382
    • /
    • 2010
  • 본 논문에서는 다변량 자료의 위치모수에 대한 로버스트 추정량으로 공간중위수에 대한 절사 추정량을 제안하였다. 최적절사율은 붓스트랩 방법을 이용하여 결정하였으며, 이중붓스트랩을 활용하여 추정된 절사공간중위수의 공분산행렬을 추정하였다. 모의실험 결과 붓스트랩 방법에 의한 절사공간중위수는 자료가 다변량 코시분포를 따르는 경우 기존 공간중위수에 비하여 작은 평균제곱오차를 보여 효율적인 추정량으로 나타났다. 아울러 이중붓스트랩을 이용한 절사추정량의 공분산행렬 추정량은 단순붓스트랩 방법에 의하여 추정된 공분산행렬이 갖는 과소추정의 문제를 해결하는 방법으로 나타났다.

Prediction of the $24^{th}$ Solar Maximum Based on the Principal Component-and-Autoregression method

  • Chae, Jong-Chul;Oh, Seung-Jun
    • 천문학회보
    • /
    • 제36권2호
    • /
    • pp.100.1-100.1
    • /
    • 2011
  • Everybody wants to see the future, but nobody does for sure. Reliably forecasting the solar activity in the near future looks like an easy task, but in fact still remains one of difficult problems in the solar-terrestrial research. We have sought for good univariate methods that can predict future smoothed sunspot numbers reasonably well based on past smoothed sunspot number data only. Here we consider a specific method we call principal component-and-autoregression (PCAR) method. The variation of sunspot number during a period of finite duration (past) before an epoch (present) is modeled by a linear combination of a small number of dominant principal components, and this model is extended to the period (future) beyond the epoch using the autoregressive model of finite order. From the application of this method, we find that the $24^{th}$ solar maximum is likely to occur near the end of the year 2013 (and there is a possibility that it occurs earlier near the start of 2013), and to have a peak sunspot number of about 86, indicating that the activity of the $24^{th}$ cycle will be weaker than the average. We will discuss how much this estimate is reliable.

  • PDF

SHM-based probabilistic representation of wind properties: Bayesian inference and model optimization

  • Ye, X.W.;Yuan, L.;Xi, P.S.;Liu, H.
    • Smart Structures and Systems
    • /
    • 제21권5호
    • /
    • pp.601-609
    • /
    • 2018
  • The estimated probabilistic model of wind data based on the conventional approach may have high discrepancy compared with the true distribution because of the uncertainty caused by the instrument error and limited monitoring data. A sequential quadratic programming (SQP) algorithm-based finite mixture modeling method has been developed in the companion paper and is conducted to formulate the joint probability density function (PDF) of wind speed and direction using the wind monitoring data of the investigated bridge. The established bivariate model of wind speed and direction only represents the features of available wind monitoring data. To characterize the stochastic properties of the wind parameters with the subsequent wind monitoring data, in this study, Bayesian inference approach considering the uncertainty is proposed to update the wind parameters in the bivariate probabilistic model. The slice sampling algorithm of Markov chain Monte Carlo (MCMC) method is applied to establish the multi-dimensional and complex posterior distribution which is analytically intractable. The numerical simulation examples for univariate and bivariate models are carried out to verify the effectiveness of the proposed method. In addition, the proposed Bayesian inference approach is used to update and optimize the parameters in the bivariate model using the wind monitoring data from the investigated bridge. The results indicate that the proposed Bayesian inference approach is feasible and can be employed to predict the bivariate distribution of wind speed and direction with limited monitoring data.