• Title/Summary/Keyword: Regression analysis method

Search Result 4,623, Processing Time 0.041 seconds

Application of Crossover Analysis-logistic Regression in the Assessment of Gene- environmental Interactions for Colorectal Cancer

  • Wu, Ya-Zhou;Yang, Huan;Zhang, Ling;Zhang, Yan-Qi;Liu, Ling;Yi, Dong;Cao, Jia
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.5
    • /
    • pp.2031-2037
    • /
    • 2012
  • Background: Analysis of gene-gene and gene-environment interactions for complex multifactorial human disease faces challenges regarding statistical methodology. One major difficulty is partly due to the limitations of parametric-statistical methods for detection of gene effects that are dependent solely or partially on interactions with other genes or environmental exposures. Based on our previous case-control study in Chongqing of China, we have found increased risk of colorectal cancer exists in individuals carrying a novel homozygous TT at locus rs1329149 and known homozygous AA at locus rs671. Methods: In this study, we proposed statistical method-crossover analysis in combination with logistic regression model, to further analyze our data and focus on assessing gene-environmental interactions for colorectal cancer. Results: The results of the crossover analysis showed that there are possible multiplicative interactions between loci rs671 and rs1329149 with alcohol consumption. Multifactorial logistic regression analysis also validated that loci rs671 and rs1329149 both exhibited a multiplicative interaction with alcohol consumption. Moreover, we also found additive interactions between any pair of two factors (among the four risk factors: gene loci rs671, rs1329149, age and alcohol consumption) through the crossover analysis, which was not evident on logistic regression. Conclusions: In conclusion, the method based on crossover analysis-logistic regression is successful in assessing additive and multiplicative gene-environment interactions, and in revealing synergistic effects of gene loci rs671 and rs1329149 with alcohol consumption in the pathogenesis and development of colorectal cancer.

UNCERTAINTY ANALYSIS OF DATA-BASED MODELS FOR ESTIMATING COLLAPSE MOMENTS OF WALL-THINNED PIPE BENDS AND ELBOWS

  • Kim, Dong-Su;Kim, Ju-Hyun;Na, Man-Gyun;Kim, Jin-Weon
    • Nuclear Engineering and Technology
    • /
    • v.44 no.3
    • /
    • pp.323-330
    • /
    • 2012
  • The development of data-based models requires uncertainty analysis to explain the accuracy of their predictions. In this paper, an uncertainty analysis of the support vector regression (SVR) model, which is a data-based model, was performed because previous research showed that the SVR method accurately estimates the collapse moments of wall-thinned pipe bends and elbows. The uncertainty analysis method used in this study was an analytic uncertainty analysis method, and estimates with a 95% confidence interval were obtained for 370 test data points. From the results, the prediction interval (PI) was very narrow, which means that the predicted values are quite accurate. Therefore, the proposed SVR method can be used effectively to assess and validate the integrity of the wall-thinned pipe bends and elbows.

A study on the properties of sensitivity analysis in principal component regression and latent root regression (주성분회귀와 고유값회귀에 대한 감도분석의 성질에 대한 연구)

  • Shin, Jae-Kyoung;Chang, Duk-Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.2
    • /
    • pp.321-328
    • /
    • 2009
  • In regression analysis, the ordinary least squares estimates of regression coefficients become poor, when the correlations among predictor variables are high. This phenomenon, which is called multicollinearity, causes serious problems in actual data analysis. To overcome this multicollinearity, many methods have been proposed. Ridge regression, shrinkage estimators and methods based on principal component analysis (PCA) such as principal component regression (PCR) and latent root regression (LRR). In the last decade, many statisticians discussed sensitivity analysis (SA) in ordinary multiple regression and same topic in PCR, LRR and logistic principal component regression (LPCR). In those methods PCA plays important role. Many statisticians discussed SA in PCA and related multivariate methods. We introduce the method of PCR and LRR. We also introduce the methods of SA in PCR and LRR, and discuss the properties of SA in PCR and LRR.

  • PDF

How to identify fake images? : Multiscale methods vs. Sherlock Holmes

  • Park, Minsu;Park, Minjeong;Kim, Donghoh;Lee, Hajeong;Oh, Hee-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.6
    • /
    • pp.583-594
    • /
    • 2021
  • In this paper, we propose wavelet-based procedures to identify the difference between images, including portraits and handwriting. The proposed methods are based on a novel combination of multiscale methods with a regularization technique. The multiscale method extracts the local characteristics of an image, and the distinct features are obtained through the regularized regression of the local characteristics. The regularized regression approach copes with the high-dimensional problem to build the relation between the local characteristics. Lytle and Yang (2006) introduced the detection method of forged handwriting via wavelets and summary statistics. We expand the scope of their method to the general image and significantly improve the results. We demonstrate the promising empirical evidence of the proposed method through various experiments.

Comparative Analysis of Determination of Method Location between Classes (클래스 간 메소드 위치 결정 방법의 비교)

  • Jung, Young-Ae;Park, Young-B.
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.12
    • /
    • pp.80-88
    • /
    • 2006
  • In Object-Oriented Paradigm, various cohesion measurements have been studied taking into account reference relation among components - like attributes and methods - that belong to a class. In addition, a number of methods have taken into research utilizing manual analysis, that is performed by developer's intuition and experience, and automatic analysis in refactoring field. The verification of objective criteria is demanded in order to process automatic refactoring. In this paper, we propose a method exploiting logistic regression and neural network for analysis of the relationship between six factors considering reference relation and method location among classes. Experimental results demonstrate that the logistic regression predicts the results up to 97% and the neural network predicts the outcomes up to 90%. Hence, we conclude that the logistic regression based method is more effective to predict the method location. Moreover, more than 90% of experimental results from both methods show that the six factors used in Move Method in refactoring are suitable to be used as an objective criteria.

  • PDF

Trend Analysis of Extreme Precipitation Using Quantile Regression (Quantile 회귀분석을 이용한 극대강수량 자료의 경향성 분석)

  • So, Byung-Jin;Kwon, Hyun-Han;An, Jung-Hee
    • Journal of Korea Water Resources Association
    • /
    • v.45 no.8
    • /
    • pp.815-826
    • /
    • 2012
  • The underestimating trend using existing ordinary regression (OR) based trend analysis has been a well-known problem. The existing OR method based on least squares approximate the conditional mean of the response variable given certain values of the time t, and the usual assumption of the OR method is normality, that is the distribution of data are not dissimilar form a normal distribution. In this regard, this study proposed a quantile regression that aims at estimating either the conditional median or other quantiles of the response variable. This study assess trend in annual daily maximum rainfall series over 64 weather stations through both in OR and QR approach. The QR method indicates that 47 stations out of 67 weather stations are a strong upward trend at 5% significance level while OR method identifies a significant trend only at 13 stations. This is mainly because the OR method is estimating the condition mean of the response variable. Unlike the OR method, the QR method allows us flexibly to detect the trends since the OR is designed to estimate conditional quantiles of the response variable. The proposed QR method can be effectively applied to estimate hydrologic trend for either non-normal data or skewed data.

Application Method of Logistic Regression Analysis for Annoyance Prediction Model Based on Predicted Noise Level (예측소음도를 이용한 어노이언스 예측모델을 위한 로지스틱 회귀분석의 적용방법)

  • Son, Jin-Hee;Lee, Kun;Choung, Tae-Ryang;Chang, Seo-Il
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.20 no.6
    • /
    • pp.555-561
    • /
    • 2010
  • Predicted noise level has been used to assess the annoyance response since noise map was generalized and being the normal method to assess the environmental noise. Unfortunately using predicted noise level to derive the annoyance prediction curve caused some problems. The data have to be grouped manually to use the annoyance prediction curve. The aim of this paper is to propose the method to handle the predicted noise level and the survey data for annoyance prediction curve. This paper used the percentage of persons annoyed(%A) and the percentage of persons highly annoyed as the descriptor of noise annoyance in a population. The logistic regression method was used for deriving annoyance prediction curve. It is concluded that the method of dichotomizing data and logistic regression was suitable to handle the predicted noise level and survey data.

Analysis of the relationship between regulation compliance and occupational injuries - Focusing on logistic and poisson regression analysis - (규제 순응도와 산업재해 발생 수준간의 관계 분석 - 로지스틱 회귀분석과 포아송 회귀분석을 중심으로 -)

  • Rhee, Kyung-Yong;Kim, Ki-Sik;Yoon, Young-Shik
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.2
    • /
    • pp.9-20
    • /
    • 2013
  • OSHA(Occupational Safety and Health Act) generally regulates employer's business principles in the workplace to maintain safety environment. This act has the fundamental purpose to protect employee's safety and health in the workplace by reducing industrial accidents. Authors tried to investigate the correlation between 'occupational injuries and illnesses' and level of regulation compliance using Survey on Current Status of Occupational Safety & Health data by the various statistical methods, such as generalized regression analysis, logistic regression analysis and poison regression analysis in order to compare the results of those methods. The results have shown that the significant affecting compliance factors were different among those statistical methods. This means that specific interpretation should be considered based on each statistical method. In the future, relevant statistical technique will be developed considering the distribution type of occupational injuries.

Fast robust variable selection using VIF regression in large datasets (대형 데이터에서 VIF회귀를 이용한 신속 강건 변수선택법)

  • Seo, Han Son
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.4
    • /
    • pp.463-473
    • /
    • 2018
  • Variable selection algorithms for linear regression models of large data are considered. Many algorithms are proposed focusing on the speed and the robustness of algorithms. Among them variance inflation factor (VIF) regression is fast and accurate due to the use of a streamwise regression approach. But a VIF regression is susceptible to outliers because it estimates a model by a least-square method. A robust criterion using a weighted estimator has been proposed for the robustness of algorithm; in addition, a robust VIF regression has also been proposed for the same purpose. In this article a fast and robust variable selection method is suggested via a VIF regression with detecting and removing potential outliers. A simulation study and an analysis of a dataset are conducted to compare the suggested method with other methods.

A New Image Analysis Method based on Regression Manifold 3-D PCA (회귀 매니폴드 3-D PCA 기반 새로운 이미지 분석 방법)

  • Lee, Kyung-Min;Lin, Chi-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.22 no.2
    • /
    • pp.103-108
    • /
    • 2022
  • In this paper, we propose a new image analysis method based on regression manifold 3-D PCA. The proposed method is a new image analysis method consisting of a regression analysis algorithm with a structure designed based on an autoencoder capable of nonlinear expansion of manifold 3-D PCA and PCA for efficient dimension reduction when entering large-capacity image data. With the configuration of an autoencoder, a regression manifold 3-DPCA, which derives the best hyperplane through three-dimensional rotation of image pixel values, and a Bayesian rule structure similar to a deep learning structure, are applied. Experiments are performed to verify performance. The image is improved by utilizing the fine dust image, and accuracy performance evaluation is performed through the classification model. As a result, it can be confirmed that it is effective for deep learning performance.