Correlation and Simple Linear Regression

상관성과 단순선형회귀분석

  • Pak, Son-Il (College of Veterinary Medicine and Institute of Veterinary Science, Kangwon National University) ;
  • Oh, Tae-Ho (College of Veterinary Medicine, Kyungpook National University)
  • 박선일 (강원대학교 수의과대학 및 동물의학종합연구소) ;
  • 오태호 (경북대학교 수의과대학)
  • Accepted : 2010.06.14
  • Published : 2010.08.30

Abstract

Correlation is a technique used to measure the strength or the degree of closeness of the linear association between two quantitative variables. Common misuses of this technique are highlighted. Linear regression is a technique used to identify a relationship between two continuous variables in mathematical equations, which could be used for comparison or estimation purposes. Specifically, regression analysis can provide answers for questions such as how much does one variable change for a given change in the other, how accurately can the value of one variable be predicted from the knowledge of the other. Regression does not give any indication of how good the association is while correlation provides a measure of how well a least-squares regression line fits the given set of data. The better the correlation, the closer the data points are to the regression line. In this tutorial article, the process of obtaining a linear regression relationship for a given set of bivariate data was described. The least square method to obtain the line which minimizes the total error between the data points and the regression line was employed and illustrated. The coefficient of determination, the ratio of the explained variation of the values of the independent variable to total variation, was described. Finally, the process of calculating confidence and prediction interval was reviewed and demonstrated.

Keywords

References

  1. Bewick V, Cheek L, Ball J. Statistical review 7: correlation and regression. Crit Care 2003; 7: 451-459 https://doi.org/10.1186/cc2401
  2. Bland M, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986; I: 307-310.
  3. Daniel WW. Applied nonparametric statistics. 2nd ed. Thompson Information Publishing Group, Boston, 1990.
  4. Fleiss JL. Statistical methods for rates and proportions. 2nd ed. pp. 212-236, Wiley & Sons, New York, NY, 1981.
  5. Glantz SA, Slinker BK. Primer of applied regression and analysis of variance. McGraw-Hill, New York, 1990.
  6. Zar JH. Biostatistical analysis. 4th ed. pp. 381-386, Prentice-Hall International, UK, 1999.
  7. Zou KH, Tuncali K, Silverman SG. Correlation and simple linear regression. Radiology 2003; 227: 617-622. https://doi.org/10.1148/radiol.2273011499