Box-Cox Power Transformation Using R

  • Baek, Hoh Yoo (Division of Big Data Financial Statistics, Wonkwang University)
  • Received : 2020.06.11
  • Accepted : 2020.06.17
  • Published : 2020.06.30

Abstract

If normality of observed data is not a viable assumption, normal-theory analyses can still be carried out after suitably transforming the data. The Box-Cox power transformation, one such method, chooses the power that maximizes the likelihood function. However, this maximizing power has no closed-form solution. In this paper, we write R code, whose syntax is simpler than that of other statistical packages, to derive the power numerically. Using R, we also show through Q-Q plots that the transformed data are approximately normally distributed in the univariate and bivariate cases, with some examples. Finally, we present the value of the Anderson-Darling (AD) goodness-of-fit statistic and its p-value for the normal distribution. By a similar procedure, the method can be extended to more than two variables.
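The univariate procedure described in the abstract can be sketched in R as follows. This is a minimal illustration and not the paper's own code: the helper names box_cox and profile_loglik, the simulated log-normal sample, and the search interval (-2, 2) are all assumptions made for the example. The power is found numerically with optimize() by maximizing the profile log-likelihood, which up to an additive constant is -(n/2) log(s2(lambda)) + (lambda - 1) sum(log(x)), where s2(lambda) is the maximum-likelihood variance of the transformed data. The AD test is taken from the nortest package, assuming it is installed.

```r
# Box-Cox transform of a strictly positive vector x for a given power lambda.
# (Hypothetical helper; lambda near 0 falls back to the log transform.)
box_cox <- function(x, lambda, eps = 1e-8) {
  if (abs(lambda) < eps) log(x) else (x^lambda - 1) / lambda
}

# Profile log-likelihood of lambda, up to an additive constant.
profile_loglik <- function(lambda, x) {
  n <- length(x)
  y <- box_cox(x, lambda)
  sigma2 <- mean((y - mean(y))^2)  # maximum-likelihood variance estimate
  -n / 2 * log(sigma2) + (lambda - 1) * sum(log(x))
}

# Simulated right-skewed data (an assumption; not data from the paper).
set.seed(1)
x <- rlnorm(200, meanlog = 1, sdlog = 0.5)

# Numerical maximization over an assumed search interval.
fit <- optimize(profile_loglik, interval = c(-2, 2), x = x, maximum = TRUE)
lambda_hat <- fit$maximum
y <- box_cox(x, lambda_hat)

# Q-Q plot of the transformed data against the normal distribution.
qqnorm(y)
qqline(y)

# Anderson-Darling normality test, only if the nortest package is available.
if (requireNamespace("nortest", quietly = TRUE)) {
  print(nortest::ad.test(y))
}
```

Points lying close to the reference line in the Q-Q plot, together with a large AD p-value, would indicate that the transformed data are plausibly normal. For regression settings, MASS::boxcox offers a packaged alternative to the hand-written likelihood above.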
