• 제목/요약/키워드: non-normality

검색결과 105건 처리시간 0.03초

A comparison of tests for homoscedasticity using simulation and empirical data

  • Anastasios Katsileros;Nikolaos Antonetsis;Paschalis Mouzaidis;Eleni Tani;Penelope J. Bebeli;Alex Karagrigoriou
    • Communications for Statistical Applications and Methods
    • /
    • 제31권1호
    • /
    • pp.1-35
    • /
    • 2024
  • The assumption of homoscedasticity is one of the most crucial assumptions for many parametric tests used in the biological sciences. The aim of this paper is to compare the empirical probability of type I error and the power of ten parametric and two non-parametric tests for homoscedasticity with simulations under different types of distributions, number of groups, number of samples per group, variance ratio and significance levels, as well as through empirical data from an agricultural experiment. According to the findings of the simulation study, when there is no violation of the assumption of normality and the groups have equal variances and equal number of samples, the Bhandary-Dai, Cochran's C, Hartley's Fmax, Levene (trimmed mean) and Bartlett tests are considered robust. The Levene (absolute and square deviations) tests show a high probability of type I error in a small number of samples, which increases as the number of groups rises. When data groups display a nonnormal distribution, researchers should utilize the Levene (trimmed mean), O'Brien and Brown-Forsythe tests. On the other hand, if the assumption of normality is not violated but diagnostic plots indicate unequal variances between groups, researchers are advised to use the Bartlett, Z-variance, Bhandary-Dai and Levene (trimmed mean) tests. Assessing the tests being considered, the test that stands out as the most well-rounded choice is the Levene's test (trimmed mean), which provides satisfactory type I error control and relatively high power. According to the findings of the study and for the scenarios considered, the two non-parametric tests are not recommended. In conclusion, it is suggested to initially check for normality and consider the number of samples per group before choosing the most appropriate test for homoscedasticity.

Robustness of Bayes forecast to Non-normality

  • Bansal, Ashok K.
    • Journal of the Korean Statistical Society
    • /
    • 제7권1호
    • /
    • pp.11-16
    • /
    • 1978
  • Bayesian procedures are in vogue to revise the parameter estimates of the forecasting model in the light of actual time series data. In this paper, we study the Bayes forecast for demand and the risk when (a) 'noise' and (b) mean demand rate in a constant process model have moderately non-normal probability distributions.

  • PDF

A spatial heterogeneity mixed model with skew-elliptical distributions

  • Farzammehr, Mohadeseh Alsadat;McLachlan, Geoffrey J.
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.373-391
    • /
    • 2022
  • The distribution of observations in most econometric studies with spatial heterogeneity is skewed. Usually, a single transformation of the data is used to approximate normality and to model the transformed data with a normal assumption. This assumption is however not always appropriate due to the fact that panel data often exhibit non-normal characteristics. In this work, the normality assumption is relaxed in spatial mixed models, allowing for spatial heterogeneity. An inference procedure based on Bayesian mixed modeling is carried out with a multivariate skew-elliptical distribution, which includes the skew-t, skew-normal, student-t, and normal distributions as special cases. The methodology is illustrated through a simulation study and according to the empirical literature, we fit our models to non-life insurance consumption observed between 1998 and 2002 across a spatial panel of 103 Italian provinces in order to determine its determinants. Analyzing the posterior distribution of some parameters and comparing various model comparison criteria indicate the proposed model to be superior to conventional ones.

데이터 증가를 통한 선형 모델의 일반화 성능 개량 (중심극한정리를 기반으로) (Improvement of generalization of linear model through data augmentation based on Central Limit Theorem)

  • 황두환
    • 지능정보연구
    • /
    • 제28권2호
    • /
    • pp.19-31
    • /
    • 2022
  • 기계학습 모델 구축 간 트레이닝 데이터를 활용하며, 훈련 간 사용되지 않은 테스트 데이터를 활용하여 모델의 정확도와 일반화 성능을 판단한다. 일반화 성능이 낮은 모델의 경우 새롭게 받아들이게 되는 데이터에 대한 예측 정확도가 현저히 감소하게 되며 이러한 현상을 두고 모델이 과적합 되었다고 한다. 본 연구는 중심극한정리를 기반으로 데이터를 생성 및 기존의 훈련용 데이터와 결합하여 새로운 훈련용 데이터를 구성하고 데이터의 정규성을 증가시킴과 동시에 이를 활용하여 모델의 일반화 성능을 증가시키는 방법에 대한 것이다. 이를 위해 중심극한정리의 성질을 활용해 데이터의 각 특성별로 표본평균 및 표준편차를 활용하여 데이터를 생성하였고, 새로운 훈련용 데이터의 정규성 증가 정도를 파악하기 위하여 Kolmogorov-Smirnov 정규성 검정을 진행한 결과, 새로운 훈련용 데이터가 기존의 데이터에 비해 정규성이 증가하였음을 확인할 수 있었다. 일반화 성능은 훈련용 데이터와 테스트용 데이터에 대한 예측 정확도의 차이를 통해 측정하였다. 새롭게 생성된 데이터를 K-Nearest Neighbors(KNN), Logistic Regression, Linear Discriminant Analysis(LDA)에 적용하여 훈련시키고 일반화 성능 증가정도를 파악한 결과, 비모수(non-parametric) 기법인 KNN과 모델 구성 간 정규성을 가정으로 갖는 LDA의 경우에 대하여 일반화 성능이 향상되었음을 확인할 수 있었다.

Asymptotic Normality for Threshold-Asymmetric GARCH Processes of Non-Stationary Cases

  • Park, J.A.;Hwang, S.Y.
    • Communications for Statistical Applications and Methods
    • /
    • 제18권4호
    • /
    • pp.477-483
    • /
    • 2011
  • This article is concerned with a class of threshold-asymmetric GARCH models both for stationary case and for non-stationary case. We investigate large sample properties of estimators from QML(quasi-maximum likelihood) and QL(quasilikelihood) methods. Asymptotic distributions are derived and it is interesting to note for non-stationary case that both QML and QL give asymptotic normal distributions.

A STUDY ON PROCESS CAPABILITY INDICES FOR NON-NORMAL DATA

  • Kwon Seungsoo;Park Sung H.;Xu Jichao
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 1998년도 The 12th Asia Quality Management Symposium* Total Quality Management for Restoring Competitiveness
    • /
    • pp.159-173
    • /
    • 1998
  • Quality characteristics on the properties of process capability indices (PCIs) are often required to be normally distributed. But, if a characteristic is not normally distributed, serious errors can result from normal-based techniques. In this case, we may well consider the use of new PCIs specially designed to be robust for non-normality. In this paper, a newly proposed measure of process capability is introduced and compared with existing PCIs using the simulated non-normal data.

  • PDF

OPTIMAL APPROXIMATION BY ONE GAUSSIAN FUNCTION TO PROBABILITY DENSITY FUNCTIONS

  • Gwang Il Kim;Seung Yeon Cho;Doobae Jun
    • East Asian mathematical journal
    • /
    • 제39권5호
    • /
    • pp.537-547
    • /
    • 2023
  • In this paper, we introduce the optimal approximation by a Gaussian function for a probability density function. We show that the approximation can be obtained by solving a non-linear system of parameters of Gaussian function. Then, to understand the non-normality of the empirical distributions observed in financial markets, we consider the nearly Gaussian function that consists of an optimally approximated Gaussian function and a small periodically oscillating density function. We show that, depending on the parameters of the oscillation, the nearly Gaussian functions can have fairly thick heavy tails.

Box-Cox변환을 이용한 다변량 공정능력 분석 (Analysis of Multivariate Process Capability Using Box-Cox Transformation)

  • 문혜진;정영배
    • 산업경영시스템학회지
    • /
    • 제42권2호
    • /
    • pp.18-27
    • /
    • 2019
  • The process control methods based on the statistical analysis apply the analysis method or mathematical model under the assumption that the process characteristic is normally distributed. However, the distribution of data collected by the automatic measurement system in real time is often not followed by normal distribution. As the statistical analysis tools, the process capability index (PCI) has been used a lot as a measure of process capability analysis in the production site. However, PCI has been usually used without checking the normality test for the process data. Even though the normality assumption is violated, if the analysis method under the assumption of the normal distribution is performed, this will be an incorrect result and take a wrong action. When the normality assumption is violated, we can transform the non-normal data into the normal data by using an appropriate normal transformation method. There are various methods of the normal transformation. In this paper, we consider the Box-Cox transformation among them. Hence, the purpose of the study is to expand the analysis method for the multivariate process capability index using Box-Cox transformation. This study proposes the multivariate process capability index to be able to use according to both methodologies whether data is normally distributed or not. Through the computational examples, we compare and discuss the multivariate process capability index between before and after Box-Cox transformation when the process data is not normally distributed.

PICARD VALUES AND NORMALITY CRITERION

  • Fang, Ming-Liang
    • 대한수학회보
    • /
    • 제38권2호
    • /
    • pp.379-387
    • /
    • 2001
  • In this paper, we study the value distribution of meromorphic functions and prove the following theorem: Let f(z) be a transcendental meromorphic function. If f and f'have the same zeros, then f'(z) takes any non-zero value b infinitely many times.

  • PDF

Bayes Prediction Density in Linear Models

  • Kim, S.H.
    • Communications for Statistical Applications and Methods
    • /
    • 제8권3호
    • /
    • pp.797-803
    • /
    • 2001
  • This paper obtained Bayes prediction density for the spatial linear model with non-informative prior. It showed the results that predictive inferences is completely unaffected by departures from the normality assumption in the direction of the elliptical family and the structure of prediction density is unchanged by more than one additional future observations.

  • PDF