• Title/Summary/Keyword: 정규분포 검정

Search Result 159, Processing Time 0.019 seconds

Remarks on the Use of Multivariate Skewness and Kurtosis for Testing Multivariate Normality (정규성 검정을 위한 다변량 왜도와 첨도의 이용에 대한 고찰)

  • 김남현
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.3
    • /
    • pp.507-518
    • /
    • 2004
  • Malkovich & Afifi (1973) generalized the univariate skewness and kurtosis to test a hypothesis of multivariate normality by use of the union-intersection principle. However these statistics are hard to compute for high dimensions. We propose the approximate statistics to them, which are practical for a high dimensional data set. We also compare the proposed statistics to Mardia(1970)'s multivariate skewness and kurtosis by a Monte Carlo study.

Likelihood based inference for the ratio of parameters in two Maxwell distributions (두 개의 맥스웰분포의 모수비에 대한 우도함수 추론)

  • Kang, Sang-Gil;Lee, Jeong-Hee;Lee, Woo-Dong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.1
    • /
    • pp.89-98
    • /
    • 2012
  • In this paper, the ratio of parameters in two independent Maxwell distributions is parameter of interest. We proposed test statistics, which converge to standard normal distribution, based on likelihood function. The exact distribution for testing the ratio is hard to obtain. We proposed the signed log-likelihood ratio statistic and the modified signed log-likelihood ratio statistic for testing the ratio. Through simulation, we show that the modified signed log-likelihood ratio statistic converges faster than signed log-likelihood ratio statistic to standard normal distribution. We compare two statistics in terms of type I error and power. We give an example using real data.

An Approach for the Estimation of Mixture Distribution Parameters Using EM Algorithm (복합확률분포의 파라메타 추정을 위한 EM 알고리즘의 적용 연구)

  • Daeyoung Shim;SangGu Kim
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.22 no.4
    • /
    • pp.35-47
    • /
    • 2023
  • Various single probability distributions have been used to represent time headway distributions. However, it has often been difficult to explain the time headway distribution as a single probability distribution on site. This study used the EM algorithm, which is one of the maximum likelihood estimations, for the parameters of combined mixture distributions with a certain relationship between two normal distributions for the time headway of vehicles. The time headway distribution of vehicle arrival is difficult to represent well with previously known single probability distributions. But as a result of this analysis, it can be represented by estimating the parameters of the mixture probability distribution using the EM algorithm. The result of a goodness-of-fit test was statistically significant at a significance level of 1%, which proves the reliability of parameter estimation of the mixture probability distribution using the EM algorithm.

Comparison of Some Nonparametric Statistical Inference for Logit Model (로짓모형의 비모수적 추론의 비교)

  • 정형철;김대학
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.355-366
    • /
    • 2002
  • Nonparametric statistical inference for the parameter of logit model were examined. Usually nonparametric approach is milder than parametric approach based on normal theory assumption. We compared the two nonparametric methods for legit model, the bootstrap and random permutation in the sense of coverage probability. Monte Carlo simulation is conducted for small sample cases. Empirical power of hypothesis test and coverage probability for confidence interval estimation were presented for simple and multiple legit model respectively. An example were also introduced.

Estimation of Berthing Velocity Using Probability Distribution Characteristics in Tanker Terminal (확률분포 특성을 이용한 탱커부두에서의 선박접안속도 예측값 추정)

  • Lee, Sang-Won;Cho, Jang-Won;Cho, Ik-Soon
    • Journal of Navigation and Port Research
    • /
    • v.43 no.3
    • /
    • pp.186-196
    • /
    • 2019
  • Berthing energy is majorly influenced by the berthing velocity. It is necessary to design an appropriate berthing velocity for each pier, since excessive berthing velocity can cause berthing accident causing damage to the ship and pier. In this study, as a statistical approach for berthing velocity, the probability distributions suitable for the berthing velocities were confirmed using the K-S test, the A-D test and the Q-Q plot. As a result, the frequency distribution of the berthing velocity was found to be suitable using the Weibull distribution as well as the lognormal distribution. Additionally, the predicted values obtained through estimation of the berthing velocity using the concept of probability of exceedance in this study is proposed as a reference of design berthing velocity. It can be observed that the design berthing velocity is set to be somewhat low so that it does not practically match with the reality. This study and its results can be expected to contribute to the development of a proper design velocity calculation method.

The Role of the Cauchy Probability Distribution in a Continuous Taboo Search (연속형 타부 탐색에서 코시 확률 분포의 역할)

  • Lee, Chang-Yong;Lee, Dong-Ju
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.8
    • /
    • pp.591-598
    • /
    • 2010
  • In this study, we propose a new method for generating candidate solutions based on the Cauchy probability distribution in order to complement the shortcoming of the solutions generated by the normal distribution. The Cauchy probability distribution has infinite mean and variance, and it has rather large probability in the tail region relative to the normal distribution. Thus, the Cauchy distribution can yield higher probabilities of generating candidate solutions of large-varied variables, which in turn has an advantage of searching wider area of variable space. In order to compare and analyze the performance of the proposed method against the conventional method, we carried out an experiment using benchmarking problems of real valued function. From the result of the experiment, we found that the proposed method based on the Cauchy distribution outperformed the conventional one for all benchmarking problems, and verified its superiority by the statistical hypothesis test.

Analysis of extreme wind speed and precipitation using copula (코플라함수를 이용한 극단치 강풍과 강수 분석)

  • Kwon, Taeyong;Yoon, Sanghoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.4
    • /
    • pp.797-810
    • /
    • 2017
  • The Korean peninsula is exposed to typhoons every year. Typhoons cause huge socioeconomic damage because tropical cyclones tend to occur with strong winds and heavy precipitation. In order to understand the complex dependence structure between strong winds and heavy precipitation, the copula links a set of univariate distributions to a multivariate distribution and has been actively studied in the field of hydrology. In this study, we carried out analysis using data of wind speed and precipitation collected from the weather stations in Busan and Jeju. Log-Normal, Gamma, and Weibull distributions were considered to explain marginal distributions of the copula. Kolmogorov-Smirnov, Cramer-von-Mises, and Anderson-Darling test statistics were employed for testing the goodness-of-fit of marginal distribution. Observed pseudo data were calculated through inverse transformation method for establishing the copula. Elliptical, archimedean, and extreme copula were considered to explain the dependence structure between strong winds and heavy precipitation. In selecting the best copula, we employed the Cramer-von-Mises test and cross-validation. In Busan, precipitation according to average wind speed followed t copula and precipitation just as maximum wind speed adopted Clayton copula. In Jeju, precipitation according to maximum wind speed complied Normal copula and average wind speed as stated in precipitation followed Frank copula and maximum wind speed according to precipitation observed Husler-Reiss copula.

서평 : 윤기중 저, 수리통계학, 서울 : 박영사, 1974

  • 백운붕
    • Journal of the Korean Statistical Society
    • /
    • v.3 no.1
    • /
    • pp.65-66
    • /
    • 1974
  • 통계학의 수리론을 전개한 우리의 저서가 별로 없는 터에 윤기중교수의 '수리통계학'이 박영사를 통하여 간행되었다. 이책은 미적분에 관한 수학지식으로 능히 독파할 수 있도록 순차적으로 차분하게 기술되어 있다. 집합론의 개념에서부터 시작하여 확률론의 기초사항을 친절하게 설명하고 연속확률변수의 분포, 확률표본, 점추정, 다변량정규분포, 각종 통계량의 분포, 통계적 가설검정, 구간추정, 그리고 끝으로 회귀와 상관분석에 이르기까지 각종항목에 걸쳐서 통계학이론이 빠짐없이 기술되어 있다.

  • PDF

Optimal Thresholds from Mixture Distributions (혼합분포에서 최적분류점)

  • Hong, Chong-Sun;Joo, Jae-Seon;Choi, Jin-Soo
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.1
    • /
    • pp.13-28
    • /
    • 2010
  • Assuming a mixture distribution for credit evaluation studies, we discuss estimating threshold methods to minimize errors that default borrowers are predicted as non defaults or non defaults are regarded as defaults. A method by using statistical hypotheses tests, the most powerful test and generalized likelihood ratio test, for the probability density functions which are defined with the score random variable and the parameter space consisted of only two elements such as the default and non default states is proposed to estimate a threshold. And anther optimal thresholds to maximize classification accuracy measures of the accuracy and the true rate for ROC and CAP curves are estimated as equations related with these probability density functions. Three kinds of optimal thresholds in terms of the hypotheses testing, the accuracy and the true rate are obtained from normal random samples with various means and variances. The sums of the type I and type II errors corresponding to each optimal threshold are obtained and compared. Finally we discuss about their efficiency and derive conclusions.

A Study on Gene Search Using Test for Interval Data (구간형 데이터 검정법을 이용한 유전자 탐색에 관한 연구)

  • Lee, Seong-Keon
    • Journal of the Korean Data Analysis Society
    • /
    • v.20 no.6
    • /
    • pp.2805-2812
    • /
    • 2018
  • The methylation score, expressed as a percentage of the methylation status data derived from the iterative sequencing process, has a value between 0 and 1. It is contrary to the assumption of normal distribution that simply applying the t-test to examine the difference in population-specific methylation scores in these data. In addition, since the result may vary depending on the number of repetitions of sequencing in the process of methylation score generation, a method that can analyze such errors is also necessary. In this paper, we introduce the symbolic data analysis and the interval K-S test method which convert observation data into interval data including uncertainty rather than one numerical data. In addition, it is possible to analyze the characteristics of methylation score by using Beta distribution without using normal distribution in the process of converting into interval data. For the data analysis, the nature of the proposed method was examined using sequencing data of actual patients and normal persons. While the t-test is only possible for the location test, it is found that the interval type K-S statistic can be used to test not only the location parameter but also the heterogeneity of the distribution function.