• Title/Summary/Keyword: statistics based method

Search Result 2,135, Processing Time 0.028 seconds

How to identify fake images? : Multiscale methods vs. Sherlock Holmes

  • Park, Minsu;Park, Minjeong;Kim, Donghoh;Lee, Hajeong;Oh, Hee-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.6
    • /
    • pp.583-594
    • /
    • 2021
  • In this paper, we propose wavelet-based procedures to identify the difference between images, including portraits and handwriting. The proposed methods are based on a novel combination of multiscale methods with a regularization technique. The multiscale method extracts the local characteristics of an image, and the distinct features are obtained through the regularized regression of the local characteristics. The regularized regression approach copes with the high-dimensional problem to build the relation between the local characteristics. Lytle and Yang (2006) introduced the detection method of forged handwriting via wavelets and summary statistics. We expand the scope of their method to the general image and significantly improve the results. We demonstrate the promising empirical evidence of the proposed method through various experiments.

Nonparametric analysis of income distributions among different regions based on energy distance with applications to China Health and Nutrition Survey data

  • Ma, Zhihua;Xue, Yishu;Hu, Guanyu
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.1
    • /
    • pp.57-67
    • /
    • 2019
  • Income distribution is a major concern in economic theory. In regional economics, it is often of interest to compare income distributions in different regions. Traditional methods often compare the income inequality of different regions by assuming parametric forms of the income distributions, or using summary statistics like the Gini coefficient. In this paper, we propose a nonparametric procedure to test for heterogeneity in income distributions among different regions, and a K-means clustering procedure for clustering income distributions based on energy distance. In simulation studies, it is shown that the energy distance based method has competitive results with other common methods in hypothesis testing, and the energy distance based clustering method performs well in the clustering problem. The proposed approaches are applied in analyzing data from China Health and Nutrition Survey 2011. The results indicate that there are significant differences among income distributions of the 12 provinces in the dataset. After applying a 4-means clustering algorithm, we obtained the clustering results of the income distributions in the 12 provinces.

Unsupervised Speaker Adaptation Based on Sufficient HMM Statistics (SUFFICIENT HMM 통계치에 기반한 UNSUPERVISED 화자 적응)

  • Ko Bong-Ok;Kim Chong-Kyo
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.127-130
    • /
    • 2003
  • This paper describes an efficient method for unsupervised speaker adaptation. This method is based on selecting a subset of speakers who are acoustically close to a test speaker, and calculating adapted model parameters according to the previously stored sufficient HMM statistics of the selected speakers' data. In this method, only a few unsupervised test speaker's data are required for the adaptation. Also, by using the sufficient HMM statistics of the selected speakers' data, a quick adaptation can be done. Compared with a pre-clustering method, the proposed method can obtain a more optimal speaker cluster because the clustering result is determined according to test speaker's data on-line. Experiment results show that the proposed method attains better improvement than MLLR from the speaker independent model. Moreover the proposed method utilizes only one unsupervised sentence utterance, while MLLR usually utilizes more than ten supervised sentence utterances.

  • PDF

A comparative study of the Gini coefficient estimators based on the regression approach

  • Mirzaei, Shahryar;Borzadaran, Gholam Reza Mohtashami;Amini, Mohammad;Jabbari, Hadi
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.4
    • /
    • pp.339-351
    • /
    • 2017
  • Resampling approaches were the first techniques employed to compute a variance for the Gini coefficient; however, many authors have shown that an analysis of the Gini coefficient and its corresponding variance can be obtained from a regression model. Despite the simplicity of the regression approach method to compute a standard error for the Gini coefficient, the use of the proposed regression model has been challenging in economics. Therefore in this paper, we focus on a comparative study among the regression approach and resampling techniques. The regression method is shown to overestimate the standard error of the Gini index. The simulations show that the Gini estimator based on the modified regression model is also consistent and asymptotically normal with less divergence from normal distribution than other resampling techniques.

BOOTSTRAP TESTS FOR THE EQUALITY OF DISTRIBUTIONS

  • Ping, Jing
    • Journal of applied mathematics & informatics
    • /
    • v.7 no.2
    • /
    • pp.467-482
    • /
    • 2000
  • Testing equality of two and k distributions has long been an interesting issue in statistical inference. To overcome the sparseness of data points in high-dimensional space and deal with the general cases, we suggest several projection pursuit type statistics. Some results on the limiting distributions of the statistics are obtained, some properties of Bootstrap approximation are investigated. Furthermore, for computational reasons an approximation for the statistics the based on Number theoretic method is applied. Several simulation experiments are performed.

A Study on Variance Change Point Detection for Time Series Data in Progress (진행중인 시계열데이터에서 분산 변화점 탐지에 관한 연구)

  • Choi Hyun-Seok;Kang Hoon-Kyu;Song Gyu-Moon;Kim Tae-Yoon
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.2
    • /
    • pp.369-377
    • /
    • 2006
  • This paper considers moving variance ratio (MVR) for valiance detection problem with time series data in progress. For testing purpose, parametric method based on F distribution and nonparametric method based on empirical distribution are compared via simulation study.

A Test for Independence between Two Infinite Order Autoregressive Processes

  • Kim, Eun-Hee;Lee, Sang-Yeol
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.05a
    • /
    • pp.191-197
    • /
    • 2003
  • This paper considers the independence test for two stationary infinite order autoregressive processes. For a test, we follow the empirical process method devised by Hoeffding (1948) and Blum, Kiefer and Rosenblatt (1961), and construct the Cram${\acute{e}}$r-von Mises type test statistics based on the least squares residuals. It is shown that the proposed test statistics behave asymptotically the same as those based on true errors.

  • PDF

On Distribution of Order Statistics from Kumaraswamy Distribution

  • Garg, Mridula
    • Kyungpook Mathematical Journal
    • /
    • v.48 no.3
    • /
    • pp.411-417
    • /
    • 2008
  • In the present paper we derive the distribution of single order statistics, joint distribution of two order statistics and the distribution of product and quotient of two order statistics when the independent random variables are from continuous Kumaraswamy distribution. In particular the distribution of product and quotient of extreme order statistics and consecutive order statistics have also been obtained. The method used is based on Mellin transform and its inverse.

Method-Free Permutation Predictor Hypothesis Tests in Sufficient Dimension Reduction

  • Lee, Kyungjin;Oh, Suji;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.4
    • /
    • pp.291-300
    • /
    • 2013
  • In this paper, we propose method-free permutation predictor hypothesis tests in the context of sufficient dimension reduction. Different from an existing method-free bootstrap approach, predictor hypotheses are evaluated based on p-values; therefore, usual statistical practitioners should have a potential preference. Numerical studies validate the developed theories, and real data application is provided.

Estimation on a two-parameter Rayleigh distribution under the progressive Type-II censoring scheme: comparative study

  • Seo, Jung-In;Seo, Byeong-Gyu;Kang, Suk-Bok
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.2
    • /
    • pp.91-102
    • /
    • 2019
  • In this paper, we propose a new estimation method based on a weighted linear regression framework to obtain some estimators for unknown parameters in a two-parameter Rayleigh distribution under a progressive Type-II censoring scheme. We also provide unbiased estimators of the location parameter and scale parameter which have a nuisance parameter, and an estimator based on a pivotal quantity which does not depend on the other parameter. The proposed weighted least square estimator (WLSE) of the location parameter is not dependent on the scale parameter. In addition, the WLSE of the scale parameter is not dependent on the location parameter. The results are compared with the maximum likelihood method and pivot-based estimation method. The assessments and comparisons are done using Monte Carlo simulations and real data analysis. The simulation results show that the estimators ${\hat{\mu}}_u({\hat{\theta}}_p)$ and ${\hat{\theta}}_p({\hat{\mu}}_u)$ are superior to the other estimators in terms of the mean squared error (MSE) and bias.