• Title/Summary/Keyword: statistical invariant

Search Result 65, Processing Time 0.023 seconds

The Null Distribution of the Likelihood Ratio Test for a Mixture of Two Gammas

  • Min, Dae-Hee
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.289-298
    • /
    • 1998
  • We investigate the distribution of likelihood ratio test(LRT) of null hypothesis a sample is from single gamma with unknown shape and scale against the alternative hypothesis a sample is from a mixture of two gammas, each with unknown scale and unknown (but equal) scale. To obtain stable maximum likelihood estimates(MLE) of a mixture of two gamma distributions, the EM(Dempster, Laird, and Robin(1977))and Modified Newton(Jensen and Johansen(1991)) algorithms were implemented. Based on EM, we made a simple structure likelihood equation for each parameter and could obtain stable solution by Modified Newton Algorithms. Simulation study was conducted to investigate the distribution of LRT for sample size n = 25, 50, 75, 100, 50, 200, 300, 400, 500 with 2500 replications. To determine the small sample distribution of LRT, I considered the model of a gamma distribution with shape parameter equal to 1 + f(n) and scale parameter equal to 2. The simulation results indicate that the null distribution is essentially invariant to the value of the shape parameter. Modeling of the null distribution indicates that it is well approximated by a gamma distribution with shape parameter equal to the quantity $0.927+1.18/\sqrt{n}$ and scale parameter equal to 2.16.

  • PDF

Double K-Means Clustering (이중 K-평균 군집화)

  • 허명회
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.343-352
    • /
    • 2000
  • In this study. the author proposes a nonhierarchical clustering method. called the "Double K-Means Clustering", which performs clustering of multivariate observations with the following algorithm: Step I: Carry out the ordinary K-means clmitering and obtain k temporary clusters with sizes $n_1$,... , $n_k$, centroids $c_$1,..., $c_k$ and pooled covariance matrix S. $\bullet$ Step II-I: Allocate the observation x, to the cluster F if it satisfies ..... where N is the total number of observations, for -i = 1, . ,N. $\bullet$ Step II-2: Update cluster sizes $n_1$,... , $n_k$, centroids $c_$1,..., $c_k$ and pooled covariance matrix S. $\bullet$ Step II-3: Repeat Steps II-I and II-2 until the change becomes negligible. The double K-means clustering is nearly "optimal" under the mixture of k multivariate normal distributions with the common covariance matrix. Also, it is nearly affine invariant, with the data-analytic implication that variable standardizations are not that required. The method is numerically demonstrated on Fisher's iris data.

  • PDF

Stabilization of the Time-variant Cointegrating Relations (시간가변적 공적분관계의 안정화)

  • Kim, Tae-Ho;Park, Ji-Won
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.5
    • /
    • pp.727-738
    • /
    • 2008
  • If a cointegrating relation is affected by important economic and political events occurred in the sample period, the assumption of the time-invariant cointegrating vector is violated, which leads to the misrep-resentation of the actual relations between the variables. From such a viewpoint, this study utilizes the recursive estimation process in testing for the stability of the long-run equilibrium of the domestic stock market system and then attempts to develop the framework for stabilizing time-variant cointegraing relations by introducing the dummy variables where the structural changes are found to exist.

Edge Detection using Morphological Amoebas Noisy Images (잡음영상에서 아메바를 이용한 형태학적 에지검출)

  • Lee, Won-Yeol;Kim, Se-Yun;Kim, Young-Woo;Lim, Jae-Young;Lim, Dong-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.3
    • /
    • pp.569-584
    • /
    • 2009
  • Edge detection in images has been widely used in image processing system and computer vision. Morphological edge detection has used structuring elements with fixed shapes. This paper presents morphological operators with non-fixed shape kernels, or amoebas, which take into account the image contour variations to adapt their shape. Experimental results are analyzed in both qualitative analysis through visual inspection and quantitative analysis with PFOM and ROC curves. The Experiments demonstrate that these novel operators outperform classical morphological operations with a fixed, space-invariant structuring elements for edge detection applications.

Bootstrap inference for covariance matrices of two independent populations (두 독립 모집단의 공분산 행렬에 대한 붓스트랩 추론)

  • 김기영;전명식
    • The Korean Journal of Applied Statistics
    • /
    • v.4 no.1
    • /
    • pp.1-11
    • /
    • 1991
  • It is of great interest to consider the homogeniety of covariance matrices in MANOVA of discriminant analysis. If we lock at the problem of testing hypothesis, H : $\Sigma_1 = \Sigma_2$ from an invariance point of view where $\Sigma_i$ are the covariance matrix of two independent p-variate distribution, the testing problem is invariant under the group of nonsingular transformations and the hypothesis becomes H : $\delta_1 = \delta_2 = \cdots = \delta_p = 1$ where $\delta = (\delta_1, \delta_2, \cdots, \delta_p)$ is a vector of latent roots of $\Sigma$. Bias-corrected estimators of eigenvalues and sampling distribution of the test statistics proposed are obtained. Pooled-bootstrap method also considered for Bartlett's modified likelihood ratio statistics.

  • PDF

Estimation of the Number of Sources Based on Hypothesis Testing

  • Xiao, Manlin;Wei, Ping;Tai, Heng-Ming
    • Journal of Communications and Networks
    • /
    • v.14 no.5
    • /
    • pp.481-486
    • /
    • 2012
  • Accurate and efficient estimation of the number of sources is critical for providing the parameter of targets in problems of array signal processing and blind source separation among other such problems. When conventional estimators work in unfavorable scenarios, e.g., at low signal-to-noise ratio (SNR), with a small number of snapshots, or for sources with a different strength, it is challenging to maintain good performance. In this paper, the detection limit of the minimum description length (MDL) estimator and the signal strength required for reliable detection are first discussed. Though a comparison, we analyze the reason that performances of classical estimators deteriorate completely in unfavorable scenarios. After discussing the limiting distribution of eigenvalues of the sample covariance matrix, we propose a new approach for estimating the number of sources which is based on a sequential hypothesis test. The new estimator performs better in unfavorable scenarios and is consistent in the traditional asymptotic sense. Finally, numerical evaluations indicate that the proposed estimator performs well when compared with other traditional estimators at low SNR and in the finite sample size case, especially when weak signals are superimposed on the strong signals.

Photon-counting linear discriminant analysis for face recognition at a distance

  • Yeom, Seok-Won
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.12 no.3
    • /
    • pp.250-255
    • /
    • 2012
  • Face recognition has wide applications in security and surveillance systems as well as in robot vision and machine interfaces. Conventional challenges in face recognition include pose, illumination, and expression, and face recognition at a distance involves additional challenges because long-distance images are often degraded due to poor focusing and motion blurring. This study investigates the effectiveness of applying photon-counting linear discriminant analysis (Pc-LDA) to face recognition in harsh environments. A related technique, Fisher linear discriminant analysis, has been found to be optimal, but it often suffers from the singularity problem because the number of available training images is generally much smaller than the number of pixels. Pc-LDA, on the other hand, realizes the Fisher criterion in high-dimensional space without any dimensionality reduction. Therefore, it provides more invariant solutions to image recognition under distortion and degradation. Two decision rules are employed: one is based on Euclidean distance; the other, on normalized correlation. In the experiments, the asymptotic equivalence of the photon-counting method to the Fisher method is verified with simulated data. Degraded facial images are employed to demonstrate the robustness of the photon-counting classifier in harsh environments. Four types of blurring point spread functions are applied to the test images in order to simulate long-distance acquisition. The results are compared with those of conventional Eigen face and Fisher face methods. The results indicate that Pc-LDA is better than conventional facial recognition techniques.

Privacy Preserving Data Publication of Dynamic Datasets (프라이버시를 보호하는 동적 데이터의 재배포 기법)

  • Lee, Joo-Chang;Ahn, Sung-Joon;Won, Dong-Ho;Kim, Ung-Mo
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.18 no.6A
    • /
    • pp.139-149
    • /
    • 2008
  • The amount of personal information collected by organizations and government agencies is continuously increasing. When a data collector publishes personal information for research and other purposes, individuals' sensitive information should not be revealed. On the other hand, published data is also required to provide accurate statistical information for analysis. k-Anonymity and ${\iota}$-diversity models are popular approaches for privacy preserving data publication. However, they are limited to static data release. After a dataset is updated with insertions and deletions, a data collector cannot safely release up-to-date information. Recently, the m-invariance model has been proposed to support re-publication of dynamic datasets. However, the m-invariant generalization can cause high information loss. In addition, if the adversary already obtained sensitive values of some individuals before accessing released information, the m-invariance leads to severe privacy disclosure. In this paper, we propose a novel technique for safely releasing dynamic datasets. The proposed technique offers a simple and effective method for handling inserted and deleted records without generalization. It also gives equivalent degree of privacy preservation to the m-invariance model.

A Comparison of Two Models for Forecasting Mortality in South Korea (사망률 예측을 위한 모형 비교)

  • Park Yousung;Kim Kee Whan;Lee Dong-Hee;Lee Yeon Kyung
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.3
    • /
    • pp.639-654
    • /
    • 2005
  • The Lee and Carter method has widely used to forecast mortality because of the simple structure of model and the stable forecasting. The Lee and Carter method, however, also has limitations. The assumption of the rate of decline in mortality at each age remaining invariant over time has been violated in several decades. And, there is no way to include covariates in the model for better forecasts. Here we introduce Park, Choi and Kim method to make up for Lee and Carter's weak points by using two random processes. We discuss structural features of two methods. furthermore, for each method, we forecast life expectancy for 2005 to 2050 using South Korea data and compare the results.

A Fingerprint Identification System using Large Database (대용량 DB를 사용한 지문인식 시스템)

  • Cha, Jeong-Hee;Seo, Jeong-Man
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.4 s.36
    • /
    • pp.203-211
    • /
    • 2005
  • In this paper, we propose a new automatic fingerprint identification system that identifies individuals in large databases. The algorithm consists of three steps; preprocessing, classification, and matching, in the classification. we present a new classification technique based on the statistical approach for directional image distribution. In matching, we also describe improved minutiae candidate pair extraction algorithm that is faster and more accurate than existing algorithm. In matching stage, we extract fingerprint minutiaes from its thinned image for accuracy, and introduce matching process using minutiae linking information. Introduction of linking information into the minutiae matching process is a simple but accurate way, which solves the problem of reference minutiae pair selection in comparison stage of two fingerprints quickly. This algorithm is invariant to translation and rotation of fingerprint. The proposed system was tested on 1000 fingerprint images from the semiconductor chip style scanner. Experimental results reveal false acceptance rate is decreased and genuine acceptance rate is increased than existing method.

  • PDF