• Title/Abstract/Keywords: Statistics Matching

Noninformative priors for Pareto distribution

  • Kim, Dal-Ho;Kang, Sang-Gil;Lee, Woo-Dong
    • Journal of the Korean Data and Information Science Society / Vol. 20, No. 6 / pp.1213-1223 / 2009
  • In this paper, we develop noninformative priors for the two-parameter Pareto distribution. Specifically, we derive Jeffreys' prior, the probability matching prior, and the reference prior for the parameter of interest. In our case, the probability matching prior is only a first-order matching prior, and no second-order matching prior exists. A simulation study indicates that the matching prior performs better in achieving the coverage probability. A real example is also considered.

NONINFORMATIVE PRIORS FOR LINEAR COMBINATION OF THE INDEPENDENT NORMAL MEANS

  • Kang, Sang-Gil;Kim, Dal-Ho;Lee, Woo-Dong
    • Journal of the Korean Statistical Society / Vol. 33, No. 2 / pp.203-218 / 2004
  • In this paper, we develop matching priors and reference priors for a linear combination of the means of normal populations with equal variances. We prove that the matching priors are in fact second-order matching priors, and show that they match alternative coverage probabilities up to the second order (Mukerjee and Reid, 1999) and are also HPD matching priors. Among all of the reference priors, the one-at-a-time reference prior turns out to satisfy a second-order matching criterion. Our simulation study indicates that the one-at-a-time reference prior performs better than the other reference priors in matching the target coverage probabilities in a frequentist sense. We compute Bayesian credible intervals for a linear combination of the means based on the reference priors.

On Logistic Regression Analysis Using Propensity Score Matching

  • 김소연;백종일
    • Journal of Applied Reliability (Journal of the Korean Reliability Society) / Vol. 16, No. 4 / pp.323-330 / 2016
  • Purpose: Propensity score matching is now used in a large number of research papers, yet no study has tested model fit before and after propensity score matching. This study therefore compares the fit of a logistic regression analysis before and after propensity score matching, using data from the 'online survey of adolescent health'. Method: Records with similar propensity in the two groups are extracted by propensity score matching, and logistic regression is then run separately on the data before and after matching. Results: To test the fit of the logistic regression model, we use the model summary, the -2 log likelihood, and the Hosmer-Lemeshow test. The results confirm that the data after matching are more suitable for logistic regression analysis than the data before matching. Conclusion: Using propensity score matching therefore yields a logistic regression model with better fit.
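
The matching step described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: it assumes a single covariate, estimates the propensity score with a hand-written logistic regression fitted by gradient ascent, and uses greedy 1:1 nearest-neighbour matching without replacement. The function names, the caliper of 0.2, and the toy data are all hypothetical.

```python
import math


def fit_logistic(xs, ts, lr=0.1, iters=2000):
    """Fit P(t=1 | x) = sigmoid(b0 + b1*x) by plain gradient ascent."""
    b0 = b1 = 0.0
    n = len(xs)
    for _ in range(iters):
        g0 = g1 = 0.0
        for x, t in zip(xs, ts):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
            g0 += t - p
            g1 += (t - p) * x
        b0 += lr * g0 / n
        b1 += lr * g1 / n
    return b0, b1


def propensity_match(xs, ts, caliper=0.2):
    """Greedy 1:1 nearest-neighbour matching on the estimated propensity score.

    Returns (pairs, scores); each pair is (treated_index, control_index).
    The caliper value is an illustrative assumption, not taken from the paper.
    """
    b0, b1 = fit_logistic(xs, ts)
    scores = [1.0 / (1.0 + math.exp(-(b0 + b1 * x))) for x in xs]
    treated = [i for i, t in enumerate(ts) if t == 1]
    controls = {i for i, t in enumerate(ts) if t == 0}
    pairs = []
    for i in treated:
        if not controls:
            break
        # Closest unused control; ties are broken by the smaller index.
        j = min(controls, key=lambda c: (abs(scores[c] - scores[i]), c))
        if abs(scores[j] - scores[i]) <= caliper:
            pairs.append((i, j))
            controls.remove(j)  # matching without replacement
    return pairs, scores


# Toy data (hypothetical): one covariate and a binary group indicator.
xs = [0.1, 0.4, 0.5, 0.9, 1.2, 1.5, 1.6, 2.0, 2.3, 2.8]
ts = [0, 0, 1, 0, 1, 0, 1, 1, 0, 1]
pairs, scores = propensity_match(xs, ts)
print(pairs)
```

The outcome model would then be fitted twice, once on all records and once on the records listed in `pairs`, and the fit statistics of the two models compared.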

Statistical micro matching using a multinomial logistic regression model for categorical data

  • Kim, Kangmin;Park, Mingue
    • Communications for Statistical Applications and Methods / Vol. 26, No. 5 / pp.507-517 / 2019
  • Statistical matching is a method of combining multiple sources of data that are extracted or surveyed from the same population. It can be used in situations where the variables of interest are not jointly observed. Because it creates synthetic data from existing sources, it is a low-cost approach with potentially high payoff. In this paper, we propose several statistical micro matching methods using a multinomial logistic regression model for the case where all variables of interest are categorical or categorized, which is common in sample surveys. Under the conditional independence assumption (CIA), we propose a mixed statistical matching method, which is useful when auxiliary information is not available. We also propose a statistical matching method with auxiliary information that reduces the bias of the conventional matching methods suggested under the CIA. The proposed micro matching methods are compared with conventional ones in a simulation study, which shows that the proposed methods outperform the existing ones, especially when the CIA does not hold.

A𝛼-SPECTRAL EXTREMA OF GRAPHS WITH GIVEN SIZE AND MATCHING NUMBER

  • Xingyu Lei;Shuchao Li;Jianfeng Wang
    • Bulletin of the Korean Mathematical Society / Vol. 60, No. 4 / pp.873-893 / 2023
  • In 2017, Nikiforov proposed the A𝛼-matrix of a graph G. This matrix is defined as A𝛼(G) = 𝛼D(G) + (1 - 𝛼)A(G), 𝛼 ∈ [0, 1], where D(G) and A(G) are the diagonal degree matrix and the adjacency matrix of G, respectively. Recently, Zhai, Xue and Liu [39] considered the Brualdi-Hoffman-type problem for Q-spectra of graphs with given matching number. Continuing that work, we consider the Brualdi-Hoffman-type problem for A𝛼-spectra of graphs with given matching number. We identify the graphs with given size and matching number having the largest A𝛼-spectral radius for 𝛼 ∈ [1/2, 1).
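
As an illustration of the definition A𝛼(G) = 𝛼D(G) + (1 - 𝛼)A(G), the following Python sketch builds the matrix for a small simple graph and approximates its spectral radius. It is only a sketch: the function names are hypothetical, the graph is assumed to have at least one edge and no loops, and shifted power iteration is used as a generic numerical method, not one taken from the paper.

```python
import math


def a_alpha_matrix(n, edges, alpha):
    """Return A_alpha = alpha*D + (1 - alpha)*A for a simple graph on n vertices."""
    deg = [0] * n
    adj = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        adj[u][v] = adj[v][u] = 1.0
        deg[u] += 1
        deg[v] += 1
    return [
        [alpha * deg[i] if i == j else (1.0 - alpha) * adj[i][j] for j in range(n)]
        for i in range(n)
    ]


def spectral_radius(m, iters=500):
    """Approximate the largest eigenvalue of a symmetric nonnegative matrix.

    Power iteration is run on m + shift*I, where shift is the largest absolute
    row sum; by Gershgorin's theorem every eigenvalue of m is >= -shift, so the
    shifted matrix is positive semidefinite and the iteration cannot oscillate.
    Assumes m is not the zero matrix.
    """
    n = len(m)
    shift = max(sum(abs(v) for v in row) for row in m)
    x = [1.0] * n
    for _ in range(iters):
        y = [sum(m[i][j] * x[j] for j in range(n)) + shift * x[i] for i in range(n)]
        norm = math.sqrt(sum(v * v for v in y))
        x = [v / norm for v in y]
    mx = [sum(m[i][j] * x[j] for j in range(n)) for i in range(n)]
    return sum(x[i] * mx[i] for i in range(n))  # Rayleigh quotient of the unit vector x


# Star K_{1,3}: vertex 0 is the centre.
star = [(0, 1), (0, 2), (0, 3)]
print(spectral_radius(a_alpha_matrix(4, star, 0.5)))
```

For the star K_{1,3} at 𝛼 = 1/2 the matrix equals half the signless Laplacian, whose largest eigenvalue for a star on 4 vertices is 4, so the printed value should be close to 2.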

NONINFORMATIVE PRIORS FOR PARETO DISTRIBUTION : REGULAR CASE

  • 김달호;이우동;강상길
    • Korean Data and Information Science Society: Conference Proceedings / Korean Data and Information Science Society 2003 Spring Conference / pp.27-37 / 2003
  • In this paper, we develop noninformative priors for the two-parameter Pareto distribution. Specifically, we derive Jeffreys' prior, the probability matching prior, and the reference prior for the parameter of interest. In our case, the probability matching prior is only a first-order matching prior, and no second-order matching prior exists. A simulation study indicates that the matching prior performs better in achieving the coverage probability. A real example is also given.

Association Rule Mining by Environmental Data Fusion

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society / Vol. 18, No. 2 / pp.279-287 / 2007
  • Data fusion is the process of combining multiple data sets in order to produce information of tactical value to the user. It is generally defined as the use of techniques that combine data from multiple sources and gather that information in order to draw inferences. Data fusion is also called data combination or data matching. It is divided into five branch types: exact matching, judgemental matching, probability matching, statistical matching, and data linking. In this paper, we develop a SAS macro program for statistical matching, one of the five branch types of data fusion. We then apply data fusion and association rule techniques to environmental data.

A Robust Approach of Regression-Based Statistical Matching for Continuous Data

  • Sohn, Soon-Cheol;Jhun, Myoung-Shic
    • The Korean Journal of Applied Statistics / Vol. 25, No. 2 / pp.331-339 / 2012
  • Statistical matching is a methodology used to merge microdata from two (or more) files into a single matched file, the variants of which have been extensively studied. Among existing studies, we focused on Moriarity and Scheuren's (2001) method, which is a representative method of statistical matching for continuous data. We examined this method and proposed a revision to it by using a robust approach in the regression step of the procedure. We evaluated the efficiency of our revised method through simulation studies using both simulated and real data, which showed that the proposed method has distinct advantages over existing alternatives.
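
The general idea of regression-based matching can be sketched in Python as follows. This is a deliberately simplified illustration, not Moriarity and Scheuren's procedure and not the robust revision proposed in the paper: it assumes one common variable x, fits ordinary least squares on the donor file, predicts the missing variable z for each recipient record, and then takes the observed donor value closest to that prediction. All names and data are hypothetical.

```python
def ols_fit(xs, zs):
    """Simple linear regression z = a + b*x; returns (intercept, slope)."""
    n = len(xs)
    mx = sum(xs) / n
    mz = sum(zs) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxz = sum((x - mx) * (z - mz) for x, z in zip(xs, zs))
    slope = sxz / sxx
    return mz - slope * mx, slope


def regression_match(recipient_x, donor_x, donor_z):
    """For each recipient record, predict z from x and pick the nearest observed donor z.

    Returns a list of (x, predicted_z, matched_donor_z) tuples.
    """
    a, b = ols_fit(donor_x, donor_z)
    matched = []
    for x in recipient_x:
        pred = a + b * x
        # Donor record whose observed z is closest to the regression prediction.
        j = min(range(len(donor_z)), key=lambda k: abs(donor_z[k] - pred))
        matched.append((x, pred, donor_z[j]))
    return matched


# Toy files (hypothetical): the donor observes (x, z); the recipient observes x only.
donor_x = [1.0, 2.0, 3.0, 4.0]
donor_z = [2.1, 3.9, 6.2, 7.8]
for row in regression_match([2.2, 3.6], donor_x, donor_z):
    print(row)
```

A robust variant would replace `ols_fit` with a regression estimator that is less sensitive to outliers in the donor file, leaving the matching step unchanged.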

Statistical Matching Techniques Using the Robust Regression Model

  • 전명식;정시송;박혜진
    • The Korean Journal of Applied Statistics / Vol. 21, No. 6 / pp.981-996 / 2008
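  • Statistical matching, which combines data files obtained from different sources into a single data file, makes it possible to examine the relationships among variables, including the common variables and the variables unique to each file. The statistical matching method proposed by Rubin (1986), which uses predicted values from an ordinary regression model, assumes multivariate normality of the data, so applying it to data that violate this assumption causes many problems. For the case where the unique variable of the donor file contains outliers that do not reflect the population distribution, this study proposes a matching method based on robust regression estimation as an alternative to statistical matching with the ordinary regression model. A simulation study is then conducted to compare the robust-regression-based and ordinary-regression-based matching methods in terms of how well they preserve correlations and coefficients of determination.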

Environmental Survey Data Analysis by Data Fusion Techniques

  • Cho, Kwang-Hyun;Park, Hee-Chang
    • Journal of the Korean Data and Information Science Society / Vol. 17, No. 4 / pp.1201-1208 / 2006
  • Data fusion is generally defined as the use of techniques that combine data from multiple sources and gather that information in order to draw inferences. Data fusion is also called data combination or data matching. It is divided into five branch types: exact matching, judgemental matching, probability matching, statistical matching, and data linking. Gyeongnam Province currently conducts a social survey of its residents every year. However, analysis is limited because different surveys are run on a three-year cycle. In this paper, we study data fusion of environmental survey data using a SAS macro. The data fusion outputs can be used for environmental preservation and environmental improvement.
