• Title/Summary/Keyword: 다변량정규분포

Search Result 57, Processing Time 0.028 seconds

A Test of the Multivariate Normality Based on Likelihood Functions (가능도 함수를 기초로 한 다변량 정규성 검정)

  • Yeo, In-Kwon
    • The Korean Journal of Applied Statistics
    • /
    • v.15 no.2
    • /
    • pp.223-232
    • /
    • 2002
  • The present paper develops a test of the multivariate normality based on nonlinear transformations and the likelihood function. For checking the normality, we test the shape parameter which indexes the family of transformations. A score test and a parametric bootstrap test are used to evaluate the discrepancy between the data and a multivariate normal distribution. In order to compare the performance of our test with the existing tests, a simulation study was carried out for several situations where nuisance parameters have to be estimated. The results showed that the proposed method is superior to the existing methods.

다변량 자료의 분산균일성 검정-피트만 방법의 확장-

  • 허명회;양경숙
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.1
    • /
    • pp.41-47
    • /
    • 1997
  • 본 연구의 목적은 p변량 관측지가 등상관구조를 갖는 경우 주변분산들의 균일성을 검정하는 통계적 절차를 개발하는 것이다. 이를 위하여 2변량의 경우에 적용되는 피트만(Pitman)의 방법을 3변량 이상의 경우로 확장하고 피셔(Fisher)의 임의화 검정을 적용하여 정규분포의 틀에 의존하지 않는 p 값을 산출한다.

  • PDF

한국증권시장(韓國證券市場)에서 대용시장(代用市場)포트폴리오효율성(效率性)의 GMM에 의한 다변량(多變量) 검증(檢證)

  • Gu, Bon-Yeol
    • The Korean Journal of Financial Management
    • /
    • v.15 no.1
    • /
    • pp.1-30
    • /
    • 1998
  • 본(本) 연구(硏究)는 한국증권시장에서 대표적 대용시장(代用市場)포트폴리오인 한국종합주가지수(韓國綜合株價指數)와 동일가중지수(同一加重指數)의 효율성에 대한 검증을 Hansen(1982)의 다변량의 GMM에 의하여 실시하고자 하였다. 이를 위하여 먼저, 주식수익률자료(株式收益率資料)에 근거한 산업별(産業別)포트폴리오수익률과 초과시장수익률모형(超過市場收益率模型)의 오차항(誤差項)이 정규분포(正規分布)를 벗어남을 증명함으로써 GMM검증방법(檢證方法)의 정당성을 찾고자 하였다. 정규분포에 대한 검증방법(檢證方法)으로서 왜도와 첨도의 검증과 이를 결합한 Jarque-Bera(1980)검증(檢證)을 실시하였다. 둘째로, Hansen(1982)의 GMM을 대용시장(代用市場)포트폴리오의 효율성(效率性) 검증(檢證)에 적용하는 방법에 대한 연구들인 Mackinlay-Richardson(1991), Harvey-Zhou(1993)와 Campbell-Lo-Mackilay(1997) 등을 기초로하여 이들의 방법론을 개선한 3가지의 효율성(效率性) 검증방법(檢證方法)을 제시하였다. 셋째로, 이상의 검증방법(檢證方法)들을 토대로 1980년 1월부터 1997년 6월까지 월별주식수익률(月別株式收益率)의 자료(資料)를 11업종으로 분류하여 산업별(産業別)포트폴리오수익률(收益率)과 초과시장수익률모형(超過市場收益率模型)에 의한 오차항(誤差項)이 정규분포(正規分布)를 따르는지와 아울러 대용시장(代用市場)포트폴리오의 효율성을 검증하였다. 검증결과(檢證結果), 산업별(産業別)포트폴리오수익률과 오차항(誤差項)은 대부분 정규성이 기각(棄却)되어 GMM검증방법(檢證方法)의 정당성이 입증되었다. 따라서 GMM에 의한 효율성(效率性)을 검증한 결과, 한국종합주가지수(韓國綜合株價指數)의 경우에는 평균-분산(平均-分散)프론티어(mean-variance frontier)상(上)에서의 대용시장(代用市場)포트폴리오의 효율성(效率性)은 기각(棄却) 할 수 없는 것으로 나타났으나 평균수익률(平均收益率)이 GMVP의 수익률보다 낮았기 때문에 효율적(效率的) 프론티어(efficient frontier)상(上)의 대용시장(代用市場)포트폴리오의 효율성(效率性)은 기각(棄却)되어 대용시장지수로서의 문제점이 있는 것으로 나타났다. 그러나 동일가중지수(同一加重指數)는 평균수익률이 GMVP의 수익률보다 높을 뿐만아니라 효율적(效率的) 프론티어상(上)의 대용시장(代用市場)포트폴리오의 효율성(效率性)도 채택되어 한국종합주가지수(韓國綜合株價指數)보다 우월한 지수(指數)인 것으로 나타났다.

  • PDF

Efficient variable selection method using conditional mutual information (조건부 상호정보를 이용한 분류분석에서의 변수선택)

  • Ahn, Chi Kyung;Kim, Donguk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1079-1094
    • /
    • 2014
  • In this paper, we study efficient gene selection methods by using conditional mutual information. We suggest gene selection methods using conditional mutual information based on semiparametric methods utilizing multivariate normal distribution and Edgeworth approximation. We compare our suggested methods with other methods such as mutual information filter, SVM-RFE, Cai et al. (2009)'s gene selection (MIGS-original) in SVM classification. By these experiments, we show that gene selection methods using conditional mutual information based on semiparametric methods have better performance than mutual information filter. Furthermore, we show that they take far less computing time than Cai et al. (2009)'s gene selection but have similar performance.

Development of the Hill-Sliding Clustering Algorithm Using BASIC Language (BASIC 언어를 사용한 Hill-Sliding 무감독 분류법 Algorithm 개발)

  • 鄭夢炫;崔圭弘;朴景允;Park, J.Kyoungyoon
    • Korean Journal of Remote Sensing
    • /
    • v.1 no.1
    • /
    • pp.89-97
    • /
    • 1985
  • An algorithm for the Hill-Sliding Clustering (HSC) method was developed using the BASIC language for Apple II personal computer. It was designed for initialization of clusters from multivariate multimodal Gaussian data. Landsat multispectral imagery data of a Korean coastal area were used for its performance test. The test showed encouraging results.

한국증권시장(韓國證券市場)에서 다변량검증(多變量檢證)에 근거한 CAPM과 APM의 실증적(實證的) 검증(檢證)

  • Gu, Bon-Yeol
    • The Korean Journal of Financial Studies
    • /
    • v.5 no.1
    • /
    • pp.135-164
    • /
    • 1999
  • 본(本) 연구(硏究)는 Jobson(1982)의 주식(株式)의 수익율(收益率)이 정규분포(正規分布)를 할 경우에 다변량(多變量)의 통계학(統計學)을 이용하여 CAPM과 APM을 검증(檢證)하는 방법(方法)을 유도하였다. 이에따라 회귀분석(回歸分析)에 의한 검증방법(檢證方法)과 다변량(多變量)의 검증방법(檢證方法)을 제시하고 현실적으로 CAPM과 APM이 한국증권시장(韓國證券市場)에서 적용가능(適用可能)한가에 대한 실증적(實證的) 검증(檢證)을 실시하였다. 실증적(實證的) 검증(檢證)을 위하여 먼저 우리나라의 주식수익율자료(株式收益率資料)를 1980년 1월부터 1997년 6월까지의 월별자료(月別資料)에 의하여 11개 산업별(産業別) 분류작업을 통하여 산업별(産業別)포트폴리오를 구성하였다. 특히 APM의 경우에는 요인(要因)의 증가에 따라 APM이 한국증권시장에서 적용가능한가를 검증(檢證)하기 위하여 요인(要因)을 2개, 6개 그리고 10개까지 증가시켜 모형(模型)의 적합성(適合性)을 검증(檢證)하였다. 검증결과(檢證結果), CAPM과 APM모두 한국증권시장(韓國證券市場)에서 적용가능(適用可能)한 것으로 나타났다. 특히 APM의 경우에는 요인(要因)이 2개, 6개와 10개로 증가시 어떤 경우에도 적용가능한 것으로 나타났다. 이는 기대수익율(期待收益率)의 설명력을 높이기 위하여 몇 개의 가격화(價格化) 요인(要因)이 APM에 영향을 미치는 가를 연구하는 전통적인 검증방법(檢證方法)은 큰 의미가 없는 것으로 나타났다.

  • PDF

Derivation of the Critical Minimum Values of the Multiple Correlation Coefficient for Augmenting Hydrologic Samples (수문자료 확충을 위한 다중상관계수의 한계최소치 유도)

  • 허준행
    • Water for future
    • /
    • v.27 no.1
    • /
    • pp.133-140
    • /
    • 1994
  • The augmenting hydrologic data using a correlation procedue has been used to improve the estimates of the mean and variance at the site of interest with short record when one or more nearby sites with longer records are available. The variance of the unbiased maximum likelihood estimator of ${{\sigma}_v}^2$ derived by Moran based on the multivariate normal distribution is modified into the form of Matalas and jacobs for the bivariate normal distribution to get the critical minimum values of the multiple correlation coefficient which give the improvement for estimation the variance at the site of interest. Those values are tabulated for various lengths of records and the number of sites.

  • PDF

Mapping of Temperature and Rainfall Using DEM and Multivariate Kriging (수치표고모델과 다변량 크리깅을 이용한 기온 및 강수 분포도 작성)

  • Park, No-Wook;Jang, Dong-Ho
    • Journal of the Korean Geographical Society
    • /
    • v.43 no.6
    • /
    • pp.1002-1015
    • /
    • 2008
  • We investigate the potential of digital elevation model and multivariate geostatistical kriging in mapping of temperature and rainfall based on sparse weather station observations. By using elevation data which have reasonable correlation with temperature and rainfall, and are exhaustively sampled in the study area, we try to generate spatial distributions of temperature and rainfall which well reflect topographic effects and have less smoothing effects. To illustrate the applicability of this approach, we carried out a case study of Jeju island using observation data acquired in January, April, August, and October, 2005. From the case study results, accounting for elevation via colocated cokriging could reflect detailed topographic characteristics in the study area with less smoothing effects. Colocated cokriging also showed much improved prediction capability, compared to that of traditional univariate ordinary kriging. According to the increase of the magnitude of correlation between temperature or rainfall and elevation, much improved prediction capability could be obtained. The decrease of relative nugget effects also resulted in the improvement of prediction capability.

Estimation of Spatial Distribution Using the Gaussian Mixture Model with Multivariate Geoscience Data (다변량 지구과학 데이터와 가우시안 혼합 모델을 이용한 공간 분포 추정)

  • Kim, Ho-Rim;Yu, Soonyoung;Yun, Seong-Taek;Kim, Kyoung-Ho;Lee, Goon-Taek;Lee, Jeong-Ho;Heo, Chul-Ho;Ryu, Dong-Woo
    • Economic and Environmental Geology
    • /
    • v.55 no.4
    • /
    • pp.353-366
    • /
    • 2022
  • Spatial estimation of geoscience data (geo-data) is challenging due to spatial heterogeneity, data scarcity, and high dimensionality. A novel spatial estimation method is needed to consider the characteristics of geo-data. In this study, we proposed the application of Gaussian Mixture Model (GMM) among machine learning algorithms with multivariate data for robust spatial predictions. The performance of the proposed approach was tested through soil chemical concentration data from a former smelting area. The concentrations of As and Pb determined by ex-situ ICP-AES were the primary variables to be interpolated, while the other metal concentrations by ICP-AES and all data determined by in-situ portable X-ray fluorescence (PXRF) were used as auxiliary variables in GMM and ordinary cokriging (OCK). Among the multidimensional auxiliary variables, important variables were selected using a variable selection method based on the random forest. The results of GMM with important multivariate auxiliary data decreased the root mean-squared error (RMSE) down to 0.11 for As and 0.33 for Pb and increased the correlations (r) up to 0.31 for As and 0.46 for Pb compared to those from ordinary kriging and OCK using univariate or bivariate data. The use of GMM improved the performance of spatial interpretation of anthropogenic metals in soil. The multivariate spatial approach can be applied to understand complex and heterogeneous geological and geochemical features.

Geostatistical Integration of Ground Survey Data and Secondary Data for Geological Thematic Mapping (지질 주제도 작성을 위한 지표 조사 자료와 부가 자료의 지구통계학적 통합)

  • Park, No-Wook;Jang, Dong-Ho;Chi, Kwang-Hoon
    • Korean Journal of Remote Sensing
    • /
    • v.22 no.6
    • /
    • pp.581-593
    • /
    • 2006
  • Various geological thematic maps have been generated by interpolating sparsely sampled ground survey data and geostatistical kriging that can consider spatial correlation between neighboring data has widely been used. This paper applies multi-variate geostatistical algorithms to integrate secondary information with sparsely sampled ground survey data for geological thematic mapping. Simple kriging with local means and kriging with an external drift are applied among several multi-variate geostatistical algorithms. Two case studies for spatial mapping of groundwater level and grain size have been carried out to illustrate the effectiveness of multi-variate geostatistical algorithms. A digital elevation model and IKONOS remote sensing imagery were used as secondary information in two case studies. Two multi-variate geostatistical algorithms, which can account for both spatial correlation of neighboring data and secondary data, showed smaller prediction errors and more local variations than those of ordinary kriging and linear regression. The benefit of applying the multi-variate geostatistical algorithms, however, depends on sampling density, magnitudes of correlation between primary and secondary data, and spatial correlation of primary data. As a result, the experiment for spatial mapping of grain size in which the effects of those factors were dominant showed that the effect of using the secondary data was relatively small than the experiment for spatial mapping of groundwater level.