• Title/Summary/Keyword: nonparametric statistical method

Search Result 190, Processing Time 0.022 seconds

A Study on Variance Change Point Detection for Time Series Data in Progress (진행중인 시계열데이터에서 분산 변화점 탐지에 관한 연구)

  • Choi Hyun-Seok;Kang Hoon-Kyu;Song Gyu-Moon;Kim Tae-Yoon
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.2
    • /
    • pp.369-377
    • /
    • 2006
  • This paper considers moving variance ratio (MVR) for valiance detection problem with time series data in progress. For testing purpose, parametric method based on F distribution and nonparametric method based on empirical distribution are compared via simulation study.

Nonparmetric Method for Identifying Effective and Safe Doses using Placement (유효하고 안전한 용량 결정에 위치를 이용한 비모수적 방법)

  • Kim, Sunhye;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.7
    • /
    • pp.1197-1205
    • /
    • 2014
  • Typical clinical dose development studies consist of the comparison of several doses of a drug with a placebo. The primary interest is to find therapeutic window that satisfying both efficacy and safety. In this paper, we propose nonparametric method for identifying effective and safe doses in linear placement using score function. The Monte Carlo simulation is adapted to estimate the power and the family-wise error rate(FWE) of proposed procedure are compared with previous methods.

A Report on the Inter-Gene Correlations in cDNA Microarray Data Sets (cDNA 마이크로어레이에서 유전자간 상관 관계에 대한 보고)

  • Kim, Byung-Soo;Jang, Jee-Sun;Kim, Sang-Cheol;Lim, Jo-Han
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.3
    • /
    • pp.617-626
    • /
    • 2009
  • A series of recent papers reported that the inter-gene correlations in Affymetrix microarray data sets were strong and long-ranged, and the assumption of independence or weak dependence among gene expression signals which was often employed without justification was in conflict with actual data. Qui et al. (2005) indicated that applying the nonparametric empirical Bayes method in which test statistics were pooled across genes for performing the statistical inference resulted in the large variance of the number of differentially expressed genes. Qui et al. (2005) attributed this effect to strong and long-ranged inter-gene correlations. Klebanov and Yakovlev (2007) demonstrated that the inter-gene correlations provided a rich source of information rather than being a nuisance in the statistical analysis and they developed, by transforming the original gene expression sequence, a sequence of independent random variables which they referred to as a ${\delta}$-sequence. We note in this report using two cDNA microarray data sets experimented in this country that the strong and long-ranged inter-gene correlations were still valid in cDNA microarray data and also the ${\delta}$-sequence of independence could be derived from the cDNA microarray data. This note suggests that the inter-gene correlations be considered in the future analysis of the cDNA microarray data sets.

Analysis of Daily Distress Symptoms: Threshold Estimation after Isolating the Distress Group (매일의 불편감 증상점수의 분석: 불편감 증후군의 탐색과 증상 변화추세의 검정)

  • Lee, Won-Nyung;Song, Hae-Hiang
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.1
    • /
    • pp.123-138
    • /
    • 2010
  • After selecting a group of women with premenstrual syndrome based on daily distress scores of 28 days, one needs to estimate threshold for the change of symptoms, which would be useful for the clinician's diagnosis in hospitals. However, a test of whether a change has occurred has to precede the estimation of the threshold. In this paper, we apply parametric and nonparametric testing methods to an example data obtained from a group of women. Nonparametric method does not assume any distributional form of distress scores and parametric testing method is based on the normal distributions of linear regression lines. Therefore, the optimal situation of both methods would be different and we will assess it with a simulation study.

Determination of Sample Sizes of Bivariate Efficacy and Safety Outcomes (이변량 효능과 안전성 이항변수의 표본수 결정방법)

  • Lee, Hyun-Hak;Song, Hae-Hiang
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.341-353
    • /
    • 2009
  • We consider sample-size determination problem motivated by comparative clinical trials where patient outcomes are characterized by a bivariate outcome of efficacy and safety. Thall and Cheng (1999) presented a sample size methodology for the case of bivariate binary outcomes. We propose a bivariate Wilcoxon-Mann-Whitney(WMW) statistics for sample-size determination for binary outcomes, and this nonparametric method can be equally used to determine sample sizes of ordinal outcomes. The two methods of sample size determination rely on the same testing strategy for the target parameters but differs in the test statistics, an asymptotic bivariate normal statistic of the transformed proportions in Thall and Cheng (1999) and nonparametric bivariate WMW statistic in the other method. Sample sizes are calculated for the two experimental oncology trials, described in Thall and Cheng (1999), and for the first trial example the sample sizes of a bivariate WMW statistic are smaller than those of Thall and Cheng (1999), while for the second trial example the reverse is true.

Main SNP Identification of Hanwoo Carcass Weight with Multifactor Dimensionality Reduction(MDR) Method (MULTIFACTOR DIMENSIONALITY REDUCTION(MDR)을 이용한 한우 도체중에서의 주요 SNP 규명)

  • Lee, Jea-Young;Kim, Dong-Chul
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.1
    • /
    • pp.53-63
    • /
    • 2008
  • It is commonly believed that disease of human or economic traits of livestock are caused not by single gene acting alone, but by multiple genes interacting with one an-other. This issue is difficult due to the limitations of parametric statistical method like as logistic regression for detection of gene effects that are dependent solely on interactions with other genes and with environmental exposures. Multifactor dimensionality reduction (MDR) nonparametric statistical method, to improve the identification of single nucleotide polymorphism (SNP) associated with the Hanwoo(Korean cattle) carcass cold weight, is applied and compared with ANOVA results.

Kernel Pattern Recognition using K-means Clustering Method (K-평균 군집방법을 이요한 가중커널분류기)

  • 백장선;심정욱
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.447-455
    • /
    • 2000
  • We propose a weighted kernel pattern recognition method using the K -means clustering algorithm to reduce computation and storage required for the full kernel classifier. This technique finds a set of reference vectors and weights which are used to approximate the kernel classifier. Since the hierarchical clustering method implemented in the 'Weighted Parzen Window (WP\V) classifier is not able to rearrange the proper clusters, we adopt the K -means algorithm to find reference vectors and weights from the more properly rearranged clusters \Ve find that the proposed method outperforms the \VP\V method for the repre~entativeness of the reference vectors and the data reduction.

  • PDF

Applying Bootstrap to Time Series Data Having Trend (추세 시계열 자료의 부트스트랩 적용)

  • Park, Jinsoo;Kim, Yun Bae;Song, Kiburm
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.38 no.2
    • /
    • pp.65-73
    • /
    • 2013
  • In the simulation output analysis, bootstrap method is an applicable resampling technique to insufficient data which are not significant statistically. The moving block bootstrap, the stationary bootstrap, and the threshold bootstrap are typical bootstrap methods to be used for autocorrelated time series data. They are nonparametric methods for stationary time series data, which correctly describe the original data. In the simulation output analysis, however, we may not use them because of the non-stationarity in the data set caused by the trend such as increasing or decreasing. In these cases, we can get rid of the trend by differencing the data, which guarantees the stationarity. We can get the bootstrapped data from the differenced stationary data. Taking a reverse transform to the bootstrapped data, finally, we get the pseudo-samples for the original data. In this paper, we introduce the applicability of bootstrap methods to the time series data having trend, and then verify it through the statistical analyses.

Constructing Simultaneous Confidence Intervals for the Difference of Proportions from Multivariate Binomial Distributions

  • Jeong, Hyeong-Chul;Kim, Dae-Hak
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.1
    • /
    • pp.129-140
    • /
    • 2009
  • In this paper, we consider simultaneous confidence intervals for the difference of proportions between two groups taken from multivariate binomial distributions in a nonparametric way. We briefly discuss the construction of simultaneous confidence intervals using the method of adjusting the p-values in multiple tests. The features of bootstrap simultaneous confidence intervals using non-pooled samples are presented. We also compute confidence intervals from the adjusted p-values of multiple tests in the Westfall (1985) style based on a pooled sample. The average coverage probabilities of the bootstrap simultaneous confidence intervals are compared with those of the Bonferroni simultaneous confidence intervals and the Sidak simultaneous confidence intervals. Finally, we give an example that shows how the proposed bootstrap simultaneous confidence intervals can be utilized through data analysis.

Semiparametric Seasonal Cointegrating Rank Selection

  • Seong, Byeong-Chan;Ahn, Sung-K.;Ch, Sin-Sup
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.5
    • /
    • pp.791-797
    • /
    • 2011
  • This paper considers the issue of seasonal cointegrating rank selection by information criteria as the extension of Cheng and Phillips (2009). The method does not require the specification of lag length in vector autoregression, is convenient in empirical work, and is in a semiparametric context because it allows for a general short memory error component in the model with only lags related to error correction terms. Some limit properties of usual information criteria are given for the rank selection and small Monte Carlo simulations are conducted to evaluate the performances of the criteria.