• Title/Summary/Keyword: nonparametric regression

Search Result 191, Processing Time 0.022 seconds

Type of Statistical Methods and Errors in the Journal of Korean Academy of Fundamentals of Nursing (기본간호학회지 게재 논문의 통계학적 방법 유형과 오류)

  • Choi, Eunhee
    • Journal of Korean Academy of Fundamentals of Nursing
    • /
    • v.22 no.4
    • /
    • pp.452-457
    • /
    • 2015
  • Purpose: In nursing research, studies using statistical methods are required and have increased. In this study, some statistical methods using in nursing study are summarized and appropriate usage is proposed. Methods: Twenty-five original articles from the Journal of Korean Academy of Fundamentals Nursing were reviewed. Statistical methods used in the Journal of Fundamentals Nursing were classified and common errors were presented. Results: Seventy-six statistical analysis were performed in the 25 studies. Among the articles, 28 cases contained errors. Most errors occurred in linear regression analysis and nonparametric analysis. Conclusion: When the use of statistical method is applied inappropriately, the result bring out a serious error. In order to ensure reliability and validity of study, researchers should recognize clear application and usage of statistical methods.

Robustness, Data Analysis, and Statistical Modeling: The First 50 Years and Beyond

  • Barrios, Erniel B.
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.6
    • /
    • pp.543-556
    • /
    • 2015
  • We present a survey of contributions that defined the nature and extent of robust statistics for the last 50 years. From the pioneering work of Tukey, Huber, and Hampel that focused on robust location parameter estimation, we presented various generalizations of these estimation procedures that cover a wide variety of models and data analysis methods. Among these extensions, we present linear models, clustered and dependent observations, times series data, binary and discrete data, models for spatial data, nonparametric methods, and forward search methods for outliers. We also present the current interest in robust statistics and conclude with suggestions on the possible future direction of this area for statistical science.

Posterior Inference in Single-Index Models

  • Park, Chun-Gun;Yang, Wan-Yeon;Kim, Yeong-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.1
    • /
    • pp.161-168
    • /
    • 2004
  • A single-index model is useful in fields which employ multidimensional regression models. Many methods have been developed in parametric and nonparametric approaches. In this paper, posterior inference is considered and a wavelet series is thought of as a function approximated to a true function in the single-index model. The posterior inference needs a prior distribution for each parameter estimated. A prior distribution of each coefficient of the wavelet series is proposed as a hierarchical distribution. A direction $\beta$ is assumed with a unit vector and affects estimate of the true function. Because of the constraint of the direction, a transformation, a spherical polar coordinate $\theta$, of the direction is required. Since the posterior distribution of the direction is unknown, we apply a Metropolis-Hastings algorithm to generate random samples of the direction. Through a Monte Carlo simulation we investigate estimates of the true function and the direction.

Monitoring of Gene Regulations Using Average Rank in DNA Microarray: Implementation of R

  • Park, Chang-Soon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.4
    • /
    • pp.1005-1021
    • /
    • 2007
  • Traditional procedures for DNA microarray data analysis are to preprocess and normalize the gene expression data, and then to analyze the normalized data using statistical tests. Drawbacks of the traditional methods are: genuine biological signal may be unwillingly eliminated together with artifacts, the limited number of arrays per gene make statistical tests difficult to use the normality assumption or nonparametric method, and genes are tested independently without consideration of interrelationships among genes. A novel method using average rank in each array is proposed to eliminate such drawbacks. This average rank method monitors differentially regulated genes among genetically different groups and the selected genes are somewhat different from those selected by traditional P-value method. Addition of genes selected by the average rank method to the traditional method will provide better understanding of genetic differences of groups.

  • PDF

A FRAMEWORK TO UNDERSTAND THE ASYMPTOTIC PROPERTIES OF KRIGING AND SPLINES

  • Furrer Eva M.;Nychka Douglas W.
    • Journal of the Korean Statistical Society
    • /
    • v.36 no.1
    • /
    • pp.57-76
    • /
    • 2007
  • Kriging is a nonparametric regression method used in geostatistics for estimating curves and surfaces for spatial data. It may come as a surprise that the Kriging estimator, normally derived as the best linear unbiased estimator, is also the solution of a particular variational problem. Thus, Kriging estimators can also be interpreted as generalized smoothing splines where the roughness penalty is determined by the covariance function of a spatial process. We build off the early work by Silverman (1982, 1984) and the analysis by Cox (1983, 1984), Messer (1991), Messer and Goldstein (1993) and others and develop an equivalent kernel interpretation of geostatistical estimators. Given this connection we show how a given covariance function influences the bias and variance of the Kriging estimate as well as the mean squared prediction error. Some specific asymptotic results are given in one dimension for Matern covariances that have as their limit cubic smoothing splines.

Negative Binomial Varying Coefficient Partially Linear Models

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.809-817
    • /
    • 2012
  • We propose a semiparametric inference for a generalized varying coefficient partially linear model(VCPLM) for negative binomial data. The VCPLM is useful to model real data in that varying coefficients are a special type of interaction between explanatory variables and partially linear models fit both parametric and nonparametric terms. The negative binomial distribution often arise in modelling count data which usually are overdispersed. The varying coefficient function estimators and regression parameters in generalized VCPLM are obtained by formulating a penalized likelihood through smoothing splines for negative binomial data when the shape parameter is known. The performance of the proposed method is then evaluated by simulations.

Efficient Markov Chain Monte Carlo for Bayesian Analysis of Neural Network Models

  • Paul E. Green;Changha Hwang;Lee, Sangbock
    • Journal of the Korean Statistical Society
    • /
    • v.31 no.1
    • /
    • pp.63-75
    • /
    • 2002
  • Most attempts at Bayesian analysis of neural networks involve hierarchical modeling. We believe that similar results can be obtained with simpler models that require less computational effort, as long as appropriate restrictions are placed on parameters in order to ensure propriety of posterior distributions. In particular, we adopt a model first introduced by Lee (1999) that utilizes an improper prior for all parameters. Straightforward Gibbs sampling is possible, with the exception of the bias parameters, which are embedded in nonlinear sigmoidal functions. In addition to the problems posed by nonlinearity, direct sampling from the posterior distributions of the bias parameters is compounded due to the duplication of hidden nodes, which is a source of multimodality. In this regard, we focus on sampling from the marginal posterior distribution of the bias parameters with Markov chain Monte Carlo methods that combine traditional Metropolis sampling with a slice sampler described by Neal (1997, 2001). The methods are illustrated with data examples that are largely confined to the analysis of nonparametric regression models.

Air Pollution and Daily Mortality in Busan using a Time Series Analysis (시계열자료를 이용한 대기오염과 일별 사망수의 관련성 분석)

  • 서화숙;정효준;이홍근
    • Journal of Environmental Science International
    • /
    • v.11 no.10
    • /
    • pp.1061-1068
    • /
    • 2002
  • To identify possible associations with concentrations of ambient air pollutants and daily mortality in Busan, this study assessed the effects of air pollution for the time period 1999-2000. Poisson regression analysis by Generalized Additive Model were conducted considering trend, season, meteorology, and day-of-the-week as confounders in a nonparametric approach. Busan had a 10% increase in mortality in persons aged 65 and older(95% Cl : 1.01-1.10) in association with IQR in $NO_2$(lagged 2 days). An increase of $NO_2$(lagged 2days) was associated with a 4% increase in respiratory mortality(Cl : 1.02-1.11) and CO(lagged 1 day) showed a 3% increase(Cl : 1.00-1.07).

SOME PROPERTIES OF SIMEX ESTIMATOR IN PARTIALLY LINEAR MEASUREMENT ERROR MODEL

  • Meeseon Jeong;Kim, Choongrak
    • Journal of the Korean Statistical Society
    • /
    • v.32 no.1
    • /
    • pp.85-92
    • /
    • 2003
  • We consider the partially linear model E(Y) : X$^{t}$ $\beta$+η(Z) when the X's are measured with additive error. The semiparametric likelihood estimation ignoring the measurement error gives inconsistent estimator for both $\beta$ and η(.). In this paper we suggest the SIMEX estimator for f to correct the bias induced by measurement error, and explore its properties. We show that the rational linear extrapolant is proper in extrapolation step in the sense that the SIMEX method under this extrapolant gives consistent estimator It is also shown that the SIMEX estimator is asymptotically equivalent to the semiparametric version of the usual parametric correction for attenuation suggested by Liang et al. (1999) A simulation study is given to compare two variance estimating methods for SIMEX estimator.