• Title/Summary/Keyword: statistics based method

Search Result 2,135, Processing Time 0.032 seconds

Estimating dose-response curves using splines: a nonparametric Bayesian knot selection method

  • Lee, Jiwon;Kim, Yongku;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.287-299
    • /
    • 2022
  • In radiation epidemiology, the excess relative risk (ERR) model is used to determine the dose-response relationship. In general, the dose-response relationship for the ERR model is assumed to be linear, linear-quadratic, linear-threshold, quadratic, and so on. However, since none of these functions dominate other functions for expressing the dose-response relationship, a Bayesian semiparametric method using splines has recently been proposed. Thus, we improve the Bayesian semiparametric method for the selection of the tuning parameters for splines as the number and location of knots using a Bayesian knot selection method. Equally spaced knots cannot capture the characteristic of radiation exposed dose distribution which is highly skewed in general. Therefore, we propose a nonparametric Bayesian knot selection method based on a Dirichlet process mixture model. Inference of the spline coefficients after obtaining the number and location of knots is performed in the Bayesian framework. We apply this approach to the life span study cohort data from the radiation effects research foundation in Japan, and the results illustrate that the proposed method provides competitive curve estimates for the dose-response curve and relatively stable credible intervals for the curve.

Analysis of Missing Data Using an Empirical Bayesian Method (경험적 베이지안 방법을 이용한 결측자료 연구)

  • Yoon, Yong Hwa;Choi, Boseung
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.1003-1016
    • /
    • 2014
  • Proper missing data imputation is an important procedure to obtain superior results for data analysis based on survey data. This paper deals with both a model based imputation method and model estimation method. We utilized a Bayesian method to solve a boundary solution problem in which we applied a maximum likelihood estimation method. We also deal with a missing mechanism model selection problem using forecasting results and a comparison between model accuracies. We utilized MWPE(modified within precinct error) (Bautista et al., 2007) to measure prediction correctness. We applied proposed ML and Bayesian methods to the Korean presidential election exit poll data of 2012. Based on the analysis, the results under the missing at random mechanism showed superior prediction results than under the missing not at random mechanism.

Geovisualization of Migration Statistics Using Flow Mapping Based on Web GIS (Web GIS 기반 유선도 작성을 통한 인구이동통계의 지리적 시각화)

  • Kim, Kam-Young;Lee, Sang-Il
    • Journal of the Korean Geographical Society
    • /
    • v.47 no.2
    • /
    • pp.268-281
    • /
    • 2012
  • In spite of the usefulness of migration statistics in spatially understanding social processes and identifying social effects of spatial processes, services and analyses of the statistics have been restricted due to the complexity of their data structure. In addition, flow mapping functionality which is a useful method to explore and visualize the migration statistics has yet to be fully represented in modern GIS applications. Given this, the purpose of this research is to demonstrate the possibility of flow mapping and the exploratory spatial analysis of the migration statistics in a Web GIS environment. For this, the characteristics of the statistics were examined from database, GIS, and cartographic perspectives. Then, O-D structure of the migration statistics was converted to spatial data appropriate to f low mapping based on the characteristics. The interface of Web GIS is specialized the migration statistics and provides exploratory visualization by allowing dynamic interactions such as spatial focusing and attribute filtering.

  • PDF

Image Enhancement for Epigraphic Image Using Adaptive Process Based on Local Statistics (국부통계근거 적응처리에 의한 금석문영상 향상)

  • Hwang, Jae-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.2 s.314
    • /
    • pp.37-45
    • /
    • 2007
  • We propose an adaptive image enhancement method for epigraphic images, which is based on local statistics. Local statistics of the image are utilized for adaptive realization of the enhancement, that controls the contribution of the smoothing or sharpening paths. Image contrast enhancement occurs in details and noises are suppressed in smooth areas. For modeling the epigraphic image, pre~process is achieved by HSDI(Hanzi squeezed digital image). We have calculated the local statistics from this HSDI model. Application of this approach to HSDI has shown that processing not only smooths the background areas but also improves the subtle variations of edges, so that the word regions can be enhanced. Experimental results show that the proposed algorithm has better performance than the conventional image enhancement ones.

Nonparametric Kernel Regression Function Estimation with Bootstrap Method

  • Kim, Dae-Hak
    • Journal of the Korean Statistical Society
    • /
    • v.22 no.2
    • /
    • pp.361-368
    • /
    • 1993
  • In recent years, kernel type estimates are abundant. In this paper, we propose a bandwidth selection method for kernel regression of fixed design based on bootstrap procedure. Mathematical properties of proposed bootstrap-based bandwidth selection method are discussed. Performance of the proposed method for small sample case is compared with that of cross-validation method via a simulation study.

  • PDF

APPROXIMATION TO THE CUMULATIVE NORMAL DISTRIBUTION USING HYPERBOLIC TANGENT BASED FUNCTIONS

  • Yun, Beong-In
    • Journal of the Korean Mathematical Society
    • /
    • v.46 no.6
    • /
    • pp.1267-1276
    • /
    • 2009
  • This paper presents a method for approximation of the standard normal distribution by using hyperbolic tangent based functions. The presented approximate formula for the cumulative distribution depends on one numerical coefficient only, and its accuracy is admissible. Furthermore, in some particular cases, closed forms of inverse formulas are derived. Numerical results of the present method are compared with those of an existing method.

A Comparative Study of a Robust Estimate Method for Abnormal Traffic Detection (이상 트래픽 탐지를 위한 로버스트 추정 방법 비교 연구)

  • Jung, Jae-Yoon;Kim, Sahm
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.4
    • /
    • pp.517-525
    • /
    • 2011
  • This paper shows the performance evaluation of a robust estimator based on the GARCH model. We first introduce the method of a robust estimate in the GARCH model and the method of an outlier detection in the GARCH model. The results of the real internet traffic data show the out-performance of the robust estimator over the outlier detection method in the GARCH model. In addition, the method of the robust estimate is less complex than the method of the outlier detection method in the GARCH model.

Estimations of Forest Growing Stocks in Small-area Level Considering Local Forest Characteristics (산림의 지역적 특성을 고려한 시군구 임목축적량 통계 산출 기법 개발)

  • Kim, Eun-Sook;Kim, Cheol-Min
    • Journal of Korean Society of Forest Science
    • /
    • v.104 no.1
    • /
    • pp.117-126
    • /
    • 2015
  • Forest statistics of local administrative districts have many social needs, nevertheless we have some difficulties for working out an accurate statistics because of insufficient data in small-area level. Thus, new small-area estimation method has to set aside additional data, decrease errors of statistics and consider the local forest characteristics at the same time. In this study, we researched the spatial divisions that can set aside additional data for statistics production and satisfy the major premise, which is "forest characteristics of spatial divisions have to be equal to that of small-area". And we compared synthetic estimation methods based on three different spatial divisions(provinces, neighbor districts and new expanded districts). New expanded districts were divided based on the criteria of climate, soil type and tree species composition that affects local forest characteristics. Small-area statistics were assessed in terms of the ability to estimate local forest characteristics and consistency within large-area statistics. As a result, new expanded districts synthetic estimation was assessed to calculate statistics that reflects local forest characteristics better than other two estimation methods. Moreover, this synthetic estimation method produced the statistics that was included within 95% confidence interval of large-area statistics and was the closer to large-area statistics than the neighbor districts synthetic estimation.

Local Linear Logistic Classification of Microarray Data Using Orthogonal Components (직교요인을 이용한 국소선형 로지스틱 마이크로어레이 자료의 판별분석)

  • Baek, Jang-Sun;Son, Young-Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.587-598
    • /
    • 2006
  • The number of variables exceeds the number of samples in microarray data. We propose a nonparametric local linear logistic classification procedure using orthogonal components for classifying high-dimensional microarray data. The proposed method is based on the local likelihood and can be applied to multi-class classification. We applied the local linear logistic classification method using PCA, PLS, and factor analysis components as new features to Leukemia data and colon data, and compare the performance of the proposed method with the conventional statistical classification procedures. The proposed method outperforms the conventional ones for each component, and PLS has shown best performance when it is embedded in the proposed method among the three orthogonal components.