• Title/Summary/Keyword: statistics based method

Search Result 2,157, Processing Time 0.027 seconds

Spatial Clustering Method Via Generalized Lasso (Generalized Lasso를 이용한 공간 군집 기법)

  • Song, Eunjung;Choi, Hosik;Hwang, Seungsik;Lee, Woojoo
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.4
    • /
    • pp.561-575
    • /
    • 2014
  • In this paper, we propose a penalized likelihood method to detect local spatial clusters associated with disease. The key computational algorithm is based on genlasso by Tibshirani and Taylor (2011). The proposed method has two main advantages over Kulldorff's method which is popoular to detect local spatial clusters. First, it is not needed to specify a proper cluster size a priori. Second, any type of covariate can be incorporated and, it is possible to find local spatial clusters adjusted for some demographic variables. We illustrate our proposed method using tuberculosis data from Seoul.

A Study on the Inference of Improving the Service Quality of Fine Dust Statistics on the Quality of Citizen's Life (미세먼지 통계 서비스 품질향상이 시민 삶의 질에 미치는 영향에 관한 연구)

  • Jang, Eun Mi;Suh, Eung Kyo
    • The Journal of Information Systems
    • /
    • v.30 no.3
    • /
    • pp.47-64
    • /
    • 2021
  • Purpose This study measures the degree of improvement in statistical quality experienced by data users when the data of a more convenient measurement method is extended to the analysis target to improve the quality of fine dust statistics service, and the method of expressing analysis data is revised. Ultimately, the main purpose is to explore how it can affect the quality of life of citizens. Design/methodology/approach As it was an issue the emerged as the most important issue at the time, various parties (government, private company, academia, civic groups, etc.) conducted multifaced research on fine dust, but they all focused on measuring technology and demonstrating its effectiveness there was only. This researcher redesigned the study from the viewpoint of statistical data users by changing the research subject from the technology itself to user, different from the existing research cases. The questionnaire method and structural equation were used in the study, and fine dust statistics generated through the existing method and the expanded/revised method were provided and compared for a total of 200 people. Findings Based on the results of the study, I would like to suggest what each entity should ultimately focus on to resolve the fine dust issue in the future.

Leave-one-out Bayesian model averaging for probabilistic ensemble forecasting

  • Kim, Yongdai;Kim, Woosung;Ohn, Ilsang;Kim, Young-Oh
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.1
    • /
    • pp.67-80
    • /
    • 2017
  • Over the last few decades, ensemble forecasts based on global climate models have become an important part of climate forecast due to the ability to reduce uncertainty in prediction. Moreover in ensemble forecast, assessing the prediction uncertainty is as important as estimating the optimal weights, and this is achieved through a probabilistic forecast which is based on the predictive distribution of future climate. The Bayesian model averaging has received much attention as a tool of probabilistic forecasting due to its simplicity and superior prediction. In this paper, we propose a new Bayesian model averaging method for probabilistic ensemble forecasting. The proposed method combines a deterministic ensemble forecast based on a multivariate regression approach with Bayesian model averaging. We demonstrate that the proposed method is better in prediction than the standard Bayesian model averaging approach by analyzing monthly average precipitations and temperatures for ten cities in Korea.

The Identification Of Multiple Outliers

  • Park, Jin-Pyo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.11 no.2
    • /
    • pp.201-215
    • /
    • 2000
  • The classical method for regression analysis is the least squares method. However, if the data contain significant outliers, the least squares estimator can be broken down by outliers. To remedy this problem, the robust methods are important complement to the least squares method. Robust methods down weighs or completely ignore the outliers. This is not always best because the outliers can contain some very important information about the population. If they can be detected, the outliers can be further inspected and appropriate action can be taken based on the results. In this paper, I propose a sequential outlier test to identify outliers. It is based on the nonrobust estimate and the robust estimate of scatter of a robust regression residuals and is applied in forward procedure, removing the most extreme data at each step, until the test fails to detect outliers. Unlike other forward procedures, the present one is unaffected by swamping or masking effects because the statistics is based on the robust regression residuals. I show the asymptotic distribution of the test statistics and apply the test to several real data and simulated data for the test to be shown to perform fairly well.

  • PDF

A Robust Optimization Using the Statistics Based on Kriging Metamodel

  • Lee Kwon-Hee;Kang Dong-Heon
    • Journal of Mechanical Science and Technology
    • /
    • v.20 no.8
    • /
    • pp.1169-1182
    • /
    • 2006
  • Robust design technology has been applied to versatile engineering problems to ensure consistency in product performance. Since 1980s, the concept of robust design has been introduced to numerical optimization field, which is called the robust optimization. The robustness in the robust optimization is determined by a measure of insensitiveness with respect to the variation of a response. However, there are significant difficulties associated with the calculation of variations represented as its mean and variance. To overcome the current limitation, this research presents an implementation of the approximate statistical moment method based on kriging metamodel. Two sampling methods are simultaneously utilized to obtain the sequential surrogate model of a response. The statistics such as mean and variance are obtained based on the reliable kriging model and the second-order statistical approximation method. Then, the simulated annealing algorithm of global optimization methods is adopted to find the global robust optimum. The mathematical problem and the two-bar design problem are investigated to show the validity of the proposed method.

Modified information criterion for testing changes in generalized lambda distribution model based on confidence distribution

  • Ratnasingam, Suthakaran;Buzaianu, Elena;Ning, Wei
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.301-317
    • /
    • 2022
  • In this paper, we propose a change point detection procedure based on the modified information criterion in a generalized lambda distribution (GLD) model. Simulations are conducted to obtain empirical critical values of the proposed test statistic. We have also conducted simulations to evaluate the performance of the proposed methods comparing to the log-likelihood method in terms of power, coverage probability, and confidence sets. Our results indicate that, under various conditions, the proposed method modified information criterion (MIC) approach shows good finite sample properties. Furthermore, we propose a new goodness-of-fit testing procedure based on the energy distance to evaluate the asymptotic null distribution of our test statistic. Two real data applications are provided to illustrate the use of the proposed method.

HMM Based Endpoint Detection for Speech Signals

  • Lee Yonghyung;Oh Changhyuck
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2001.11a
    • /
    • pp.75-76
    • /
    • 2001
  • An endpoint detection method for speech signals utilizing hidden Markov model(HMM) is proposed. It turns out that the proposed algorithm is quite satisfactory to apply isolated word speech recognition.

  • PDF

Improving Efficiency of Usage Statistics Collection and Analysis in E-Journal Consortia (컨소시엄 기반 전자저널 이용통계 수집 및 분석 개선 방안)

  • Jung, Young-Im;Kim, Jeong-Hwan
    • Journal of the Korean Society for information Management
    • /
    • v.29 no.2
    • /
    • pp.7-25
    • /
    • 2012
  • The proliferating use of e-journals has led increasing interest in collecting and analyzing usage statistic information. However, the existing manual method and simple journal usage reports provided by publishers hinder the effective collection of large-scale usage statistics and the comprehensive/in-depth analysis on them. Thus we have proposed a hybrid automatic method of collecting e-journal usage statistics based on screen scraping and SUSHI protocol. In addition, the generation method of summary statistics presented in graphs, charts and tables has been suggested in this study. By utilizing the suggested system and analysis data, librarians can compose various reports on budget or operation of the libraries.

New Bootstrap Method for Autoregressive Models

  • Hwang, Eunju;Shin, Dong Wan
    • Communications for Statistical Applications and Methods
    • /
    • v.20 no.1
    • /
    • pp.85-96
    • /
    • 2013
  • A new bootstrap method combined with the stationary bootstrap of Politis and Romano (1994) and the classical residual-based bootstrap is applied to stationary autoregressive (AR) time series models. A stationary bootstrap procedure is implemented for the ordinary least squares estimator (OLSE), along with classical bootstrap residuals for estimated errors, and its large sample validity is proved. A finite sample study numerically compares the proposed bootstrap estimator with the estimator based on the classical residual-based bootstrapping. The study shows that the proposed bootstrapping is more effective in estimating the AR coefficients than the residual-based bootstrapping.

Ensemble approach for improving prediction in kernel regression and classification

  • Han, Sunwoo;Hwang, Seongyun;Lee, Seokho
    • Communications for Statistical Applications and Methods
    • /
    • v.23 no.4
    • /
    • pp.355-362
    • /
    • 2016
  • Ensemble methods often help increase prediction ability in various predictive models by combining multiple weak learners and reducing the variability of the final predictive model. In this work, we demonstrate that ensemble methods also enhance the accuracy of prediction under kernel ridge regression and kernel logistic regression classification. Here we apply bagging and random forests to two kernel-based predictive models; and present the procedure of how bagging and random forests can be embedded in kernel-based predictive models. Our proposals are tested under numerous synthetic and real datasets; subsequently, they are compared with plain kernel-based predictive models and their subsampling approach. Numerical studies demonstrate that ensemble approach outperforms plain kernel-based predictive models.