• Title/Summary/Keyword: Statistical error

Search Result 1,765, Processing Time 0.025 seconds

Improved Statistical Language Model for Context-sensitive Spelling Error Candidates (문맥의존 철자오류 후보 생성을 위한 통계적 언어모형 개선)

  • Lee, Jung-Hun;Kim, Minho;Kwon, Hyuk-Chul
    • Journal of Korea Multimedia Society
    • /
    • v.20 no.2
    • /
    • pp.371-381
    • /
    • 2017
  • The performance of the statistical context-sensitive spelling error correction depends on the quality and quantity of the data for statistical language model. In general, the size and quality of data in a statistical language model are proportional. However, as the amount of data increases, the processing speed becomes slower and storage space also takes up a lot. We suggest the improved statistical language model to solve this problem. And we propose an effective spelling error candidate generation method based on a new statistical language model. The proposed statistical model and the correction method based on it improve the performance of the spelling error correction and processing speed.

An Analysis on Error Types of Graphs for Statistical Literacy Education: Ethical Problems at Data Analysis in the Statistical Problem Solving (통계적 소양 교육을 위한 그래프 오류 유형 분석: 자료 분석 단계에서의 통계 윤리 문제)

  • Tak, Byungjoo;Kim, Dabin
    • Journal of Elementary Mathematics Education in Korea
    • /
    • v.24 no.1
    • /
    • pp.1-30
    • /
    • 2020
  • This study was carried out in order to identify the error types of statistical graphs for statistical literacy education. We analyze the meaning of using graphs in statistical problem solving, and identify categories, frequencies, and contexts as the components of statistical graphs. Error types of representing categories and frequencies make statistics consumers see incorrect distributions of data by subjective point of view of statistics producers and visual illusion. Error types of providing contexts hinder the interpretation of statistical information by concealing or twisting the contexts of data. Moreover, the findings show that tasks provide standardized frame already for drawing graphs in order to avoid errors and pay attention to the process of drawing the graph rather than statistical literacy for analyzing data. We suggest some implications about statistical literacy education, ethical problems, and knowledge for teaching to be considered when teaching the statistical graph in elementary mathematics classes.

Hierarchical Bayes Estimators of the Error Variance in Two-Way ANOVA Models

  • Chang, In Hong;Kim, Byung Hwee
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.2
    • /
    • pp.315-324
    • /
    • 2002
  • For estimating the error variance under the relative squared error loss in two-way analysis of variance models, we provide a class of hierarchical Bayes estimators and then derive a subclass of the hierarchical Bayes estimators, each member of which dominates the best multiple of the error sum of squares which is known to be minimax. We also identify a subclass of non-minimax hierarchical Bayes estimators.

Statistical Issues in Genomic Cohort Studies (유전체 코호트 연구의 주요 통계학적 과제)

  • Park, So-Hee
    • Journal of Preventive Medicine and Public Health
    • /
    • v.40 no.2
    • /
    • pp.108-113
    • /
    • 2007
  • When conducting large-scale cohort studies, numerous statistical issues arise from the range of study design, data collection, data analysis and interpretation. In genomic cohort studies, these statistical problems become more complicated, which need to be carefully dealt with. Rapid technical advances in genomic studies produce enormous amount of data to be analyzed and traditional statistical methods are no longer sufficient to handle these data. In this paper, we reviewed several important statistical issues that occur frequently in large-scale genomic cohort studies, including measurement error and its relevant correction methods, cost-efficient design strategy for main cohort and validation studies, inflated Type I error, gene-gene and gene-environment interaction and time-varying hazard ratios. It is very important to employ appropriate statistical methods in order to make the best use of valuable cohort data and produce valid and reliable study results.

First Order Difference-Based Error Variance Estimator in Nonparametric Regression with a Single Outlier

  • Park, Chun-Gun
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.3
    • /
    • pp.333-344
    • /
    • 2012
  • We consider some statistical properties of the first order difference-based error variance estimator in nonparametric regression models with a single outlier. So far under an outlier(s) such difference-based estimators has been rarely discussed. We propose the first order difference-based estimator using the leave-one-out method to detect a single outlier and simulate the outlier detection in a nonparametric regression model with the single outlier. Moreover, the outlier detection works well. The results are promising even in nonparametric regression models with many outliers using some difference based estimators.

Alternative Tests for the Nested Error Component Regression Model

  • Song, Seuck-Heun;Jung, Byoung-Cheol
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.1
    • /
    • pp.63-80
    • /
    • 2000
  • We consider the panel data regression model with nested error componets. In this paper, the several Lagrange Multipler tests for the nested error component model are derived. These tests extend the earlier work of Honda(1985), Moulton and Randolph(1989), Baltagi, et al.(1992) and King and Wu(1997) to the nested error component case. Monte Carlo experiments are conducted to study the performance of these LM tests.

  • PDF

Hierarchical Bayes Estimators of the Error Variance in Balanced Fixed-Effects Two-Way ANOVA Models

  • Kim, Byung-Hwee;Dong, Kyung-Hwa
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.2
    • /
    • pp.487-500
    • /
    • 1999
  • We propose a class of hierarchical Bayes estimators of the error variance under the relative squared error loss in balanced fixed-effects two-way analysis of variance models. Also we provide analytic expressions for the risk improvement of the hierarchical Bayes estimators over multiples of the error sum of squares. Using these expressions we identify a subclass of the hierarchical Bayes estimators each member of which dominates the best multiple of the error sum of squares which is known to be minimax. Numerical values of the percentage risk improvement are given in some special cases.

  • PDF

On the Performance of Iterated Wild Bootstrap Interval Estimation of the Mean Response

  • Kim, Woo-Chul;Ko, Duk-Hyun
    • Journal of the Korean Statistical Society
    • /
    • v.24 no.2
    • /
    • pp.551-562
    • /
    • 1995
  • We consider the iterated bootstrap method in regression model with heterogeneous error variances. The iterated wild bootstrap confidence intervla of the mean response is considered. It is shown that the iterated wild bootstrap confidence interval has coverage error of order $n^{-1}$ wheresa percentile method interval has an error of order $n^{-1/2}$. The simulation results reveal that the iterated bootstrap method calibrates the coverage error of percentile method interval successfully even for the small sample size.

  • PDF

Estimation of the Polynomial Errors-in-variables Model with Decreasing Error Variances

  • Moon, Myung-Sang;R. F. Gunst
    • Journal of the Korean Statistical Society
    • /
    • v.23 no.1
    • /
    • pp.115-134
    • /
    • 1994
  • Polynomial errors-in-variables model with one predictor variable and one response variable is defined and an estimator of model is derived following the Booth's linear model estimation procedure. Since polynomial model is nonlinear function of the unknown regression coefficients and error-free predictors, it is nonlinear model in errors-in-variables model. As a result of applying linear model estimation method to nonlinear model, some additional assumptions are necessary. Hence, an estimator is derived under the assumption that the error variances are decrasing as sample size increases. Asymptotic propoerties of the derived estimator are provided. A simulation study is presented to compare the small sample properties of the derived estimator with those of OLS estimator.

  • PDF

Refractive Error in 7-9 Year-old Korea Children (7세부터 9세 사이의 한국인 어린이의 굴절 이상)

  • Kim, Douk-Hoon;Alberto, Mercedita O.
    • Journal of Korean Clinical Health Science
    • /
    • v.2 no.3
    • /
    • pp.203-208
    • /
    • 2014
  • Purpose. To analysis the refractive error in 7-9 year-old Korea children. Methods. From July 2013 to June 2014, two hundred eighty two subjects were performed in refraction test using the Auto-Refractometry. Results. The refractive error by spherical equivalent among all subjects was myopia 47.58%, emmetropia 42.35%, astigmatism 32.33%, and hyperopia 8.76%. Myopia was more common in female than males although the difference was not statically significant. The axis of astigmatism was with the rule in 65%, against the rule in 31.5%, and oblique in 3.5% There was a statistical significance between 7 year and 9 year of male in the spherical equivalent power(p=0.010). Also there was a statistical significance between 7 years and 9 years of female in the spherical equivalent power(p=0.036). However, there was not a statistical significance between male and female in spherical equivalent power(p>0.5). Conclusions. In this study, myopia was the most common refractive error. On the other hand, The prevalence of the axis of astigmatism was the with- the- rule. The spherical equivalent of refractive error was similar results between male and female. However The refractive error was different style with aging. these data suggested that the analysis of the refractive error at young children can provide the information of useful diagnosis for the correction of visual acuity.