• Title/Summary/Keyword: Accuracy Statistics

Search Result 799, Processing Time 0.022 seconds

Optimal Criterion of Classification Accuracy Measures for Normal Mixture (정규혼합에서 분류정확도 측도들의 최적기준)

  • Yoo, Hyun-Sang;Hong, Chong-Sun
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.3
    • /
    • pp.343-355
    • /
    • 2011
  • For a data with the assumption of the mixture distribution, it is important to find an appropriate threshold and evaluate its performance. The relationship is found of well-known nine classification accuracy measures such as MVD, Youden's index, the closest-to-(0, 1) criterion, the amended closest-to-(0, 1) criterion, SSS, symmetry point, accuracy area, TA, TR. Then some conditions of these measures are categorized into seven groups. Under the normal mixture assumption, we calculate thresholds based on these measures and obtain the corresponding type I and II errors. We could explore that which classification measure has minimum type I and II errors for estimated mixture distribution to understand the strength and weakness of these classification measures.

Issues in Developing Statistics for the Handicapped Population (장애자통계의 개선방안)

  • 최봉호;이승욱
    • Korea journal of population studies
    • /
    • v.17 no.1
    • /
    • pp.73-86
    • /
    • 1994
  • In order for handicapped people to maintain better humane life, it is necessary to get statistics of them in developing appropriate national policy. However, it is very difficult to obtain baseline statistics on regular or occasional basis. It's reason is mainly attributed to attitudes of their family's tendency to conceal any existence of such memeber in the household. As a result, the statis-tics on the handicapped population is very inaccurate and under satisfaction. We must produce such statistics periodically in time and with accuracy. Thus, this study porposes five methods which, we believe, can produce reliable statistics of thehandcapped population : 1) vitalization through enforcement of handicapped information into the registration system, 2) inclusion in population census of items related to handicapped information, 3) improvement of the physically handicapped population survey scheme, 4) utilization of hospital patients' records for development of the statistics, and 5) an estimation through the labor force survey.

  • PDF

A study on non-response bias adjusted estimation in business survey (사업체조사에서의 무응답 편향보정 추정에 관한 연구)

  • Chung, Hee Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.11-23
    • /
    • 2020
  • Sampling design should provide statistics to meet a given accuracy while saving cost and time. However, a large number of non-responses are occurring due to the deterioration of survey circumstances, which significantly reduces the accuracy of the survey results. Non-responses occur for a variety of reasons. Chung and Shin (2017, 2019) and Min and Shin (2018) found that the accuracy of estimation is improved by removing the bias caused by non-response when the response rate is an exponential or linear function of variable of interests. For that case they assumed that the error of the super population model follows normal distribution. In this study, we proposed a non-response bias adjusted estimator in the case where the error of a super population model follows the gamma distribution or the log-normal distribution in a business survey. We confirmed the superiority of the proposed estimator through simulation studies.

A study on the estimation of the credibility in an extended Buhlmann-Straub model (확장된 뷸만-스트라웁 모형에서 신뢰도 추정 연구)

  • Yi, Min-Jeong;Go, Han-Na;Choi, Seung-Kyoung;Lee, Eui-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1181-1190
    • /
    • 2010
  • When an insurer develops an insurance product, it is very critical to determine reasonable premiums, which is directly related to insurer's profits. There are three methods to determine premiums. Frist, the insurer utilizes premiums paid to the similar cases to the current one. Second, the insurer calculates premiums based on policyholder's past records. The last method is to combine the first with the second one. Based on the three methods, there are two major theories determining premiums, Limited Fluctuation Credibility Theory not based on statistical models and Greatest Accuracy Credibility Theory based on statistical models. There are well-known methods derived from Greatest Accuracy Credibility Theory, such as, Buhlmann model and Buhlmann-Straub model. In this paper, we extend the Buhlmann-Straub model to accommodate the fact that variability grows according to the number of data in practice and suggest a new non-parametric method to estimate the premiums. The suggested estimation method is also applied to the data gained from simulation and compared with the existing estimation method.

Solar Power Generation Forecast Model Using Seasonal ARIMA (SARIMA 모형을 이용한 태양광 발전량 예보 모형 구축)

  • Lee, Dong-Hyun;Jung, Ahyun;Kim, Jin-Young;Kim, Chang Ki;Kim, Hyun-Goo;Lee, Yung-Seop
    • Journal of the Korean Solar Energy Society
    • /
    • v.39 no.3
    • /
    • pp.59-66
    • /
    • 2019
  • New and renewable energy forecasts are key technology to reduce the annual operating cost of new and renewable facilities, and accuracy of forecasts is paramount. In this study, we intend to build a model for the prediction of short-term solar power generation for 1 hour to 3 hours. To this end, this study applied two time series technique, ARIMA model without considering seasonality and SARIMA model with considering seasonality, comparing which technique has better predictive accuracy. Comparing predicted errors by MAE measures of solar power generation for 1 hour to 3 hours at four locations, the solar power forecast model using ARIMA was better in terms of predictive accuracy than the solar power forecast model using SARIMA. On the other hand, a comparison of predicted error by RMSE measures resulted in a solar power forecast model using SARIMA being better in terms of predictive accuracy than a solar power forecast model using ARIMA.

A study to improve the accuracy of the naive propensity score adjusted estimator using double post-stratification method (나이브 성향점수보정 추정량의 정확성 향상을 위한 이중 사후층화 방법 연구)

  • Leesu Yeo;Key-Il Shin
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.6
    • /
    • pp.547-559
    • /
    • 2023
  • Proper handling of nonresponse in sample survey improves the accuracy of the parameter estimation. Various studies have been conducted to properly handle MAR (missing at random) nonresponse or MCAR (missing completely at random) nonresponse. When nonresponse occurs, the PSA (propensity score adjusted) estimator is commonly used as a mean estimator. The PSA estimator is known to be unbiased when known sample weights and properly estimated response probabilities are used. However, for MNAR (missing not at random) nonresponse, which is affected by the value of the study variable, since it is very difficult to obtain accurate response probabilities, bias may occur in the PSA estimator. Chung and Shin (2017, 2022) proposed a post-stratification method to improve the accuracy of mean estimation when MNAR nonresponse occurs under a non-informative sample design. In this study, we propose a double post-stratification method to improve the accuracy of the naive PSA estimator for MNAR nonresponse under an informative sample design. In addition, we perform simulation studies to confirm the superiority of the proposed method.

Statistics Quality Assessment and Improvement of Small & Medium Enterprises Survey (중소기업실태조사의 품질진단과 개선에 관한 연구)

  • Kim, Moon Sun;Chun, Sae Rom;Nam, Kyung H.
    • Journal of Korean Society for Quality Management
    • /
    • v.40 no.4
    • /
    • pp.577-588
    • /
    • 2012
  • Purpose: This paper aims to evaluate the quality of Small & Medium Enterprises(SMEs) Survey and to provide some desirable directions and improvements for the future SMEs Survey, conducted by the Government. Methods: The diagnosis were performed by employing the quantitative and qualitative approaches with the official guideline provided by the Ministry of Statistics. Results: Results show that follow-up management are evaluated relatively low in specific processes, and timeliness are evaluated relatively lower than accuracy, accessibility, relevance, comparability and consistency in statistical qualities. Conclusion: we propose the alternatives which enhance the quality level of SMEs by using the 6 tools of Quality Management for Statistics.

최근 초혼연령의 변화에 관한 소고

  • 황대희;고갑석
    • Korea journal of population studies
    • /
    • v.6 no.1
    • /
    • pp.115-126
    • /
    • 1983
  • In order for handicapped people to maintain better humane life, it is necessary to get statistics of them in developing appropriate national policy. However, it is very difficult to obtain baseline statistics on regular or occasional basis. It's reason is mainly attributed to attitudes of their family's tendency to conceal any existence of such memeber in the household. As a result, the statis-tics on the handicapped population is very inaccurate and under satisfaction. We must produce such statistics periodically in time and with accuracy. Thus, this study porposes five methods which, we believe, can produce reliable statistics of thehandcapped population : 1) vitalization through enforcement of handicapped information into the registration system, 2) inclusion in population census of items related to handicapped information, 3) improvement of the physically handicapped population survey scheme, 4) utilization of hospital patients' records for development of the statistics, and 5) an estimation through the labor force survey.

  • PDF

Prediction of fine dust PM10 using a deep neural network model (심층 신경망모형을 사용한 미세먼지 PM10의 예측)

  • Jeon, Seonghyeon;Son, Young Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.2
    • /
    • pp.265-285
    • /
    • 2018
  • In this study, we applied a deep neural network model to predict four grades of fine dust $PM_{10}$, 'Good, Moderate, Bad, Very Bad' and two grades, 'Good or Moderate and Bad or Very Bad'. The deep neural network model and existing classification techniques (such as neural network model, multinomial logistic regression model, support vector machine, and random forest) were applied to fine dust daily data observed from 2010 to 2015 in six major metropolitan areas of Korea. Data analysis shows that the deep neural network model outperforms others in the sense of accuracy.

Prediction improvement of election polls by unstructured data analysis (비정형 데이터 분석을 통한 선거 여론조사 예측력 개선 방안 연구)

  • Park, Sunbin;Kim, Myung Joon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.655-665
    • /
    • 2018
  • Social network services (SNS) have become the most common tool for the communication of public and private opinions as well as public issues; consequently, one may form or drive public opinions to advocate by spreading positive content using SNS. Controversy for survey data based opinion poll accuracy continues in relation to response rate or sampling methodology. This study suggests complementary measures that additionally consider the sentiment analysis results of unstructured data on a social network by data crawling and sentiment dictionary adjustment process. The suggested method shows the improvement of prediction accuracy by decreasing error rates.