• Title/Summary/Keyword: High order statistics

Search Result 551, Processing Time 0.024 seconds

Variable Selection Via Penalized Regression

  • Yoon, Young-Joo;Song, Moon-Sup
    • Communications for Statistical Applications and Methods
    • /
    • v.12 no.3
    • /
    • pp.615-624
    • /
    • 2005
  • In this paper, we review the variable-selection properties of LASSO and SCAD in penalized regression. To improve the weakness of SCAD for high noise level, we propose a new penalty function called MSCAD which relaxes the unbiasedness condition of SCAD. In order to compare MSCAD with LASSO and SCAD, comparative studies are performed on simulated datasets and also on a real dataset. The performances of penalized regression methods are compared in terms of relative model error and the estimates of coefficients. The results of experiments show that the performance of MSCAD is between those of LASSO and SCAD as expected.

Fast Simulation of Overflow Probabilities in Multiclass Queues

  • Lee, Ji-Yeon;Bae, Kyung-Soon
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.2
    • /
    • pp.287-299
    • /
    • 2007
  • We consider a multiclass queue where queued customers are served in their order of arrival at a rate which depends on the customer type. By using the asymptotic results obtained by Dabrowski et al. (2006) we calculate the sharp asymptotics of the stationary distribution of the number of customers of each class in the system and the distribution of the number of customers of each class when the total number of customers reaches a high level before emptying. We also obtain a fast simulation algorithm to estimate the overflow probability and compare it with the general simulation and asymptotic results.

INVESTIGATION OF CLOUD COVERAGE OVER ASIA WITH NOAA AVHRR TIME SERIES

  • Takeuchit Wataru;Yasuokat Yoshifumi
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.26-29
    • /
    • 2005
  • In order to compute cloud coverage statistics over Asian region, an operational scheme for masking cloud-contaminated pixels in Advanced Very High Resolution Radiometer (AVHRR) daytime data was developed, evaluated and presented. Dynamic thresholding was used with channell, 2 and 3 to automatically create a cloud mask for a single image. Then the IO-day cloud coverage imagery was generated over the whole Asian region along with cloud-free composite imagery. Finally the monthly based statistics were computed based on the derived cloud coverage imagery in terms of land cover and country. As a result, it was found that 20-day is required to acquire the cloud free data over the whole Asia using NOAA AVHRR. The to-day cloud coverage and cloud-free composite imagery derived in this research is available via the web-site http://webpanda.iis.u-tokyo.ac.jp/CloudCover/.

  • PDF

On Quantifies Estimation Using Ranked Samples with Some Applications

  • Samawi, Hani-M.
    • Journal of the Korean Statistical Society
    • /
    • v.30 no.4
    • /
    • pp.667-678
    • /
    • 2001
  • The asymptotic behavior and distribution for quantiles estimators using ranked samples are introduced. Applications of quantiles estimation on finding the normal ranges (2.5% and 97.5% percentiles) and the median of some medical characteristics and on finding the Hodges-Lehmann estimate are discussed. The conclusion of this study is, whenever perfect ranking is possible, the relative efficiency of quantiles estimation using ranked samples relative to SRS is high. This may translates to large savings in cost and time. Also, this conclusion holds even if the ranking is not perfect. Computer simulation results are given and real data from lows 65+ study is used to illustrate the method.

  • PDF

VARIABLE SELECTION VIA PENALIZED REGRESSION

  • Yoon, Young-Joo;Song, Moon-Sup
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2005.05a
    • /
    • pp.7-12
    • /
    • 2005
  • In this paper, we review the variable-selection properties of LASSO and SCAD in penalized regression. To improve the weakness of SCAD for high noise level, we propose a new penalty function called MSCAD which relaxes the unbiasedness condition of SCAD. In order to compare MSCAD with LASSO and SCAD, comparative studies are performed on simulated datasets and also on a real dataset. The performances of penalized regression methods are compared in terms of relative model error and the estimates of coefficients. The results of experiments show that the performance of MSCAD is between those of LASSO and SCAD as expected.

  • PDF

A Bulk Sampling Plan for Reliability Assurance (벌크재료의 신뢰성보증을 위한 샘플링검사 방식)

  • Kim, Dong-Chul;Kim, Jong-Gurl
    • Journal of the Korea Safety Management & Science
    • /
    • v.9 no.2
    • /
    • pp.123-134
    • /
    • 2007
  • This paper focuses on the in-house reliability assurance plan for the bulk materials of each company. The reliability assurance needs in essence a long time and high cost for testing the materials. In order to reduce the time and cost, accelerated life test is adopted. The bulk sampling technique was used for acceptance. Design parameters might be total sample size(segments and increments}, stress level and so on. We focus on deciding the sample size by minimizing the asymptotic variance of test statistics as well as satisfying the consumer's risk. In bulk sampling, we also induce the sample size by adapting the normal life time distribution model when the variable of the lognormal life time distribution is transformed and adapted to the model. In addition, the sample size for both the segments and increments can be induced by minimizing the asymptotic variance of test statistics of the segments and increments with consumer's risk met. We can assure the reliability of the mean life and B100p life time of the bulk materials by using the calculated minimum sample size.

Implementation of Questionnaire and Customer Satisfaction Investigation System on Internet (인터넷을 이용한 설문조사와 고객만족도조사 시스템구현)

  • Namkung, Pyong
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.3
    • /
    • pp.713-727
    • /
    • 2005
  • The advantage of an internet survey is the speed with which data can be accumulated from respondents. And this method is more economical, provides more accurate information, and has greater scope in subject coverage. Since there is used multi-media, the design of questionnaires is even more important in order to achieve high data quality.

Model selection for unstable AR process via the adaptive LASSO (비정상 자기회귀모형에서의 벌점화 추정 기법에 대한 연구)

  • Na, Okyoung
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.6
    • /
    • pp.909-922
    • /
    • 2019
  • In this paper, we study the adaptive least absolute shrinkage and selection operator (LASSO) for the unstable autoregressive (AR) model. To identify the existence of the unit root, we apply the adaptive LASSO to the augmented Dickey-Fuller regression model, not the original AR model. We illustrate our method with simulations and a real data analysis. Simulation results show that the adaptive LASSO obtained by minimizing the Bayesian information criterion selects the order of the autoregressive model as well as the degree of differencing with high accuracy.

Locally Optimum Detection of Signals in first-order Markov Environment: 1. Test Statistics (일차 마르코프 잡음 환경에서의 국소 최적 검파: 1. 검정 통계량)

  • Lee, Ju-Mi;Park, Ju-Ho;Song, Iic-Ho;Kwon, Hyoung-Moon;Kim, Jong-Jik;Yoon, Seok-Ho
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.10C
    • /
    • pp.973-980
    • /
    • 2006
  • In most of the studies on locally optimum detection assumes independent observations. The use of an independent observation model may cause a considerable performance degradation in detection applications of modern high data rate communication systems exhibiting dependence among interference components. In this paper, we address the detection of weak known signals in multiplicative and first order Markov additive noises. In Part 1, the test statistics of the locally optimum detectors are investigated in detail. In Part 2, the asymptotic and finite sample-size performance of several detectors are obtained and compared, confirming that the dependence among interference components need to be taken into account to maintain performance appropriately.

A Study on the effects of air pollution on circulatory health using spatial data (공간 자료를 이용한 대기오염이 순환기계 건강에 미치는 영향 분석)

  • Park, Jin-Ok;Choi, Ilsu;Na, Myung Hwan
    • Journal of Korean Society for Quality Management
    • /
    • v.44 no.3
    • /
    • pp.677-688
    • /
    • 2016
  • Purpose: In this study, we examine the effects of circulatory diseases mortality in South Korea 2005-2013 using the air pollution index, Methods: We cluster the region of high risk mortality by SaTScan$^{TM}$9.3.1 and compare this result with the regional distribution of air pollution. We use the Geographically Weighted Regression (GWR) to consider the spatial heterogeneity of data collected by administrative district in order to estimate the model. As GWR is spatial analysis techniques utilizing the spatial information, regression model estimated for each region on the assumption that regression coefficients are different by region. Results: As a result of estimating model of the collected air pollution index, circulatory diseases mortality data combined with the spatial information, GWR was found to solve the problem of spatial autocorrelation and increase the fit of the model than OLS regression model. Conclusion: GWR is used to select the air pollution affecting the disease each year, the K-means cluster analysis discover the characteristics of the distribution of air pollution by region.