• 제목/요약/키워드: under-sampling

검색결과 1,096건 처리시간 0.022초

An Additive Quantitative Randomized Response Model by Cluster Sampling

  • Lee, Gi-Sung
    • 응용통계연구
    • /
    • 제25권3호
    • /
    • pp.447-456
    • /
    • 2012
  • For a sensitive survey in which the population is comprised of several clusters with a quantitative attribute, we present an additive quantitative randomized response model by cluster sampling that adapts a two-stage cluster sampling instead of a simple random sample based on Himmelfarb-Edgell's additive quantitative attribute model and Gjestvang-Singh's one. We also derive optimum values for the number of 1st stage clusters and the optimum values of observation units in a 2nd stage cluster under the condition of minimizing the variance given constant cost. We can see that Himmelfarb-Edgell's model is more efficient than Gjestvang-Singh's model under the condition of cluster sampling.

계급불균형자료의 분류: 훈련표본 구성방법에 따른 효과 (Classification of Class-Imbalanced Data: Effect of Over-sampling and Under-sampling of Training Data)

  • 김지현;정종빈
    • 응용통계연구
    • /
    • 제17권3호
    • /
    • pp.445-457
    • /
    • 2004
  • 두 계급의 분류문제에서 두 계급의 관측 개체수가 심하게 불균형을 이룬 자료를 분석할 때, 흔히 인위적으로 두 계급의 크기를 비슷하게 해준 다음 분석한다. 본 연구에서는 이런 훈련표본 구성방법의 타당성에 대해 알아보았다. 또한 훈련표본의 구성방법이 부스팅에 미치는 효과에 대해서도 알아보았다. 12개의 실제 자료에 대한 실험 결과 나무모형으로 부스팅 기법을 적용할 때는 훈련표본을 그대로 둔 채 분석하는 것이 좋다는 결론을 얻었다.

순서를 갖는 척도모수들의 사전정보 하에 k-모집단 와이블분포의 베이지안 모수추정 (Bayesian Estimation of k-Population Weibull Distribution Under Ordered Scale Parameters)

  • 손영숙;김성욱
    • 응용통계연구
    • /
    • 제16권2호
    • /
    • pp.273-282
    • /
    • 2003
  • 순서화된 척도모수들의 사전정보를 가지는 k-모집단 와이블분포의 모수추정을 위한 베이지안방법이 제시된다. 모수추정은 깁스샘플링에 의해서 이루어지며, 특히 깁스샘플러에서 형태모수의 조건부 사후분포는 로그-오목함수이므로 적응기각표집(Adaptive Rejection Sampling: ARS)방법에 의해 모수생성을 하였다. 논의된 모수추정법을 전기 절연유체 고장시간자료에 적용하여 척도모수의 순서화정보를 반영한 경우와 그렇지 않은 경우를 비교하였다.

가변 샘플링간격 EPC/SPC 결합시스템의 개발 (Development of Integrated Variable Sampling Interval EngineeringProcess Control & Statistical Process Control System)

  • 이성재;서순근
    • 대한산업공학회지
    • /
    • 제32권3호
    • /
    • pp.210-218
    • /
    • 2006
  • Traditional statistical process control (SPC) applied to discrete part industry in the form of control charts can look for and eliminate assignable causes by process monitoring. On the other hand, engineering process control (EPC) applied to the process industry in the form of feedback control can maintain the process output on the target by continual adjustment of input variable. This study presents controlling and monitoring rules adopted by variable sampling interval (VSI) to change sampling intervals in a predetermined fashion on the predicted process levels under integrated EPC and SPC systems. Twelve rules classified by EPC schemes(MMSE, constrained PI, bounded or deadband adjustment policy) and type of sampling interval combined with EWMA chart of SPC are proposed under IMA (1,1) disturbance model and zero-order (responsive) dynamic system. Properties of twelve control rules under three patterns of process change (sudden shift, drift and random shift) are evaluated and discussed through simulation and control rules for integrated VSI EPC and SPC systems are recommended.

Optimal Design of the Adaptive Searching Estimation in Spatial Sampling

  • Pyong Namkung;Byun, Jong-Seok
    • Communications for Statistical Applications and Methods
    • /
    • 제8권1호
    • /
    • pp.73-85
    • /
    • 2001
  • The spatial population existing in a plane ares, such as an animal or aerial population, have certain relationships among regions which are located within a fixed distance from one selected region. We consider with the adaptive searching estimation in spatial sampling for a spatial population. The adaptive searching estimation depends on values of sample points during the survey and on the nature of the surfaces under investigation. In this paper we study the estimation by the adaptive searching in a spatial sampling for the purpose of estimating the area possessing a particular characteristic in a spatial population. From the viewpoint of adaptive searching, we empirically compare systematic sampling with stratified sampling in spatial sampling through the simulation data.

  • PDF

불균형적인 이항 자료 분석을 위한 샘플링 알고리즘들: 성능비교 및 주의점 (On sampling algorithms for imbalanced binary data: performance comparison and some caveats)

  • 김한용;이우주
    • 응용통계연구
    • /
    • 제30권5호
    • /
    • pp.681-690
    • /
    • 2017
  • 파산감지, 스팸메일 감지, 불량품 감지 등 일상생활에서 불균형적인 이항 분류 문제를 다양하게 접할 수 있다. 반응변수의 클래스의 비율이 상당히 불균형한 경우 이항 분류 모형의 예측 성능이 좋지 않다는 점은 이미 잘 알려진 사실이다. 이러한 문제점을 해결하기 위해 그 동안 오버 샘플링, 언더 샘플링, SMOTE와 같은 여러 샘플링 기법이 개발되어 왔다. 본 연구에서는 분류 모형으로 많이 사용되는 기계학습모형으로 로지스틱 회귀모형, Lasso, 랜덤포레스트, 부스팅, 서포트 벡터 머신을 위의 샘플링 기법들과 결합하여 사용했을 때의 예측 성능을 살펴보았다. 실질적인 예측 성능의 개선 여부를 확인하기 위해 네 개의 실제 자료를 분석하였다. 이와 더불어, 샘플링 방법이 사용될 때 주의해야 할 점에 대해서 강조하였다.

LULUCF 부문 산림 온실가스 인벤토리 구축을 위한 Sampling과 Wall-to-Wall 방법론 비교 (Comparison of Sampling and Wall-to-Wall Methodologies for Reporting the GHG Inventory of the LULUCF Sector in Korea)

  • 박은빈;송철호;함보영;김지원;이종열;최솔이;이우균
    • 한국기후변화학회지
    • /
    • 제9권4호
    • /
    • pp.385-398
    • /
    • 2018
  • Although the importance of developing reliable and systematic GHG inventory has increased, the GIS/RS-based national scale LULUCF (Land Use, Land-Use Change and Forestry) sector analysis is insufficient in the context of the Paris Agreement. In this study, the change in $CO_2$ storage of forest land due to land use change is estimated using two GIS/RS methodologies, Sampling and Wall-to-Wall methods, from 2000 to 2010. Particularly, various imagery with sampling data and land cover maps are used for Sampling and Wall-to-Wall methods, respectively. This land use matrix of these methodologies and the national cadastral statistics are classified by six land-use categories (Forest land, Cropland, Grassland, Wetlands, Settlements, and Other land). The difference of area between the result of Sampling methods and the cadastral statistics decreases as the sample plot distance decreases. However, the difference is not significant under a 2 km sample plot. In the 2000s, the Wall-to-Wall method showed similar results to sampling under a 2 km distance except for the Settlement category. With the Wall-to-Wall method, $CO_2$ storage is higher than that of the Sampling method. Accordingly, the Wall-to-Wall method would be more advantageous than the Sampling method in the presence of sufficient spatial data for GHG inventory assessment. These results can contribute to establish an annual report system of national greenhouse gas inventory in the LULUCF sector.

비선형 시스템 출력 조절과 샘플링 영향 (Outpput Regulation of Nonlinear Systems and Time-Sampling Effects)

  • 정선태
    • 전자공학회논문지S
    • /
    • 제35S권11호
    • /
    • pp.96-105
    • /
    • 1998
  • 비선형 시스템 출력 조절기의 디지털 구현시에 고려해야 할 샘플링 영향을 조사하였다. 조사결과, 선형 시스템에서와 마찬가지로 '출력 조절됨'은 보존되나, 일반적으로 '출력 조절 가능성'은 보존되지 않음이 밝혀졌다. 또한, 출력 조절 가능성이 보존되는 것을 쉽게 판별할 수 있는 비선형 시스템의 어떤 종류를 파악하였다. 이러한 결과는 일반적으로 연속시간 비선형 시스템에서 설계된 출력 조절기를 이산화하여 얻은 디지털 출력 조절기는 샘플링 시간에 대해 1차 근사에 불구함을 알려준다. 따라서, 이 결과는 일반적으로 보다 개선된 근사 샘플치 비선형 출력 조절기를 구할 필요가 있다는 것을 암시한다.

  • PDF

EFFICIENT ESTIMATION OF POPULATION MEAN IN STRATIFIED SAMPLING USING REGRESSION TYPE ESTIMATOR

  • Grover Lovleen Kumar
    • Journal of the Korean Statistical Society
    • /
    • 제35권4호
    • /
    • pp.441-452
    • /
    • 2006
  • Here an efficient regression type estimator for a stratified population mean is proposed under the two-phase sampling scheme. While constructing the proposed estimator, it is assumed that the first auxiliary variable x is directly and highly correlated with the study variable y, and the second auxiliary variable z is directly and highly correlated with the first auxiliary variable x. However the variable z is not directly correlated with the variable y, but they are just correlated with each other only due to their direct and high correlation with the variable x. The proposed regression type estimator is found to be always more efficient than the existing estimators defined under the same situation.

가변 샘플링간격 EPC/SPC 결합시스템의 개발 (Development of Integrated Variable Sampling Interval Engineering Process Control & Statistical Process Control System)

  • 이성재;서순근
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회/대한산업공학회 2005년도 춘계공동학술대회 발표논문
    • /
    • pp.723-729
    • /
    • 2005
  • Traditional statistical process control(SPC) applied to discrete part industry in the form of control charts can look for and eliminate assignable causes by process monitoring. On the other hand, engineering process control(EPC) applied to the process industry in the form of feedback control can maintain the process output on the target by continual adjustment of input variable. This study presents controlling and monitoring rules adopted variable sampling interval(VSI) to change sampling intervals in a predetermined fashion on the predicted process levels for integrated EPC and SPC systems. Twelve rules classified by EPC schemes(MMSE, constrained PI, bounded or deadband adjustment policy) and type of sampling interval combined with EWMA chart of SPC are proposed under IMA(1,1) disturbance model and zero-order (responsive) dynamic system. The properties of twelve control rules under three patterns of process change(sudden shift, drift and random shift) are evaluated and discussed through simulation and control rules for integrated VSI EPC and SPC systems are recommended.

  • PDF