• Title/Abstract/Keywords: Sampling effect


Classification of Class-Imbalanced Data: Effect of Over-sampling and Under-sampling of Training Data

  • 김지현;정종빈
    • 응용통계연구 / Vol. 17, No. 3 / pp.445-457 / 2004
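  • In two-class classification problems, when the numbers of observations in the two classes are severely imbalanced, the class sizes are often artificially made similar before the data are analyzed. This study examines the validity of such methods of constructing the training sample. It also examines the effect of the training-sample construction method on boosting. Experiments on 12 real data sets led to the conclusion that, when boosting is applied with tree models, it is better to analyze the training sample as it is.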
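A minimal sketch of the two training-sample constructions named in the title above, random under-sampling and random over-sampling, may make the comparison concrete. The function name `rebalance`, its parameters, and the toy data are hypothetical and are not taken from the paper, which may have used different resampling rules.

```python
import random
from collections import Counter

def rebalance(rows, labels, mode="under", seed=0):
    """Return (rows, labels) with both classes resampled to the same size.

    mode="under": randomly drop majority-class rows (without replacement).
    mode="over":  randomly duplicate minority-class rows (with replacement).
    """
    if mode not in ("under", "over"):
        raise ValueError("mode must be 'under' or 'over'")
    rng = random.Random(seed)

    # Group the rows by class label.
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    if len(by_class) != 2:
        raise ValueError("expected exactly two classes")

    sizes = [len(members) for members in by_class.values()]
    target = min(sizes) if mode == "under" else max(sizes)

    out_rows, out_labels = [], []
    for label, members in by_class.items():
        if len(members) > target:
            chosen = rng.sample(members, target)  # shrink the majority class
        elif len(members) < target:
            extra = rng.choices(members, k=target - len(members))
            chosen = members + extra              # grow the minority class
        else:
            chosen = list(members)
        out_rows.extend(chosen)
        out_labels.extend([label] * target)
    return out_rows, out_labels

# Toy data: 10 positives and 90 negatives (illustrative only).
rows = list(range(100))
labels = [1] * 10 + [0] * 90
under_rows, under_labels = rebalance(rows, labels, mode="under")
over_rows, over_labels = rebalance(rows, labels, mode="over")
print(Counter(under_labels), Counter(over_labels))
```

Either resampled set can then be passed to a classifier in place of the original training sample, which is the kind of comparison the abstract describes.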

An Effect of Sampling Rate to the Time and Frequency Domain Analysis of Pulse Rate Variability

  • 양윤라;신항식
    • 전기학회논문지 / Vol. 65, No. 7 / pp.1247-1251 / 2016
  • This study investigates the effect of sampling frequency on the time-domain and frequency-domain analysis of pulse rate variability (PRV). Typical time-domain variables (AVNN, SDNN, SDSD, RMSSD, NN50 count, and pNN50) and frequency-domain variables (VLF, LF, HF, LF/HF, Total Power, nLF, and nHF) were derived from seven down-sampled PRVs (250 Hz, 100 Hz, 50 Hz, 25 Hz, 20 Hz, 15 Hz, and 10 Hz) and compared with the heart rate variability obtained from an electrocardiogram sampled at 10 kHz. The results showed that every time-domain variable of PRV was significant at sampling frequencies of 25 Hz or higher, and that every frequency-domain variable of PRV was significant at sampling frequencies of 15 Hz or higher.
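The time-domain variables listed in the abstract above have commonly used definitions, sketched below for a series of NN intervals in milliseconds. The function names and sample values are hypothetical. Conventions differ between tools (for example, sample versus population standard deviation, and the denominator of pNN50), so this is one plausible reading and not necessarily the authors' computation. The second function is a crude stand-in for a lower sampling rate: it snaps beat times to a grid of 1/fs seconds.

```python
import math
import statistics

def time_domain_metrics(nn_ms):
    """Common time-domain variability measures from NN intervals in ms."""
    diffs = [b - a for a, b in zip(nn_ms, nn_ms[1:])]  # successive differences
    nn50 = sum(1 for d in diffs if abs(d) > 50)
    return {
        "AVNN": statistics.mean(nn_ms),        # mean NN interval
        "SDNN": statistics.stdev(nn_ms),       # sample SD of NN intervals
        "SDSD": statistics.stdev(diffs),       # sample SD of the differences
        "RMSSD": math.sqrt(sum(d * d for d in diffs) / len(diffs)),
        "NN50": nn50,                          # count of |difference| > 50 ms
        "pNN50": 100.0 * nn50 / len(diffs),    # NN50 as % of all differences
    }

def requantize_beats(nn_ms, fs_hz):
    """Snap cumulative beat times to a 1/fs grid and return the new intervals.

    Assumption: this only mimics the timing resolution of a lower sampling
    rate; it does not model pulse-peak detection on a down-sampled waveform.
    """
    step = 1000.0 / fs_hz  # grid spacing in ms
    t, beats = 0.0, [0.0]
    for nn in nn_ms:
        t += nn
        beats.append(round(t / step) * step)
    return [b - a for a, b in zip(beats, beats[1:])]

nn = [800, 810, 790, 860, 800]  # illustrative values only
print(time_domain_metrics(nn))
print(time_domain_metrics(requantize_beats(nn, 25)))
```

At 25 Hz the grid spacing is 40 ms, so every interval becomes a multiple of 40 ms, which gives a rough sense of why measures built on successive differences are sensitive to sampling rate.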

Effect of Sampling for Multi-set Cardinality Estimation

  • ;양대헌;이경희
    • 정보처리학회논문지:컴퓨터 및 통신 시스템 / Vol. 4, No. 1 / pp.15-22 / 2015
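  • Estimating the number of distinct elements in a multiset, excluding duplicates, is a very well-known problem in network traffic measurement, and many algorithms have been proposed for it. Recently, an algorithm based on Linear Counting was developed that estimates the cardinality of a multiset using only a very small amount of memory. When there is too much data to process in full, only a sample of the packets is processed, and this sampling is generally known to have a negative effect on accuracy. In this paper, however, we show that when CSE is used to estimate multiset cardinality, sampling can actually give better results, in terms of accuracy and estimation range, than MCSE, which examines all of the data. To demonstrate this, we carry out a mathematical analysis and experiments on real data, and compare CSE, MCSE, and CSES.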
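For orientation, the Linear Counting estimator that the abstract above builds on can be sketched in a few lines: hash every item into an m-bit bitmap and estimate the number of distinct items as -m ln(V), where V is the fraction of bits still zero. This is plain Linear Counting only, not the CSE, MCSE, or CSES schemes compared in the paper. The function name, the bitmap size, and the choice of SHA-256 as the hash are assumptions made for illustration.

```python
import hashlib
import math

def linear_count(items, m=4096):
    """Estimate the number of distinct items with an m-position bitmap."""
    bitmap = bytearray(m)  # one byte per bit position, for readability
    for item in items:
        digest = hashlib.sha256(str(item).encode("utf-8")).digest()
        position = int.from_bytes(digest[:8], "big") % m
        bitmap[position] = 1  # duplicates hit the same position again
    zeros = m - sum(bitmap)
    if zeros == 0:
        raise ValueError("bitmap is saturated; use a larger m")
    return -m * math.log(zeros / m)

# 1,000 distinct addresses, each seen three times (illustrative data).
packets = [f"10.0.{i // 256}.{i % 256}" for i in range(1000)] * 3
print(round(linear_count(packets)))
```

Because a repeated item always sets the same bit, duplicates do not change the estimate; the estimate degrades as the bitmap fills up, which is why the usable estimation range depends on m.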

A GENERALIZATION OF THE INTRACLASS CORRELATION IN CLUSTER SAMPLING

  • KIM KYU-SEONG
    • Journal of the Korean Statistical Society / Vol. 34, No. 3 / pp.185-195 / 2005
  • This article is concerned with the intraclass correlation in survey sampling. From a design-based viewpoint, the intraclass correlation is generalized to a finite population with unequal-sized clusters. Under simple random cluster sampling the intraclass correlation is given in an explicit form, which is a generalization of the usual one. Its range is found and the design effect is expressed in terms of it. An example is given to compare the intraclass correlation with the homogeneity measure numerically, which shows that the two measures are not the same except in some limited cases.

Sampling Jitter Effect on a Reconfigurable Digital IF Transceiver to WiMAX and HSDPA

  • Yu, Bong-Guk;Lee, Jae-Kwon;Kim, Jin-Up;Lim, Kyu-Tae
    • ETRI Journal / Vol. 33, No. 3 / pp.326-334 / 2011
  • This paper outlines the time jitter effect of a sampling clock on a software-defined radio technology-based digital intermediate frequency (IF) transceiver for a mobile communication base station. The implemented digital IF transceiver is reconfigurable to high-speed downlink packet access (HSDPA) and three bandwidth profiles: 1.75 MHz, 3.5 MHz, and 7 MHz, each incorporating the IEEE 802.16d worldwide interoperability for microwave access (WiMAX) standard. This paper examines the relationship between the signal-to-noise ratio (SNR) characteristics of a digital IF transceiver with an under-sampling scheme and the sampling jitter effect on a multichannel orthogonal frequency-division multiplexing (OFDM) signal. The simulation and experimental results show that the SNR of the OFDM system with narrower band profiles is more susceptible to sampling clock jitter than systems with relatively wider band profiles. Further, for systems with a comparable bandwidth, HSDPA outperforms WiMAX; for example, there is a 5 dB error vector magnitude improvement at 15 picoseconds of time jitter for a bandwidth equal to the WiMAX 3.5 MHz profile.

Effect of sampling volume on the breakthrough of charcoal tube during vinyl chloride monomer sampling

  • 윤존중;임남구;김치년;노재훈
    • 한국산업보건학회지 / Vol. 11, No. 3 / pp.241-248 / 2001
  • The main factors affecting breakthrough are known to be sampling time, flow rate, concentration of the sample, temperature, humidity, and the physical characteristics of the solid sorbent tube. However, no study has reported the effect of temperature and sampling volume on the breakthrough of a charcoal tube during vinyl chloride monomer (VCM) sampling. The objective of this study is to suggest the optimal sampling conditions for VCM sampling based on the National Institute for Occupational Safety and Health (NIOSH) method. To determine an adequate sampling volume for VCM without breakthrough, volumes of 1, 2, 3, 4, and 5 L were sampled from VCM at 1, 5, 10, 15, and 20 ppm at a flow rate of 0.05 L/min, at $22^{\circ}C$ and $40^{\circ}C$. At $22^{\circ}C$, at 1, 5, 10, and 15 ppm, VCM was adsorbed completely in the first section of the charcoal tube regardless of sampling volume, but at 20 ppm the detection rates were 99.56% in the first section and 0.44% in the second section. At $40^{\circ}C$ and 1 ppm, VCM was adsorbed completely in the first section. At 10, 15, and 20 ppm, the detection rates in the second, third, and fourth sections decreased significantly as the sampling volume was reduced. Judged by the NIOSH method, no breakthrough occurred at 20 ppm at $22^{\circ}C$. At $40^{\circ}C$, breakthrough occurred at 10, 15, and 20 ppm when the sampling volume was 5 L, but not when it was 3 L. In summary, at temperatures around $22^{\circ}C$, breakthrough may not occur at concentrations up to 20 ppm during VCM sampling. At temperatures around $40^{\circ}C$, no breakthrough occurred at 1-5 ppm with a sampling volume of 5 L, or at 10-20 ppm with a sampling volume of 3 L. These results suggest that sampling volume should be considered when sampling VCM under hot conditions (> $22^{\circ}C$) by NIOSH method No. 1007.


Two-stage Sampling for Estimation of Prevalence of Bovine Tuberculosis

  • 박선일
    • 한국임상수의학회지 / Vol. 28, No. 4 / pp.422-426 / 2011
  • For a national survey that targets a wide geographic region or an entire country, a multi-stage sampling approach is widely used to overcome the problems of simple random sampling, to consider both herd- and animal-level factors associated with disease occurrence, and to adjust for the clustering effect of disease in the population when calculating the sample size. The aim of this study was to establish the sample size for estimating bovine tuberculosis (TB) in Korea using a stratified two-stage sampling design. The sample size was determined by taking into account the possible clustering of TB-infected animals within individual herds, to increase the reliability of the survey results. In this study, the country was stratified into nine provinces (administrative units), and the herd, the primary sampling unit, was considered as a cluster. For all analyses, a design effect of 2, a between-cluster prevalence of 50% (to yield the maximum sample size), and a mean herd size of 65 were assumed because of the lack of available information. Using a two-stage sampling scheme, the number of cattle sampled per herd was 65, regardless of the confidence level, prevalence, and mean herd size examined. The number of clusters to be sampled at a 95% level of confidence was estimated to be 296, 74, 33, 19, 12, and 9 for a desired precision of 0.01, 0.02, 0.03, 0.04, 0.05, and 0.06, respectively. Therefore, the total sample size with a 95% confidence level was 172,872, 43,218, 19,224, 10,818, 6,930, and 4,806 for desired precision ranging from 0.01 to 0.06. The sample size increased with the desired precision and the design effect. In a situation where the number of cattle sampled per herd is fixed, ranging from 5 to 40 with a 5-head interval, the total sample size with a 95% confidence level was estimated to be 6,480, 10,080, 13,770, 17,280, 20,925, 24,570, 28,350, and 31,680, respectively. The percent increase in total sample size resulting from the use of an intra-cluster correlation coefficient of 0.3 was 22.2, 32.1, 36.3, 39.6, 41.9, 42.9, 42.2, and 44.3%, respectively, in comparison to the use of a coefficient of 0.2.
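The cluster counts and totals quoted above are consistent with the textbook sample-size formula for a proportion, n = z²p(1-p)/d², inflated by the design effect, as the sketch below illustrates. The formula, the rounding order, and the multiplication by the nine strata are assumptions inferred from the reported figures, since the abstract does not state how they were computed. The function name `survey_size` is hypothetical, and exact fractions are used so that rounding up is not disturbed by floating-point error.

```python
import math
from fractions import Fraction

Z_95 = Fraction(196, 100)  # normal quantile for 95% confidence (1.96)

def survey_size(precision, deff=2, prevalence=Fraction(1, 2),
                herd_size=65, strata=9):
    """Return (clusters to sample, total sample size over all strata).

    Assumed steps: simple-random-sampling size, rounded up; multiplied by
    the design effect; divided by herd size to get clusters; multiplied by
    the number of strata to get the total.
    """
    d = Fraction(precision)  # e.g. "0.01" becomes exactly 1/100
    n_srs = math.ceil(Z_95 ** 2 * prevalence * (1 - prevalence) / d ** 2)
    n_design = n_srs * deff
    clusters = math.ceil(Fraction(n_design, herd_size))
    return clusters, n_design * strata

for d in ("0.01", "0.02", "0.03", "0.04", "0.05", "0.06"):
    print(d, survey_size(d))
```

Under these assumptions the loop gives 296, 74, 33, 19, 12, and 9 clusters, with totals of 172,872 down to 4,806, matching the values in the abstract.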

The Effect of Sampling Frequency and Pulse Bandwidth on Estimating Mean Frequencies in an Ultrasonic Doppler System using the Second-Order Sampling

  • 안영복;박송배
    • 한국음향학회지 / Vol. 9, No. 3 / pp.48-55 / 1990
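  • We analyze the effect of second-order sampling on the estimation of the mean frequency of Doppler signals. To lower the sampling frequency of the analogue-to-digital converter, the Doppler signal can be obtained by sampling the received signal at a lower frequency of $4f_0/5$ or $4f_0/9$ instead of $4f_0$, where $f_0$ is the center frequency of the transmitted signal. Through computer simulation and experiment, we show that when the received signal is narrowband, as in typical Doppler systems, the error in the mean frequency estimate caused by the low sampling frequency is negligibly small.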
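The reason the sampling rates $4f_0/5$ and $4f_0/9$ can stand in for $4f_0$ can be checked with standard aliasing arithmetic: in all three cases the carrier $f_0$ appears at one quarter of the sampling rate after sampling. The sketch below is a generic bandpass-sampling calculation, not the authors' analysis, and the function name is hypothetical.

```python
from fractions import Fraction

def alias_of_carrier(f0_over_fs):
    """Fold a tone at f0 into [0, fs/2] after sampling at fs.

    Returns (apparent frequency as a fraction of fs, spectrum_inverted).
    """
    r = Fraction(f0_over_fs) % 1  # position within one sampling period
    inverted = r > Fraction(1, 2)  # upper half of the period folds back
    return (1 - r if inverted else r), inverted

# fs = 4*f0/k means f0/fs = k/4.
for k in (1, 5, 9):
    print(f"fs = 4*f0/{k}:", alias_of_carrier(Fraction(k, 4)))
```

For k = 1, 5, and 9 the carrier lands at fs/4 without spectral inversion, whereas k = 3 would also land at fs/4 but with the spectrum inverted.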


Design of an improved STT missile digital autopilot with respect to sampling time

  • 정선태
    • 제어로봇시스템학회:학술대회논문집 / 제어로봇시스템학회 1997년도 한국자동제어학술회의논문집; 한국전력공사 서울연수원; 17-18 Oct. 1997 / pp.468-471 / 1997
  • In this paper, we investigate the time-sampling effects on the digital implementation of a singular perturbation based STT autopilot with excellent performance, and propose a compensation method for these effects. In digitizing the analog STT autopilot, we find that the stability margin of the fast dynamics is the most affected and decreases rapidly. Based on this analysis, a composite digital controller with an additional compensator for the fast dynamics is proposed to mitigate the time-sampling effect, and a simulation verifies the result.


Design and Estimation of Multiple Acceptance Sampling Plans for Stochastically Dependent Nonstationary Processes

  • 김원경
    • 대한산업공학회지 / Vol. 25, No. 1 / pp.8-20 / 1999
  • In this paper, a design and estimation procedure for stochastically dependent nonstationary multiple acceptance sampling plans is developed. First, rough-cut acceptance and rejection numbers are obtained as an initial solution from the corresponding sequential sampling plan. A Monte-Carlo algorithm is used to find the acceptance and rejection probabilities of a lot. The conditional probability formula for a sample path is found. The acceptance and rejection probabilities are found when a decision boundary is given. Several decision criteria and a design procedure for selecting optimal plans are suggested. A formula for measuring the performance of these sampling plans is developed. Type I and II error probabilities are also estimated. As a special case, setting the stage size to 1 in a dependent sampling plan yields a sequential sampling plan that satisfies the type I and II error probabilities more accurately and has a smaller average sample number. In a numerical example, a Polya dependent process is examined. The sampling performance is reported in order to compare the selection schemes and the effect of changing the dependency factor.
