• Title/Summary/Keyword: data sampling

Search Result 5,054, Processing Time 0.034 seconds

Sampling Techniques for Wireless Data Broadcast in Communication (통신에서의 무선 데이터 방송을 위한 샘플링 기법)

  • Lee, Sun Yui;Park, Gooman;Kim, Jin Young
    • Journal of Satellite, Information and Communications
    • /
    • v.10 no.3
    • /
    • pp.57-61
    • /
    • 2015
  • This paper describes the basic principles of 3D broadcast system and proposes new 3D broadcast technology that reduces the amount of data by applying CS(Compressed Sensing). Differences between Sampling theory and the CS technology concept was described. CS algorithm SS-CoSaMP(Single-Space Compressive Sampling Matched Pursuit) and AMP(Approximate Message Passing) was described. Image data compressed and restored by these algorithm was compared. Calculation time of the algorithm having a low complexity is determined.

An Improved Linear Sampled-data Output Regulator (개선된 선형 샘플치 출력 조절기)

  • Chung, Sun-Tae
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.10
    • /
    • pp.85-93
    • /
    • 1998
  • In general, the solvability of linear robust output regulation problem are not preserved under time-sampling. Thus, it is found that the digital regulator implemented by time-sampling of analog output regulator designed based on the continuous-time linear system model is nothing but a 1st order approximation with respect to time-sampling. However, one can design an improved sampled-data regulator with respect to sampling time by utilizing teh intrinsic structure of the system. In this paper, we study the system structures for which it is possible to design an improved sampled-data regulator with respect to sampling time.

  • PDF

A Study on the Sampling of Ocean Meteorological Data to Analyze Signature of Naval Ships (함정 신호해석 연구에 필요한 해양기상환경 자료의 표본추출에 관한 연구)

  • Cho, Yong-Jin
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.23 no.2
    • /
    • pp.19-28
    • /
    • 2018
  • In this paper, we studied on the sampling of ocean meteorological data to analyze signature of naval ships. The newest ocean meteorological data, that was quality controled by the Korea Meteorological Administration(KMA), was collected. Outliers were removed from the data by setting the usable range of data. After that, the data size was reduced through the random sampling method, taking geopolitical significance and effective area of buoy, for probabilistic analysis. Moreover, the sample sizes were set at 100, 200, and 400 by considering the population size and a 95% confidence level. The final sample was obtained using the two-dimensional stratified sampling method based on highly correlated water temperature and air temperature. The sum of the squared errors and the confidence interval was calculated to compare the result of sampling. As a result, this study proposed reasonable sample size for infra­red signature analysis of naval ships.

An Evaluation of Sampling Design for Estimating an Epidemiologic Volume of Diabetes and for Assessing Present Status of Its Control in Korea (우리나라 당뇨병의 역학적 규모와 당뇨병 관리현황 파악을 위한 표본설계의 평가)

  • Lee, Ji-Sung;Kim, Jai-Yong;Baik, Sei-Hyun;Park, Ie-Byung;Lee, June-Young
    • Journal of Preventive Medicine and Public Health
    • /
    • v.42 no.2
    • /
    • pp.135-142
    • /
    • 2009
  • Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a 'patient-hospital profile' that had 3,676,164 cases, and then to a 'patient profile' that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04%(95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into "metropolitan/city/county" and the types of hospital into "tertiary/secondary/primary/clinic" with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.

Influence Analysis of Sampling Points on Accuracy of Storage Reliability Estimation for One-shot Systems (원샷 시스템의 저장 신뢰성 추정 정확성에 대한 샘플링 시점의 영향 분석)

  • Chung, Yong H.;Oh, Bong S.;Lee, Hong C.;Park, Hee N.;Jang, Joong S.;Park, Sang C.
    • Journal of Applied Reliability
    • /
    • v.16 no.1
    • /
    • pp.32-40
    • /
    • 2016
  • Purpose: The purpose of this study is to analyze the effect of sampling points on accuracy of storage reliability estimation for one-shot systems by assuming a weibull distribution as a storage reliability distribution. Also propose method for determining of sampling points for increase the accuracy of reliability estimation. Methods: Weibull distribution was divided into three sections for confirming the possible to estimate the parameters of the weibull distribution only some section's sample. Generate quantal response data for failure data. And performed parameter estimation with quantal response data. Results: If reduce sample point interval of 1 section, increase the accuracy of reliability estimation although sampling only section 1. Even reduce total number of sampling point, reducing sampling time interval of the 1 zone improve the accuracy of reliability estimation. Conclusion: Method to increase the accuracy of reliability estimation is increasing number of sampling and the sampling points. But apply this method to One-shot system is difficult because test cost of one-shot system is expensive. So propose method of accuracy of storage reliability estimation of one-shot system by adjustment of the sampling point. And by dividing the section it could reduce the total sampling point.

Real-time Acquisition of Three Dimensional NMR Spectra by Non-uniform Sampling and Maximum Entropy Processing

  • Jee, Jun-Goo
    • Bulletin of the Korean Chemical Society
    • /
    • v.29 no.10
    • /
    • pp.2017-2022
    • /
    • 2008
  • Of the experiments to shorten NMR measuring time by sparse sampling, non-uniform sampling (NUS) is advantageous. NUS miminizes systematic errors which arise due to the lack of samplings by randomization. In this study, I report the real-time acquisition of 3D NMR data using NUS and maximum-entropy (MaxEnt) data processing. The real-time acquisition combined with NUS can reduce NMR measuring time much more. Compared with multidimensional decomposition (MDD) method, which was originally suggested by Jaravine and Orekhov (JACS 2006, 13421-13426), MaxEnt is faster at least several times and more suitable for the realtime acquisition. The designed sampling schedule of current study makes all the spectra during acquisition have the comparable resulting resolutions by MaxEnt. Therefore, one can judge the quality of spectra easily by examining the intensities of peaks. I report two cases of 3D experiments as examples with the simulated subdataset from experimental data. In both cases, the spectra having good qualitie for data analysis could be obtained only with 3% of original data. Its corresponding NMR measuring time was 8 minutes for 3D HNCO of ubiquitin.

Comparison of Two Methods for Measuring Daily Path Lengths in Arboreal Primates

  • Lappan, Susan
    • Journal of Ecology and Environment
    • /
    • v.30 no.2
    • /
    • pp.201-207
    • /
    • 2007
  • Researchers have used a variety of methods to measure patterns of animal movement, including the use of spatial data (mapping the position of a moving animal at specified intervals) and direct estimation of travel path length by pacing under a moving animal or group. I collected movement data from five groups of siamangs (Symphalangus syndactylus) using two different methods concurrently to estimate the effects of the method of data collection on estimates of daily path length (DPL). Estimates of DPL produced from spatial data collected at 15-minute intervals were 12% lower than estimates of DPL produced by pacing under the traveling animal. The actual magnitude of the difference was correlated with the travel distance, but there was no correlation between the proportional difference and the travel distance. While the collection of spatial data is generally preferable, as spatial data permit additional analyses of patterns of movements in two or three dimensions, the relatively small difference between the DPL's produced using different methods suggests that pacing is an acceptable substitute where the collection of spatial data is impractical. I also subsampled the spatial data at increasing time intervals to assess the effect of sampling interval on the calculation of daily path lengths. Longer sampling intervals produced significantly shorter estimates of travel paths than shorter sampling intervals. These results suggest that spatial data should be collected at short time intervals wherever possible, and that sampling intervals should not exceed 30 minutes. Researchers should be cautious when comparing data generated using different methods.

On the calibration problem with censored data (중도 절단 자료에서의 역추정 문제)

  • 박래현;이석훈;이낙영;박영옥;이상호
    • The Korean Journal of Applied Statistics
    • /
    • v.7 no.1
    • /
    • pp.1-17
    • /
    • 1994
  • This article basically considers the calibration problem with censored data from the Bayesian point of view. The Gibbs sampling method is discussed to solve the difficulty encountered in computing the posterior distribution. Also presented is an approach for impementing the Gibbs sampling in actual data situation with the estimation procedures.

  • PDF

Randomized Response Model with Discrete Quantitative Attribute by Three-Stage Cluster Sampling

  • Lee, Gi-Sung;Hong, Ki-Hak
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.4
    • /
    • pp.1067-1082
    • /
    • 2003
  • In this paper, we propose a randomized response model with discrete quantitative attribute by three-stage cluster sampling for obtaining discrete quantitative data by using the Liu & Chow model(1976), when the population was made up of sensitive discrete quantitative clusters. We obtain the minimum variance by calculating the optimum number of fsu, ssu, tsu under the some given constant cost. And we obtain the minimum cost under the some given accuracy.

  • PDF

Ensemble Learning for Solving Data Imbalance in Bankruptcy Prediction (기업부실 예측 데이터의 불균형 문제 해결을 위한 앙상블 학습)

  • Kim, Myoung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.15 no.3
    • /
    • pp.1-15
    • /
    • 2009
  • In a classification problem, data imbalance occurs when the number of instances in one class greatly outnumbers the number of instances in the other class. Such data sets often cause a default classifier to be built due to skewed boundary and thus the reduction in the classification accuracy of such a classifier. This paper proposes a Geometric Mean-based Boosting (GM-Boost) to resolve the problem of data imbalance. Since GM-Boost introduces the notion of geometric mean, it can perform learning process considering both majority and minority sides, and reinforce the learning on misclassified data. An empirical study with bankruptcy prediction on Korea companies shows that GM-Boost has the higher classification accuracy than previous methods including Under-sampling, Over-Sampling, and AdaBoost, used in imbalanced data and robust learning performance regardless of the degree of data imbalance.

  • PDF