• Title/Summary/Keyword: Biased Data

Search Result 350, Processing Time 0.025 seconds

IMAGE DATA CHAIN ANALYSIS FOR SATELLITE CAMERA ELECTRONIC SYSTEM

  • Park, Jong-Euk;Kong, Jong-Pil;Heo, Haeng-Pal;Kim, Young-Sun;Chang, Young-Jun
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.791-793
    • /
    • 2006
  • In the satellite camera, the incoming light source is converted to electronic analog signals by the electronic component for example CCD (Charge Coupled Device) detectors. The analog signals are amplified, biased and converted into digital signals (pixel data stream) in the video processor (A/Ds). The outputs of the A/Ds are digitally multiplexed and driven out using differential line drivers (two pairs of wires) for cross strap requirement. The MSC (Multi-Spectral Camera) in the KOMPSAT-2 which is a LEO spacecraft will be used to generate observation imagery data in two main channels. The MSC is to obtain data for high-resolution images by converting incoming light from the earth into digital stream of pixel data. The video data outputs are then MUXd, converted to 8 bit bytes, serialized and transmitted to the NUC (Non-Uniformity Correction) module by the Hotlink data transmitter. In this paper, the video data streams, the video data format, and the image data processing routine for satellite camera are described in terms of satellite camera control hardware. The advanced satellite with very high resolution requires faster and more complex image data chain than this algorithm. So, the effective change of the used image data chain and the fast video data transmission method are discussed in this paper

  • PDF

Analysis of Recurrent Gap Time Data with a Binary Time-Varying Covariate

  • Kim, Yang-Jin
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.5
    • /
    • pp.387-393
    • /
    • 2014
  • Recurrent gap times are analyzed with diverse methods under several assumptions such as a marginal model or a frailty model. Several resampling techniques have been recently suggested to estimate the covariate effect; however, these approaches can be applied with a time-fixed covariate. According to simulation results, these methods result in biased estimates for a time-varying covariate which is often observed in a longitudinal study. In this paper, we extend a resampling method by incorporating new weights and sampling scheme. Simulation studies are performed to compare the suggested method with previous resampling methods. The proposed method is applied to estimate the effect of an educational program on traffic conviction data where a program participation occurs in the middle of the study.

M-quantile kernel regression for small area estimation (소지역 추정을 위한 M-분위수 커널회귀)

  • Shim, Joo-Yong;Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.749-756
    • /
    • 2012
  • An approach widely used for small area estimation is based on linear mixed models. However, when the functional form of the relationship between the response and the input variables is not linear, it may lead to biased estimators of the small area parameters. In this paper we propose M-quantile kernel regression for small area mean estimation allowing nonlinearities in the relationship between the response and the input variables. Numerical studies are presented that show the sample properties of the proposed estimation method.

Determination of Research Octane Number using NIR Spectral Data and Ridge Regression

  • Jeong, Ho Il;Lee, Hye Seon;Jeon, Ji Hyeok
    • Bulletin of the Korean Chemical Society
    • /
    • v.22 no.1
    • /
    • pp.37-42
    • /
    • 2001
  • Ridge regression is compared with multiple linear regression (MLR) for determination of Research Octane Number (RON) when the baseline and signal-to-noise ratio are varied. MLR analysis of near-infrared (NIR) spectroscopic data usually encounters a collinearity problem, which adversely affects long-term prediction performance. The collinearity problem can be eliminated or greatly improved by using ridge regression, which is a biased estimation method. To evaluate the robustness of each calibration, the calibration models developed by both calibration methods were used to predict RONs of gasoline spectra in which the baseline and signal-to-noise ratio were varied. The prediction results of a ridge calibration model showed more stable prediction performance as compared to that of MLR, especially when the spectral baselines were varied. . In conclusion, ridge regression is shown to be a viable method for calibration of RON with the NIR data when only a few wavelengths are available such as hand-carry device using a few diodes.

Revisiting a Gravity Model of Immigration: A Panel Data Analysis of Economic Determinants

  • Kim, Kyunghun
    • East Asian Economic Review
    • /
    • v.26 no.2
    • /
    • pp.143-169
    • /
    • 2022
  • This study investigates the effect of economic factors on immigration using the gravity model of immigration. Cross-sectional regression and panel data analyses are conducted from 2000 to 2019 using the OECD International Migration Database, which consists of 36 destination countries and 201 countries of origin. The Poisson pseudo-maximum-likelihood method, which can effectively correct potential biased estimates caused by zeros in the immigration data, is used for estimation. The results indicate that the economic factors strengthened after the global financial crisis. Additionally, this effect varies depending on the type of immigration (the income level of origin country). The gravity model applied to immigration performs reasonably well, but it is necessary to consider the country-specific and time-varying characteristics.

The Effect of Bias in Data Set for Conceptual Clustering Algorithms

  • Lee, Gye Sung
    • International journal of advanced smart convergence
    • /
    • v.8 no.3
    • /
    • pp.46-53
    • /
    • 2019
  • When a partitioned structure is derived from a data set using a clustering algorithm, it is not unusual to have a different set of outcomes when it runs with a different order of data. This problem is known as the order bias problem. Many algorithms in machine learning fields try to achieve optimized result from available training and test data. Optimization is determined by an evaluation function which has also a tendency toward a certain goal. It is inevitable to have a tendency in the evaluation function both for efficiency and for consistency in the result. But its preference for a specific goal in the evaluation function may sometimes lead to unfavorable consequences in the final result of the clustering. To overcome this bias problems, the first clustering process proceeds to construct an initial partition. The initial partition is expected to imply the possible range in the number of final clusters. We apply the data centric sorting to the data objects in the clusters of the partition to rearrange them in a new order. The same clustering procedure is reapplied to the newly arranged data set to build a new partition. We have developed an algorithm that reduces bias effect resulting from how data is fed into the algorithm. Experiment results have been presented to show that the algorithm helps minimize the order bias effects. We have also shown that the current evaluation measure used for the clustering algorithm is biased toward favoring a smaller number of clusters and a larger size of clusters as a result.

Error cause analysis of Pearson test statistics for k-population homogeneity test (k-모집단 동질성검정에서 피어슨검정의 오차성분 분석에 관한 연구)

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.4
    • /
    • pp.815-824
    • /
    • 2013
  • Traditional Pearson chi-squared test is not appropriate for the data collected by the complex sample design. When one uses the traditional Pearson chi-squared test to the complex sample categorical data, it may give wrong test results, and the error may occur not only due to the biased variance estimators but also due to the biased point estimators of cell proportions. In this study, the design based consistent Wald test statistics was derived for k-population homogeneity test, and the traditional Pearson chi-squared test statistics was partitioned into three parts according to the causes of error; the error due to the bias of variance estimator, the error due to the bias of cell proportion estimator, and the unseparated error due to the both bias of variance estimator and bias of cell proportion estimator. An analysis was conducted for empirical results of the relative size of each error component to the Pearson chi-squared test statistics. The second year data from the fourth Korean national health and nutrition examination survey (KNHANES, IV-2) was used for the analysis. The empirical results show that the relative size of error from the bias of variance estimator was relatively larger than the size of error from the bias of cell proportion estimator, but its degrees were different variable by variable.

An Analysis on Efficiency for the Environmental Friendly Agricultural Product of Strawberry in GyeongBuk Province (경북지역 친환경딸기 농가의 인증유형에 따른 효율성 분석)

  • Lee, Sang-Ho;Song, Kyung-Hwan
    • Korean Journal of Organic Agriculture
    • /
    • v.21 no.4
    • /
    • pp.487-500
    • /
    • 2013
  • The purpose of this study is to estimate efficiency of environmental-friendly agricultural product by using Data Envelopment Analysis. A proposed method employs a bootstrapping approach to generating efficiency estimates through Monte Carlo simulation resampling process. The technical efficiency, pure technical efficiency, and scale efficiency measure of strawberry by pesticide-free certification is 0.967, 0.995, 0.968 respectively. However those of bias-corrected estimates are 0.918, 0.983, 0.934. We know that the DEA estimator is an upward biased estimator. In technical efficiency, average lower and upper confidence bounds of 0.807 and 0.960. According to these results, the DEA bootstrapping model used here provides bias-corrected and confidence intervals for the point estimates, it is more preferable.

A Study on the Future Direction of the Digital Signage Industry in Korea: A Big Data Network Analysis from 2008 to 2019

  • Yoo, Seung-Chul;Piscarac, Diana
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.1
    • /
    • pp.120-127
    • /
    • 2020
  • The use of digital signage in the public and commercial communication areas has been increasing in recent years. By integrating cutting-edge information technologies such as 5G, artificial intelligence, and the Internet of Things, digital signage continues to break apart from traditional outdoor advertising media. This study identified the problems facing the domestic digital signage industry by exploring and analyzing major issues related to digital signage and derived future development measures. Specifically, online documents were collected based on the digital signage-related keywords created over the past 12 years to conduct big data network analysis, and key topics were derived through visualization of the results. This study has great policy implications in that it excluded biased interpretations based on the viewpoints of companies or the government and, more objectively, suggested the direction of the digital signage industry's development in the domestic media market.

The Natural Killer Cell Response to HCV Infection

  • Ahlenstiel, Golo
    • IMMUNE NETWORK
    • /
    • v.13 no.5
    • /
    • pp.168-176
    • /
    • 2013
  • In the last few years major progress has been made in better understanding the role of natural killer (NK) cells in hepatitis C virus (HCV) infection. This includes multiple pathways by which HCV impairs or limits NK cells activation. Based on current genetic and functional data, a picture is emerging where only a rapid and strong NK cell response early on during infection which results in strong T cell responses and possible subsequent clearance, whereas chronic HCV infection is associated with dysfunctional or biased NK cells phenotypes. The hallmark of this NK cell dysfunction is persistent activation promoting ongoing hepatitis and hepatocyte damage, while being unable to clear HCV due to impaired IFN-${\gamma}$ responses. Furthermore, some data suggests certain chronically activated subsets that are $NKp46^{high}$ may be particularly active against hepatic stellate cells, a key player in hepatic fibrogenesis. Finally, the role of NK cells during HCV therapy, HCV recurrence after liver transplant and hepatocellular carcinoma are discussed.