• Title/Summary/Keyword: Model Assumption

Search Result 1,728, Processing Time 0.028 seconds

Statistical micro matching using a multinomial logistic regression model for categorical data

  • Kim, Kangmin;Park, Mingue
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.5
    • /
    • pp.507-517
    • /
    • 2019
  • Statistical matching is a method of combining multiple sources of data that are extracted or surveyed from the same population. It can be used in situation when variables of interest are not jointly observed. It is a low-cost way to expect high-effects in terms of being able to create synthetic data using existing sources. In this paper, we propose the several statistical micro matching methods using a multinomial logistic regression model when all variables of interest are categorical or categorized ones, which is common in sample survey. Under conditional independence assumption (CIA), a mixed statistical matching method, which is useful when auxiliary information is not available, is proposed. We also propose a statistical matching method with auxiliary information that reduces the bias of the conventional matching methods suggested under CIA. Through a simulation study, proposed micro matching methods and conventional ones are compared. Simulation study shows that suggested matching methods outperform the existing ones especially when CIA does not hold.

A comparison of Multilayer Perceptron with Logistic Regression for the Risk Factor Analysis of Type 2 Diabetes Mellitus (제2형 당뇨병의 위험인자 분석을 위한 다층 퍼셉트론과 로지스틱 회귀 모델의 비교)

  • 서혜숙;최진욱;이홍규
    • Journal of Biomedical Engineering Research
    • /
    • v.22 no.4
    • /
    • pp.369-375
    • /
    • 2001
  • The statistical regression model is one of the most frequently used clinical analysis methods. It has basic assumption of linearity, additivity and normal distribution of data. However, most of biological data in medical field are nonlinear and unevenly distributed. To overcome the discrepancy between the basic assumption of statistical model and actual biological data, we propose a new analytical method based on artificial neural network. The newly developed multilayer perceptron(MLP) is trained with 120 data set (60 normal, 60 patient). On applying test data, it shows the discrimination power of 0.76. The diabetic risk factors were also identified from the MLP neural network model and the logistic regression model. The signigicant risk factors identified by MLP model were post prandial glucose level(PP2), sex(male), fasting blood sugar(FBS) level, age, SBP, AC and WHR. Those from the regression model are sex(male), PP2, age and FBS. The combined risk factors can be identified using the MLP model. Those are total cholesterol and body weight, which is consistent with the result of other clinical studies. From this experiment we have learned that MLP can be applied to the combined risk factor analysis of biological data which can not be provided by the conventional statistical method.

  • PDF

ESTIMATION OF SYSTEM RELIABLITY FOR REDUNDANT STRESS-STRENGTH MODEL

  • Choi, In-Kyeong
    • Journal of applied mathematics & informatics
    • /
    • v.5 no.2
    • /
    • pp.277-284
    • /
    • 1998
  • The reliability and an estimate for it are derived for series-parallel and parallel-deries stress-strength model under assumption that all components are subjected to a common stress. We also obtain the asymptotic normal distribution of the estimate.

A comparison study for accuracy of exit poll based on nonresponse model (무응답모형에 기반한 출구조사의 예측 정확성 비교 연구)

  • Kwak, Jeongae;Choi, Boseung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.53-64
    • /
    • 2014
  • One of the major problems to forecast election, especially based on survey, is nonresponse. We may have different forecasting results depend on method of imputation. Handling nonresponse is more important in a survey about sensitive subject, such as presidential election. In this research, we consider a model based method of nonresponse imputation. A model based imputation method should be constructed based on assumption of nonresponse mechanism and may produce different results according to the nonresponse mechanism. An assumption of the nonresponse mechanism is very important precondition to forecast the accurate results. However, there is no exact way to verify assumption of the nonresponse mechanism. In this paper, we compared the accuracy of prediction and assumption of nonresponse mechanism based on the result of presidential election exit poll. We consider maximum likelihood estimation method based on EM algorithm to handle assumption of the model of nonresponse. We also consider modified within precinct error which Bautista (2007) proposed to compare the predict result.

Statistical Design of VSS $\overline{A}$ Charts for Monitoring an AR(1) Process (AR(l) 공정을 탐지하는 VSS $\overline{A}$ 관리도의 통계적 설계)

  • 이재헌
    • Journal of Korean Society for Quality Management
    • /
    • v.31 no.3
    • /
    • pp.126-135
    • /
    • 2003
  • A basic assumption in standard applications of control charts is that the observations are statistically independent. However, this assumption is often violated from processes in many industries. The presence of autocorrelation has a serious impact on the performance of control charts, causing a dramatic increase in the frequency of false alarms. This paper considers a process in which the observations can be modeled as a first order autoregressive(AR(1)) process, and develops (equation omitted) charts with the variable sample size(VSS) scheme for monitoring the mean of this process.

Factors Affecting Acceptance and Use of E-Tax Services among Medium Taxpayers in Phnom Penh, Cambodia

  • ANN, Samnang;DAENGDEJ, Jirapun;VONGURAI, Rawin
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.8 no.7
    • /
    • pp.79-90
    • /
    • 2021
  • The purpose of this research is to identify factors affecting the acceptance and use of e-tax services among medium taxpayers in Phnom Penh, Cambodia. The researcher conducted the study based on a quantitative approach by using multi-stage sampling method, which selects a sample size by two or more stages. The first stage sampling was the stratified random sampling and the subsequent stage was purposive sampling. In this study, the stratified random sampling was first used, followed by purposive sampling. The data were collected from 450 medium taxpayers who experienced using e-tax services located in three tax branches in Phnom Penh. This study adapted the confirmatory factor analysis (CFA) and structural equation model (SEM) to analyze the model accuracy, reliability and influence of various variables. The primary result showed that behavioral intention has a significant effect on user behavior of e-tax services among medium taxpayers in Phnom Penh, Cambodia. Moreover, the results revealed that performance expectancy, effort expectancy, social influence, and anxiety have significant impact on behavioral intention. In addition, social influence has the strongest impact on behavioral intention, followed by anxiety, performance expectancy and effort expectancy. Conversely, facilitating conditions, trust in government, and trust in internet do not influence behavioral intention.

Reappraisal of Mean-Reversion of Stock Prices in the State-Space Model (상태공간모형에서 주가의 평균회귀현상에 대한 재평가)

  • Jeon, Deok-Bin;Choe, Won-Hyeok
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2006.11a
    • /
    • pp.173-179
    • /
    • 2006
  • In order to explain a U-shape pattern of stock returns, Fama and French(1988) suggested the state-space model consisting of I(1) permanent component and AR(1) stationary component. They concluded the autoregression coefficient induced from the state-space model follow the U-shape pattern and the U-shape pattern of stock returns was due to both negative autocorrelation in returns beyond a year and substantial mean-reversion in stock market prices. However, we found negative autocorrelation is induced under the assumption that permanent and stationary noise component are independent in the state-space model. In this paper, we derive the autoregression coefficient based on ARIMA process equivalent to the state-space model without the assumption of independency. Based on the estimated parameters, we investigate the pattern of the time-varying autoregression coefficient and conclude the autoregression coefficient from the state-space model of ARIMA(1,1,1) process does not follow a U-shape pattern, but has always positive sign. We applied this result on the data of 1 month retums for all NYSE stocks for the 1926-85 period from the Center for Research in Security Prices.

  • PDF

Development of Regression Models Resolving High-Dimensional Data and Multicollinearity Problem for Heavy Rain Damage Data (호우피해자료에서의 고차원 자료 및 다중공선성 문제를 해소한 회귀모형 개발)

  • Kim, Jeonghwan;Park, Jihyun;Choi, Changhyun;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.38 no.6
    • /
    • pp.801-808
    • /
    • 2018
  • The learning of the linear regression model is stable on the assumption that the sample size is sufficiently larger than the number of explanatory variables and there is no serious multicollinearity between explanatory variables. In this study, we investigated the difficulty of model learning when the assumption was violated by analyzing a real heavy rain damage data and we proposed to use a principal component regression model or a ridge regression model after integrating data to overcome the difficulty. We evaluated the predictive performance of the proposed models by using the test data independent from the training data, and confirmed that the proposed methods showed better predictive performances than the linear regression model.

Modern Capital Theory and Optimal Fisheries Management (현대자본이론과 최적어업관리)

  • 박장일
    • The Journal of Fisheries Business Administration
    • /
    • v.23 no.2
    • /
    • pp.53-67
    • /
    • 1992
  • It has been recognized, virtually from the time of its inception, that fisheries economics, like other aspects of resource economics, should ideally be cast in capital-theoretic terms. The fish population or biomass can be viewed as a capital stock in that, like conventional or man-made capital, it is capable of yielding a sustainable consumption flow through time. This study is to introduce the optimal control theory which was extended from the theory of calculus of variations into the study of former static theory of fisheries economics started by Gordon (1954). The optimal control theory eliminated the inadequacies of the classical techniques to a large extent. From this point of view, this study, on the base of Schaefer model, summerizes most of major results achieved so far, but does so in a manner such that the links with capital theory are made transparent. This study explores two sets of problems. The first concerns the optimal approach to the equilibrium stock, i.e. the optimal investment policy. The second set of problems arises from the relaxation of the highly restrictive assumption of autonomy (i.e. the assumption that the parameters are independent of time), then concludes the relaxation of linearity assumption together with the complexities caused by that.

  • PDF