• Title/Summary/Keyword: 다변량 모형

Search Result 267, Processing Time 0.034 seconds

Stochastic Generation Model Development for Optimum Reservoir Operation of Water Distribution System (저수지 최적운영모형을 위한 추계학적 모의 발생 모형의 유도)

  • Kim, Tae Geun;Yoon, Yong Nam;Kim, Joong Hoon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.14 no.4
    • /
    • pp.887-896
    • /
    • 1994
  • It is common practice in the case of optimum reservoir operation model that the reservoir inflow series are generated by stochastic model with keeping other variable such as water demands from the reservoir constant. However, when the input and output of the water distribution system have close relationship the output variables can be stochastically generated in relation with the input variables. In the present study the reservoir inflow series, the input of the system, is generated by periodic autoregressive model with constant parameter, and the agricultural water demand series, the output, is generated using periodic multivariate autoregressive model with constant parameter. The time period of the data series generated is taken as 10-day which is the common period used for agricultural water uses. The results of data generation by two different models showed that the periodic stochastic models well represent the characteristics of the historical time series, and that in the case of generating model for agricultural demand series it has closer relation with reservoir inflow than with the series itself.

  • PDF

Multivariate Analysis for Clinicians (임상의를 위한 다변량 분석의 실제)

  • Oh, Joo Han;Chung, Seok Won
    • Clinics in Shoulder and Elbow
    • /
    • v.16 no.1
    • /
    • pp.63-72
    • /
    • 2013
  • In medical research, multivariate analysis, especially multiple regression analysis, is used to analyze the influence of multiple variables on the result. Multiple regression analysis should include variables in the model and the problem of multi-collinearity as there are many variables as well as the basic assumption of regression analysis. The multiple regression model is expressed as the coefficient of determination, $R^2$ and the influence of independent variables on result as a regression coefficient, ${\beta}$. Multiple regression analysis can be divided into multiple linear regression analysis, multiple logistic regression analysis, and Cox regression analysis according to the type of dependent variables (continuous variable, categorical variable (binary logit), and state variable, respectively), and the influence of variables on the result is evaluated by regression coefficient${\beta}$, odds ratio, and hazard ratio, respectively. The knowledge of multivariate analysis enables clinicians to analyze the result accurately and to design the further research efficiently.

Moments of the Bivariate Zero-Inflated Poisson Distributions (이변량 영과잉-포아송 분포의 적률)

  • Kim, Kyung-Moo;Lee, Sung-Ho;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.1
    • /
    • pp.47-56
    • /
    • 1998
  • Zero-Inflated Poisson models are mixed models of the Poisson and Bernoulli models. Recently Zero-Inflated Poisson distributions have been used frequently rather than previous Poisson distributions because the developement of industrial technology make few defects in manufacturing process. It is important that univariate Zero-Inflated Poisson distributions are extended to bivariate distributions to generalize the multivariate distributions. In this paper we proposed three types of the bivariate Zero-Inflated Poisson distributions and obtained these moments. We compared the three types of distributions by using the moments.

  • PDF

A mixed model for repeated split-plot data (반복측정의 분할구 자료에 대한 혼합모형)

  • Choi, Jae-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.1
    • /
    • pp.1-9
    • /
    • 2010
  • This paper suggests a mixed-effects model for analyzing split-plot data when there is a repeated measures factor that affects on the response variable. Covariance structures are discussed among the observations because of the assumption of a repeated measures factor as one of explanatory variables. As a plausible covariance structure, compound symmetric covariance structure is assumed for analyzing data. The restricted maximum likelihood (REML)method is used for estimating fixed effects in the model.

A Verification of the validity for Technology/Credit Appraisal Model (기술신용평가모형의 타당성 검증)

  • Kim, Jae-Beom;Jo, Yong-Gon;Jo, Geun-Tae
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 2005.05a
    • /
    • pp.1068-1071
    • /
    • 2005
  • 최근 들어 기술을 담보로 하는 신용금융의 역할이 증대되면서 자금지원 대상기업의 기술평가 시스템 구축이 중요한 과제가 되고 있다. 국내에서는 기업 보유의 기술경영성과를 측정하여 한정된 자원의 효율적 배분을 위한 민간 투, 융자를 위한 기술신용평가모형'이 제시되었다 본 연구에서는 기술신용평가모델의 평가항목 타당성을 실증 분석한다. 모형의 항목 분류가 적절하게 되었는지를 검증하기 위하여 구조적 타당성을 평가하며 통계적 유의성을 검증하여 신뢰성을 평가한다. 구조적 타당성 검정을 위해 확인 요인분석을 수행하며 평가모형의 신뢰성을 검증하기 위해서는 다변량 통계방법 중의 하나인 판별분석을 수행한다. 본 연구는 기술개발 성공 및 부실발생의 예측력을 갖는 기술신용평가 시스템 구축을 위한 기초 자료로 활용될 수 있을 것이다.

  • PDF

Selection of Input Nodes in Artificial Neural Network for Bankruptcy Prediction by Link Weight Analysis Approach (연결강도분석접근법에 의한 부도예측용 인공신경망 모형의 입력노드 선정에 관한 연구)

  • 이응규;손동우
    • Journal of Intelligence and Information Systems
    • /
    • v.7 no.2
    • /
    • pp.19-33
    • /
    • 2001
  • Link weight analysis approach is suggested as a heuristic for selection of input nodes in artificial neural network for bankruptcy prediction. That is to analyze each input node\\\\`s link weight-absolute value of link weight between an input node and a hidden node in a well-trained neural network model. Prediction accuracy of three methods in this approach, -weak-linked-neurons elimination method, strong-linked-neurons selection method and integrated link weight model-is compared with that of decision tree and multivariate discrimination analysis. In result, the methods suggested in this study show higher accuracy than decision tree and multivariate discrimination analysis. Especially an integrated model has much higher accuracy than any individual models.

  • PDF

Analysis on the Correction Factor of Emission Factors and Verification for Fuel Consumption Differences by Road Types and Time Using Real Driving Data (실 주행 자료를 이용한 도로유형·시간대별 연료소모량 차이 검증 및 배출계수 보정 지표 분석)

  • LEE, Kyu Jin;CHOI, Keechoo
    • Journal of Korean Society of Transportation
    • /
    • v.33 no.5
    • /
    • pp.449-460
    • /
    • 2015
  • The reliability of air quality evaluation results for green transportation could be improved by applying correct emission factors. Unlike previous studies, which estimated emission factors that focused on vehicles in laboratory experiments, this study investigates emission factors according to road types and time using real driving data. The real driving data was collected using a Portable Activity Monitoring System (PAMS) according to road types and time, which it compared and analyzed fuel consumption from collected data. The result of the study shows that fuel consumption on national highway is 17.33% higher than the fuel consumption on expressway. In addition, the average fuel consumption of peak time is 4.7% higher than that of non-peak time for 22.5km/h. The difference in fuel consumption for road types and time is verified using ANOCOVA and MANOVA. As a result, the hypothesis of this study - that fuel consumption differs according to road types and time, even if the travel speed is the same - has proved valid. It also suggests correction factor of emission factors by using the difference in fuel consumption. It is highly expected that this study can improve the reliability of emissions from mobile pollution sources.

Joint model of longitudinal data with informative observation time and competing risk (결시적 자료에서 관측 중단을 모형화하기 위해 사용되는 경쟁 위험의 적용과 결합 모형)

  • Kim, Yang-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.1
    • /
    • pp.113-122
    • /
    • 2016
  • Longitudinal data often occur in prospective follow-up studies. Joint model for longitudinal data and failure time has been applied on several works. In this paper, we extend it to the case where longitudinal data involve informative observation time process as well as competing risks survival times. We use a likelihood approach and derive an EM algorithm to obtain maximum likelihood estimate of parameters. A suggested joint model allows us to make inferences for three components: longitudinal outcome, observation time process and competing risk failure time. In addition, we can test the association among these components. In this paper, liver cirrhosis patients' data is analyzed. The relationship between prothrombin times measured at irregular visiting times and drop outs is investigated with a joint model.

Locally adaptive intelligent interpolation for population distribution modeling using pre-classified land cover data and geographically weighted regression (지표피복 데이터와 지리가중회귀모형을 이용한 인구분포 추정에 관한 연구)

  • Kim, Hwahwan
    • Journal of the Korean association of regional geographers
    • /
    • v.22 no.1
    • /
    • pp.251-266
    • /
    • 2016
  • Intelligent interpolation methods such as dasymetric mapping are considered to be the best way to disaggregate zone-based population data by observing and utilizing the internal variation within each source zone. This research reviews the advantages and problems of the dasymetric mapping method, and presents a geographically weighted regression (GWR) based method to take into consideration the spatial heterogeneity of population density - land cover relationship. The locally adaptive intelligent interpolation method is able to make use of readily available ancillary information in the public domain without the need for additional data processing. In the case study, we use the preclassified National Land Cover Dataset 2011 to test the performance of the proposed method (i.e. the GWR-based multi-class dasymetric method) compared to four other popular population estimation methods (i.e. areal weighting interpolation, pycnophylactic interpolation, binary dasymetric method, and globally fitted ordinary least squares (OLS) based multi-class dasymetric method). The GWR-based multi-class dasymetric method outperforms all other methods. It is attributed to the fact that spatial heterogeneity is accounted for in the process of determining density parameters for land cover classes.

  • PDF

A Study on the Node Split in Decision Tree with Multivariate Target Variables (다변량 목표변수를 갖는 의사결정나무의 노드분리에 관한 연구)

  • Kim, Seong-Jun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.386-390
    • /
    • 2003
  • Data mining is a process of discovering useful patterns for decision making from an amount of data. It has recently received much attention in a wide range of business and engineering fields. Classifying a group into subgroups is one of the most important subjects in data mining. Tree-based methods, known as decision trees, provide an efficient way to finding the classification model. The primary concern in tree learning is to minimize a node impurity, which is evaluated using a target variable in the data set. However, there are situations where multiple target variable should be taken into account, for example, such as manufacturing process monitoring, marketing science, and clinical and health analysis. The purpose of this article is to present some methods for measuring the node impurity, which are applicable to data sets with multivariate target variables. For illustration, a numerical cxample is given with discussion.