• Title/Summary/Keyword: stochastic regression model

Search Result 68, Processing Time 0.02 seconds

The Temporal Disaggregation Model for Nonlinear Pan Evaporation Estimation (비선형 증발접시 증발량 산정을 위한 시간적 분해모형)

  • Kim, Sungwon;Kim, Jung-Hun;Park, Ki-Bum;Kim, Hung Soo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.4B
    • /
    • pp.399-412
    • /
    • 2010
  • The goal of this research is to apply the neural networks models for the temporal disaggregation of the yearly pan evaporation (PE) data, Republic of Korea. The neural networks models consist of multilayer perceptron neural networks model (MLP-NNM) and generalized regression neural networks model (GRNNM), respectively. And, for the performances evaluation of the neural networks models, they are composed of training and test performances, respectively. The three types of data such as the historic, the generated, and the mixed data are used for the training performance. The only historic data, however, is used for the testing performance. From this research, we evaluate the application of MLP-NNM and GRNNM for the temporal disaggregation of nonlinear time series data. We should, furthermore, construct the credible monthly PE data from the temporal disaggregation of the yearly PE data, and can suggest the available data for the evaluation of irrigation and drainage networks system.

A Dynamic Shortest Path Finding Model using Hierarchical Road Networks (도로 위계 구조를 고려한 동적 최적경로 탐색 기법개발)

  • Kim, Beom-Il;Lee, Seung-Jae
    • Journal of Korean Society of Transportation
    • /
    • v.23 no.6 s.84
    • /
    • pp.91-102
    • /
    • 2005
  • When it comes to the process of information storage, people are likely to organize individual information into the forms of groups rather than independent attributes, and put them together in their brains. Likewise, in case of finding the shortest path, this study suggests that a Hierarchical Road Network(HRN) model should be selected to browse the most desirable route, since the HRN model takes the process mentioned above into account. Moreover, most of drivers make a decision to select a route from origin to destination by road hierarchy. It says that the drivers feel difference between the link travel tine which was measured by driving and the theoretical link travel time. There is a different solution which has predicted the link travel time to solve this problem. By using this solution, the link travel time is predicted based on link conditions from time to time. The predicated link travel time is used to search the shortest path. Stochastic Process model uses the historical patterns of travel time conditions on links. The HRN model has compared favorably with the conventional shortest path finding model in tern of calculated speeds. Even more, the result of the shortest path using the HRN model has more similar to the survey results which was conducted to the taxi drivers. Taxi drivers have a strong knowledge of road conditions on the road networks and they are more likely to select a shortest path according to the real common sense.

Statistical Prediction for the Demand of Life Insurance Policy Loans (생명보험의 보험계약대출 수요에 대한통계적예측)

  • Lee, Woo-Joo;Park, Kyung-Ok;Kim, Hae-Kyung
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.5
    • /
    • pp.697-712
    • /
    • 2010
  • This paper is concerned with the statistical analysis and development of stochastic models for the demand for life insurance policy loans. For these, firstly the characteristics of the regression trend, periodicity and dependence of the monthly demand for life insurance policy loans are investigated by a statistical analysis of the monthly demand data for the years 1999 through 2008. Secondly, the causal relationships between the demand for life insurance policy loans and the economic variables including unemployment rate and inflation rate for the period are investigated. The results show that inflation rate is main factor influencing policy loan demands. The overall evidence, however, failed to establish unidirectional causality relationships between the demand series and the other variables under study. Finally, based on these, univariate time series model and transfer function model where the demand series is related to one input series are derived, respectively, for the prediction of the demand for life insurance policy loans. A statistical procedure for using the model to predict the demand for life insurance policy loans is also proposed.

Investigating the Association between Residual State Ownership and Privatized Firm Efficiency

  • NGUYEN, Manh Hoang;VO, Quy Thi
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.5
    • /
    • pp.225-236
    • /
    • 2020
  • This paper examines empirically the net impact of residual state ownership on privatized firm efficiency in the transitional context of Vietnam. Vietnamese privatization has its own characteristics. Instead of mass and full privatization, Vietnam has chosen a partial and gradual path. Thus, it is important to assess the net impact of residual state ownership on privatized firms during the post-privatization period. This study employs stochastic frontier analysis to investigate the association between residual state ownership and the efficiency of privatized firms, using a sample of all privatized firms that are listed on the Vietnamese stock exchanges over the period from 2007 to 2017. Also, two-stage least squares regression is incorporated into the model to deal with potential endogeneity issues. Our study provides evidence that state ownership should not be considered as a pure source of agency problems. Indeed, the net impact of residual state ownership on privatized firm efficiency is non-monotonic, and the relationship between residual state ownership and privatized firm efficiency is under an inverted U-shape. A moderate level (less than 50%) of residual state ownership might be beneficial to privatized firm efficiency whereas too much state ownership is detrimental to the efficiency of privatized firms.

Improvement of inspection system for common crossings by track side monitoring and prognostics

  • Sysyn, Mykola;Nabochenko, Olga;Kovalchuk, Vitalii;Gruen, Dimitri;Pentsak, Andriy
    • Structural Monitoring and Maintenance
    • /
    • v.6 no.3
    • /
    • pp.219-235
    • /
    • 2019
  • Scheduled inspections of common crossings are one of the main cost drivers of railway maintenance. Prognostics and health management (PHM) approach and modern monitoring means offer many possibilities in the optimization of inspections and maintenance. The present paper deals with data driven prognosis of the common crossing remaining useful life (RUL) that is based on an inertial monitoring system. The problem of scheduled inspections system for common crossings is outlined and analysed. The proposed analysis of inertial signals with the maximal overlap discrete wavelet packet transform (MODWPT) and Shannon entropy (SE) estimates enable to extract the spectral features. The relevant features for the acceleration components are selected with application of Lasso (Least absolute shrinkage and selection operator) regularization. The features are fused with time domain information about the longitudinal position of wheels impact and train velocities by multivariate regression. The fused structural health (SH) indicator has a significant correlation to the lifetime of crossing. The RUL prognosis is performed on the linear degradation stochastic model with recursive Bayesian update. Prognosis testing metrics show the promising results for common crossing inspection scheduling improvement.

Numerical studies on approximate option prices (근사적 옵션 가격의 수치적 비교)

  • Yoon, Jeongyoen;Seung, Jisu;Song, Seongjoo
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.2
    • /
    • pp.243-257
    • /
    • 2017
  • In this paper, we compare several methods to approximate option prices: Edgeworth expansion, A-type and C-type Gram-Charlier expansions, a method using normal inverse gaussian (NIG) distribution, and an asymptotic method using nonlinear regression. We used two different types of approximation. The first (called the RNM method) approximates the risk neutral probability density function of the log return of the underlying asset and computes the option price. The second (called the OPTIM method) finds the approximate option pricing formula and then estimates parameters to compute the option price. For simulation experiments, we generated underlying asset data from the Heston model and NIG model, a well-known stochastic volatility model and a well-known Levy model, respectively. We also applied the above approximating methods to the KOSPI200 call option price as a real data application. We then found that the OPTIM method shows better performance on average than the RNM method. Among the OPTIM, A-type Gram-Charlier expansion and the asymptotic method that uses nonlinear regression showed relatively better performance; in addition, among RNM, the method of using NIG distribution was relatively better than others.

Application of Text-Classification Based Machine Learning in Predicting Psychiatric Diagnosis (텍스트 분류 기반 기계학습의 정신과 진단 예측 적용)

  • Pak, Doohyun;Hwang, Mingyu;Lee, Minji;Woo, Sung-Il;Hahn, Sang-Woo;Lee, Yeon Jung;Hwang, Jaeuk
    • Korean Journal of Biological Psychiatry
    • /
    • v.27 no.1
    • /
    • pp.18-26
    • /
    • 2020
  • Objectives The aim was to find effective vectorization and classification models to predict a psychiatric diagnosis from text-based medical records. Methods Electronic medical records (n = 494) of present illness were collected retrospectively in inpatient admission notes with three diagnoses of major depressive disorder, type 1 bipolar disorder, and schizophrenia. Data were split into 400 training data and 94 independent validation data. Data were vectorized by two different models such as term frequency-inverse document frequency (TF-IDF) and Doc2vec. Machine learning models for classification including stochastic gradient descent, logistic regression, support vector classification, and deep learning (DL) were applied to predict three psychiatric diagnoses. Five-fold cross-validation was used to find an effective model. Metrics such as accuracy, precision, recall, and F1-score were measured for comparison between the models. Results Five-fold cross-validation in training data showed DL model with Doc2vec was the most effective model to predict the diagnosis (accuracy = 0.87, F1-score = 0.87). However, these metrics have been reduced in independent test data set with final working DL models (accuracy = 0.79, F1-score = 0.79), while the model of logistic regression and support vector machine with Doc2vec showed slightly better performance (accuracy = 0.80, F1-score = 0.80) than the DL models with Doc2vec and others with TF-IDF. Conclusions The current results suggest that the vectorization may have more impact on the performance of classification than the machine learning model. However, data set had a number of limitations including small sample size, imbalance among the category, and its generalizability. With this regard, the need for research with multi-sites and large samples is suggested to improve the machine learning models.

Bus stop passenger waiting simulation considering transfer passengers: A case study at Cheongju Intercity Bus Terminal (환승객을 고려한 버스 정류장 승객 대기 시뮬레이션: 청주 시외 버스 터미널 정류장 사례 연구)

  • Lee, Jongsung
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.4
    • /
    • pp.217-228
    • /
    • 2021
  • After the integrated fare system has been applied, public transportation and transfer traffic increased. As a result, transfer passengers must be considered in the operation of the bus. Although previous studies have limitations due to utilizing deterministic mathematical models, which fails to reflect the stochastic movements of passengers and buses, in this study, a more realistic bus stop micro-simulation model is proposed. Based on the proposed simulation model, we represent the relationship between bus arrival interval and passenger wait time as a regression model and empirically show the differences between the cases with and without transfer passengers. Also, we propose a method converting passenger waiting time to cost and find optimal bus arrival interval based on the converted cost. It is expected the proposed method enables bottom-up decision making reflecting practical situation.

Technical Inefficiency in Korea's Manufacturing Industries (한국(韓國) 제조업(製造業)의 기술적(技術的) 효율성(效率性) : 산업별(産業別) 기술적(技術的) 효율성(效率性)의 추정(推定))

  • Yoo, Seong-min;Lee, In-chan
    • KDI Journal of Economic Policy
    • /
    • v.12 no.2
    • /
    • pp.51-79
    • /
    • 1990
  • Research on technical efficiency, an important dimension of market performance, had received little attention until recently by most industrial organization empiricists, the reason being that traditional microeconomic theory simply assumed away any form of inefficiency in production. Recently, however, an increasing number of research efforts have been conducted to answer questions such as: To what extent do technical ineffciencies exist in the production activities of firms and plants? What are the factors accounting for the level of inefficiency found and those explaining the interindustry difference in technical inefficiency? Are there any significant international differences in the levels of technical efficiency and, if so, how can we reconcile these results with the observed pattern of international trade, etc? As the first in a series of studies on the technical efficiency of Korea's manufacturing industries, this paper attempts to answer some of these questions. Since the estimation of technical efficiency requires the use of plant-level data for each of the five-digit KSIC industries available from the Census of Manufactures, one may consture the findings of this paper as empirical evidence of technical efficiency in Korea's manufacturing industries at the most disaggregated level. We start by clarifying the relationship among the various concepts of efficiency-allocative effciency, factor-price efficiency, technical efficiency, Leibenstein's X-efficiency, and scale efficiency. It then becomes clear that unless certain ceteris paribus assumptions are satisfied, our estimates of technical inefficiency are in fact related to factor price inefficiency as well. The empirical model employed is, what is called, a stochastic frontier production function which divides the stochastic term into two different components-one with a symmetric distribution for pure white noise and the other for technical inefficiency with an asymmetric distribution. A translog production function is assumed for the functional relationship between inputs and output, and was estimated by the corrected ordinary least squares method. The second and third sample moments of the regression residuals are then used to yield estimates of four different types of measures for technical (in) efficiency. The entire range of manufacturing industries can be divided into two groups, depending on whether or not the distribution of estimated regression residuals allows a successful estimation of technical efficiency. The regression equation employing value added as the dependent variable gives a greater number of "successful" industries than the one using gross output. The correlation among estimates of the different measures of efficiency appears to be high, while the estimates of efficiency based on different regression equations seem almost uncorrelated. Thus, in the subsequent analysis of the determinants of interindustry variations in technical efficiency, the choice of the regression equation in the previous stage will affect the outcome significantly.

  • PDF

Development of Statistical Downscaling Model Using Nonstationary Markov Chain (비정상성 Markov Chain Model을 이용한 통계학적 Downscaling 기법 개발)

  • Kwon, Hyun-Han;Kim, Byung-Sik
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.3
    • /
    • pp.213-225
    • /
    • 2009
  • A stationary Markov chain model is a stochastic process with the Markov property. Having the Markov property means that, given the present state, future states are independent of the past states. The Markov chain model has been widely used for water resources design as a main tool. A main assumption of the stationary Markov model is that statistical properties remain the same for all times. Hence, the stationary Markov chain model basically can not consider the changes of mean or variance. In this regard, a primary objective of this study is to develop a model which is able to make use of exogenous variables. The regression based link functions are employed to dynamically update model parameters given the exogenous variables, and the model parameters are estimated by canonical correlation analysis. The proposed model is applied to daily rainfall series at Seoul station having 46 years data from 1961 to 2006. The model shows a capability to reproduce daily and seasonal characteristics simultaneously. Therefore, the proposed model can be used as a short or mid-term prediction tool if elaborate GCM forecasts are used as a predictor. Also, the nonstationary Markov chain model can be applied to climate change studies if GCM based climate change scenarios are provided as inputs.