• Title/Summary/Keyword: 일반화선형모형

Search Result 148, Processing Time 0.027 seconds

Prediction of movie audience numbers using hybrid model combining GLS and Bass models (GLS와 Bass 모형을 결합한 하이브리드 모형을 이용한 영화 관객 수 예측)

  • Kim, Bokyung;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.4
    • /
    • pp.447-461
    • /
    • 2018
  • Domestic film industry sales are increasing every year. Theaters are the primary sales channels for movies and the number of audiences using the theater affects additional selling rights. Therefore, the number of audiences using the theater is an important factor directly linked to movie industry sales. In this paper we consider a hybrid model that combines a multiple linear regression model and the Bass model to predict the audience numbers for a specific day. By combining the two models, the predictive value of the regression analysis was corrected to that of the Bass model. In the analysis, three films with different release dates were used. All subset regression method is used to generate all possible combinations and 5-fold cross validation to estimate the model 5 times. In this case, the predicted value is obtained from the model with the smallest root mean square error and then combined with the predicted value of the Bass model to obtain the final predicted value. With the existence of past data, it was confirmed that the weight of the Bass model increases and the compensation is added to the predicted value.

A Stochastic Transit Assignment Model for Intercity Rail Network (지역간 철도의 확률적 통행배정모형 구측 연구)

  • Kwon, Yong-Seok;Kim, Kyoung-Tae;Lim, Chong-Hoon
    • Journal of the Korean Society for Railway
    • /
    • v.12 no.4
    • /
    • pp.488-498
    • /
    • 2009
  • The characteristics of intercity rail network are different from those of public transit network in urban area. In this paper, we proposed a new transit assignment model which is generalized form of deterministic assignment model by introducing line selection probability on route section. This model consider various characteristics of intercity rail and simplify network expansion for appling search algorithms developed in road assignment model. We showed the model availability by comparing with existing models using virtual networks. The tests on a small scale network show that this model is superior to existing models for predicting intercity rail demand.

Predicting claim size in the auto insurance with relative error: a panel data approach (상대오차예측을 이용한 자동차 보험의 손해액 예측: 패널자료를 이용한 연구)

  • Park, Heungsun
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.5
    • /
    • pp.697-710
    • /
    • 2021
  • Relative error prediction is preferred over ordinary prediction methods when relative/percentile errors are regarded as important, especially in econometrics, software engineering and government official statistics. The relative error prediction techniques have been developed in linear/nonlinear regression, nonparametric regression using kernel regression smoother, and stationary time series models. However, random effect models have not been used in relative error prediction. The purpose of this article is to extend relative error prediction to some of generalized linear mixed model (GLMM) with panel data, which is the random effect models based on gamma, lognormal, or inverse gaussian distribution. For better understanding, the real auto insurance data is used to predict the claim size, and the best predictor and the best relative error predictor are comparatively illustrated.

OD trip matrix estimation from urban link traffic counts (comparison with GA and SAB algorithm) (링크관측교통량을 이용한 도시부 OD 통행행렬 추정 (GA와 SAB 알고리즘의 비교를 중심으로))

  • 백승걸;김현명;임용택;임강원
    • Journal of Korean Society of Transportation
    • /
    • v.18 no.6
    • /
    • pp.89-99
    • /
    • 2000
  • To cope with the limits of conventional O-D trip matrix collecting methods, several approaches have been developed. One of them is bilevel Programming method Proposed by Yang(1995), which uses Sensitivity Analysis Based(SAB) algorithm to solve Generalized Least Square(GLS) problem. However, the SAB a1gorithm has revealed two critical short-comings. The first is that when there exists a significant difference between target O-D matrix and true O-D matrix, SAB algorithm may not produce correct solution. This stems from the heavy dependance on the historical O-D information, in special when gravel Patterns are dramatically changed. The second is the assumption of iterative linear approximation to original Problem. Because of the approximation, SAB algorithm has difficulty in converging to Perfect Stackelberg game condition. So as to avoid the Problems. we need a more robust and stable solution method. The main purpose of this Paper is to show the problem of the dependency of Previous models and to Propose an alternative solution method to handle it. The Problem of O-D matrix estimation is intrinsically nonlinear and nonconvex. thus it has multiple solutions. Therefore it is necessary to require a method for searching globa1 solution. In this paper, we develop a solution algorithm combined with genetic algorithm(GA) , which is widely used as probabilistic global searching method To compare the efficiency of the algorithm, SAB algorithm suggested by Yang et al. (1992,1995) is used. From the results of numerical example, the Proposed algorithm is superior to SAB algorithm irrespective of travel patterns.

  • PDF

A Study for the Drivers of Movie Box-office Performance (영화흥행 영향요인 선택에 관한 연구)

  • Kim, Yon Hyong;Hong, Jeong Han
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.441-452
    • /
    • 2013
  • This study analyzed the relationship between key film and a box office record success factors based on movies released in the first quarter of 2013 in Korea. An over-fitting problem can happen if there are too many explanatory variables inserted to regression model; in addition, there is a risk that the estimator is instable when there is multi-collinearity among the explanatory variables. For this reason, optimal variable selection based on high explanatory variables in box-office performance is of importance. Among the numerous ways to select variables, LASSO estimation applied by a generalized linear model has the smallest prediction error that can efficiently and quickly find variables with the highest explanatory power to box-office performance in order.

The Prediction of Currency Crises through Artificial Neural Network (인공신경망을 이용한 경제 위기 예측)

  • Lee, Hyoung Yong;Park, Jung Min
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.4
    • /
    • pp.19-43
    • /
    • 2016
  • This study examines the causes of the Asian exchange rate crisis and compares it to the European Monetary System crisis. In 1997, emerging countries in Asia experienced financial crises. Previously in 1992, currencies in the European Monetary System had undergone the same experience. This was followed by Mexico in 1994. The objective of this paper lies in the generation of useful insights from these crises. This research presents a comparison of South Korea, United Kingdom and Mexico, and then compares three different models for prediction. Previous studies of economic crisis focused largely on the manual construction of causal models using linear techniques. However, the weakness of such models stems from the prevalence of nonlinear factors in reality. This paper uses a structural equation model to analyze the causes, followed by a neural network model to circumvent the linear model's weaknesses. The models are examined in the context of predicting exchange rates In this paper, data were quarterly ones, and Consumer Price Index, Gross Domestic Product, Interest Rate, Stock Index, Current Account, Foreign Reserves were independent variables for the prediction. However, time periods of each country's data are different. Lisrel is an emerging method and as such requires a fresh approach to financial crisis prediction model design, along with the flexibility to accommodate unexpected change. This paper indicates the neural network model has the greater prediction performance in Korea, Mexico, and United Kingdom. However, in Korea, the multiple regression shows the better performance. In Mexico, the multiple regression is almost indifferent to the Lisrel. Although Lisrel doesn't show the significant performance, the refined model is expected to show the better result. The structural model in this paper should contain the psychological factor and other invisible areas in the future work. The reason of the low hit ratio is that the alternative model in this paper uses only the financial market data. Thus, we cannot consider the other important part. Korea's hit ratio is lower than that of United Kingdom. So, there must be the other construct that affects the financial market. So does Mexico. However, the United Kingdom's financial market is more influenced and explained by the financial factors than Korea and Mexico.

A Numerical Analysis of the Shallow Water Equations Using the HLLL Approximate Riemann Solver (HLLL 근사 Riemann 해법을 이용한 천수방정식의 수치해석)

  • Hwang, Seung-Yong;Lee, Sam-Hee
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2011.05a
    • /
    • pp.148-148
    • /
    • 2011
  • Riemann 문제는 천수방정식과 같은 쌍곡선형 방정식과 단일한 도약에 의해 불연속인 어떤 점의 좌 우에서 상수인 자료로 구성되는 초기치 문제로서 그 해법은 Godunov 방법과 같이 정확해에 의하면 정확 Riemann 해법, 근사 기법에 의하면 근사 Riemann 해법으로 불린다. 지금까지 이용되는 근사 Riemann 해법으로는 1981년에 P. L. Roe가 제안한 Roe의 선형화 기법과 1983년에 A. Harten, P. D. Lax, 그리고 B. van Leer가 제안한 HLL 기법의 수정 기법들이다. 최대 및 최소 파속만 고려하는 것으로 알려진 HLL 기법은 1988년에 B. Einfeldt의 제안에 의해 두 파속의 결정에서 Roe의 선형화 기법에 따른 고유치와 비교하는 것으로 수정되었다(HLLE 기법). 또한, 1994년에 E. F. Toro 등은 접촉파를 고려하기 위해 선형화된 지배방정식의 정확해로부터 중앙 파속을 고려하는 기법을 제안하였고, 이를 HLLC 기법으로 불렀다. 2002년에 T. Linde는 중앙 파속을 평가하기 위해 일반화된(수학적) 엔트로피 함수를 도입하였으며, van Leer는 이를 HLLL 기법으로 불렀다. 이 기법에서는 접촉파의 평가를 위해 보존변수에 대한 일반화된 엔트로피 함수로부터 중앙 파속이 유도되며, 이것과 특성 속도의 비교를 통해 최대 및 최소 파속이 결정된다. 따라서 이 기법에서는 모든 파속이 초기치로부터 결정되므로 HLLE 기법과 달리 Roe의 선형화 기법과 완전히 결별되고 HLLC 기법과 달리 정확해에 의존되지 않는 점에서 HLLL 기법은 모태인 HLL 기법의 온전한 계승으로 볼 수 있다. HLLL 기법은 여러 분야에 적용된 바 있으나, 수공학 분야에 적용된 사례는 알려진 바 없다. 이는 천수방정식에 대한 (물리적) 엔트로피 함수가 명확하지 않기 때문인 것으로 보인다. 이 연구에서는 보존변수로부터 정의되는 총 에너지를 일반화된 엔트로피 함수로 간주하여 모형을 구성하고, 정확해가 알려진 1차원 문제에 대해 적용성을 검토하였다. 정확해가 알려진 경우에 대해 모의한 결과, 1차 정도 수치해의 한계에도 불구하고, HLLL 기법의 결과는 대체로 정확해와 잘 일치하였으며 그 외의 HLL-형 기법의 그것에 비해 우수한 것으로 나타났다. 특히, 물이 빠져 바닥이 드러나는 상태에 대한 접촉 파속의 추정에서 Riemann 불변량을 이용하는 HLLC 기법에 비해 물이 빠지는 전선을 더 정확하게 포착하는 HLLL 기법의 결과는 매우 고무적이었다.

  • PDF

Estimation of Asymmetric Bell Shaped Probability Curve using Logistic Regression (로지스틱 회귀모형을 이용한 비대칭 종형 확률곡선의 추정)

  • 박성현;김기호;이소형
    • The Korean Journal of Applied Statistics
    • /
    • v.14 no.1
    • /
    • pp.71-80
    • /
    • 2001
  • Logistic regression model is one of the most popular linear models for a binary response variable and used for the estimation of probability function. In many practical situations, the probability function can be expressed by a bell shaped curve and such a function can be estimated by a second order logistic regression model. However, when the probability curve is asymmetric, the estimation results using a second order logistic regression model may not be precise because a second order logistic regression model is a symmetric function. In addition, even if a second order logistic regression model is used, the interpretation for the effect of second order term may not be easy. In this paper, in order to alleviate such problems, an estimation method for asymmetric probabiity curve based on a first order logistic regression model and iterative bi-section method is proposed and its performance is compared with that of a second order logistic regression model by a simulation study.

  • PDF

Analysis of scientific military training data using zero-inflated and Hurdle regression (영과잉 및 허들 회귀모형을 이용한 과학화 전투훈련 자료 분석)

  • Kim, Jaeoh;Bang, Sungwan;Kwon, Ojeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.6
    • /
    • pp.1511-1520
    • /
    • 2017
  • The purpose of this study is to analyze military combat training data to improve military operation and training methods and verify required military doctrine. We set the number of combat disabled enemies, which the individual combatants make using their weapons, as the response variable regarding offensive operations from scientific military training data of reinforced infantry battalion. Our response variable has more zero observations than would be allowed for by the traditional GLM such as Poisson regression. We used the zero-inflated regression and the hurdle regression for data analysis considering the over-dispersion and excessive zero observation problems. Our result can be utilized as an appropriate reference in order to verify a military doctrine for small units and analysis of various operational and tactical factors.