• Title/Summary/Keyword: Regression estimators

Search Result 227, Processing Time 0.023 seconds

STATISTICALLY PREPROCESSED DATA BASED PARAMETRIC COST MODEL FOR BUILDING PROJECTS

  • Sae-Hyun Ji;Moonseo Park;Hyun-Soo Lee
    • International conference on construction engineering and project management
    • /
    • 2009.05a
    • /
    • pp.417-424
    • /
    • 2009
  • For a construction project to progress smoothly, effective cost estimation is vital, particularly in the conceptual and schematic design stages. In these early phases, despite the fact that initial estimates are highly sensitive to changes in project scope, owners require accurate forecasts which reflect their supplying information. Thus, cost estimators need effective estimation strategies. Practically, parametric cost estimates are the most commonly used method in these initial phases, which utilizes historical cost data (Karshenas 1984, Kirkham 2007). Hence, compilation of historical data regarding appropriate cost variance governing parameters is a prime requirement. However, precedent practice of data mining (data preprocessing) for denoising internal errors or abnormal values is needed before compilation. As an effort to deal with this issue, this research proposed a statistical methodology for data preprocessing and verified that data preprocessing has a positive impact on the enhancement of estimate accuracy and stability. Moreover, Statistically Preprocessed data Based Parametric (SPBP) cost models are developed based on multiple regression equations and verified their effectiveness compared with conventional cost models.

  • PDF

Penalized maximum likelihood estimation with symmetric log-concave errors and LASSO penalty

  • Seo-Young, Park;Sunyul, Kim;Byungtae, Seo
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.6
    • /
    • pp.641-653
    • /
    • 2022
  • Penalized least squares methods are important tools to simultaneously select variables and estimate parameters in linear regression. The penalized maximum likelihood can also be used for the same purpose assuming that the error distribution falls in a certain parametric family of distributions. However, the use of a certain parametric family can suffer a misspecification problem which undermines the estimation accuracy. To give sufficient flexibility to the error distribution, we propose to use the symmetric log-concave error distribution with LASSO penalty. A feasible algorithm to estimate both nonparametric and parametric components in the proposed model is provided. Some numerical studies are also presented showing that the proposed method produces more efficient estimators than some existing methods with similar variable selection performance.

An Analysis for the Structural Variation in the Unemployment Rate and the Test for the Turning Point (실업률 변동구조의 분석과 전환점 진단)

  • Kim, Tae-Ho;Hwang, Sung-Hye;Lee, Young-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.18 no.2
    • /
    • pp.253-269
    • /
    • 2005
  • One of the basic assumptions of the regression models is that the parameter vector does not vary across sample observations. If the parameter vector is not constant for all observations in the sample, the statistical model is changed and the usual least squares estimators do not yield unbiased, consistent and efficient estimates. This study investigates the regression model with some or all parameters vary across partitions of the whole sample data when the model permits different response coefficients during unusual time periods. Since the usual test for overall homogeneity of regressions across partitions of the sample data does not explicitly identify the break points between the partitions, the testing the equality between subsets of coefficients in two or more linear regressions is generalized and combined with the test procedure to search the break point. The method is applied to find the possibility and the turning point of the structural change in the long-run unemployment rate in the usual static framework by using the regression model. The relationships between the variables included in the model are reexamined in the dynamic framework by using Vector Autoregression.

Upgraded quadratic inference functions for longitudinal data with type II time-dependent covariates

  • Cho, Gyo-Young;Dashnyam, Oyunchimeg
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.211-218
    • /
    • 2014
  • Qu et. al. (2000) proposed the quadratic inference functions (QIF) method to marginal model analysis of longitudinal data to improve the generalized estimating equations (GEE). It yields a substantial improvement in efficiency for the estimators of regression parameters when the working correlation is misspecified. But for the longitudinal data with time-dependent covariates, when the implicit full covariates conditional mean (FCCM) assumption is violated, the QIF can not provide more consistent and efficient estimator than GEE (Cho and Dashnyam, 2013). Lai and Small (2007) divided time-dependent covariates into three types and proposed generalized method of moment (GMM) for longitudinal data with time-dependent covariates. They showed that their GMM type II and GMM moment selection methods can be more ecient than GEE with independence working correlation (GEE-ind) in the case of type II time-dependent covariates. We develop upgraded QIF method for type II time-dependent covariates. We show that this upgraded QIF method can provide substantial gains in efficiency over QIF and GEE-ind in the case of type II time-dependent covariates.

Shrinkage Small Area Estimation Using a Semiparametric Mixed Model (준모수혼합모형을 이용한 축소소지역추정)

  • Jeong, Seok-Oh;Choo, Manho;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.4
    • /
    • pp.605-617
    • /
    • 2014
  • Small area estimation is a statistical inference method to overcome large variance due to a small sample size allocated in a small area. A shrinkage estimator obtained by minimizing relative error(RE) instead of MSE has been suggested. The estimator takes advantage of good interpretation when the data range is large. A semiparametric estimator is also studied for small area estimation. In this study, we suggest a semiparametric shrinkage small area estimator and compare small area estimators using labor statistics.

Empirical Study of the Long-Term Memory Effect of the KOSPI200 Earning rate volatility (KOSPI200 수익률 변동성의 장기기억과정탐색)

  • Choi, Sang-Kyu
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.15 no.12
    • /
    • pp.7018-7024
    • /
    • 2014
  • This study examined the squared returns and absolute returns of KOSPI 200 with GPH (Geweke and Porter-Hudak, 1983) estimators. GPH was estimated by the long-term memory preserving time series parameter d in linear regression. This called the GPH estimator, which depends on a bandwidth m. m was decided by confirming the stable section of the point estimate by validating the track of the GPH estimator according to the value of m. The result suggests that by satisfying 0< d <0.5, the squared returns and absolute returns of KOPI 200 retains long-term memory.

Preliminary test estimation method accounting for error variance structure in nonlinear regression models (비선형 회귀모형에서 오차의 분산에 따른 예비검정 추정방법)

  • Yu, Hyewon;Lim, Changwon
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.4
    • /
    • pp.595-611
    • /
    • 2016
  • We use nonlinear regression models (such as the Hill Model) when we analyze data in toxicology and/or pharmacology. In nonlinear regression models an estimator of parameters and estimation of measurement about uncertainty of the estimator are influenced by the variance structure of the error. Thus, estimation methods should be different depending on whether the data are homoscedastic or heteroscedastic. However, we do not know the variance structure of the error until we actually analyze the data. Therefore, developing estimation methods robust to the variance structure of the error is an important problem. In this paper we propose a method to estimate parameters in nonlinear regression models based on a preliminary test. We define an estimator which uses either the ordinary least square estimation method or the iterative weighted least square estimation method according to the results of a simple preliminary test for the equality of the error variance. The performance of the proposed estimator is compared to those of existing estimators by simulation studies. We also compare estimation methods using real data obtained from the National Toxicology program of the United States.

Estimating the Economic Value of the Songieong Beach Using A Count Data Model: - Off-season Estimating Value of the Beach - (가산자료모형을 이용한 송정 해수욕장의 경제적 가치추정: - 비수기 해수욕장의 가치추정 -)

  • Heo, Yun-Jeong;Lee, Seung-Lae
    • The Journal of Fisheries Business Administration
    • /
    • v.38 no.2
    • /
    • pp.79-101
    • /
    • 2007
  • The purpose of this study is to estimate the economic value of the Songieong Beach in Off-season, using a Individual Travel Cost Model(ITCM). Songieong Beach is located in Busan but far away from city. These days, however, the increased rate of traffic inflow to the Songieong beach and the five-day working week are reflected in the trend analysis. Moreover, people have changed psychological value. For that reason, visitors are on the increase on the beach in off-season. The ITCM is applied to estimate non-market value or environmental Good like a Contingent Valuation Method and Hedonic Price Model etc. The ITCM was derived from the Count Data Model(i.e. Poisson and Negative Binomial model). So this paper compares Poisson and negative binomial count data models to measure the tourism demands. The data for the study were collected from the Songjeong Beach on visitors over the a week from November 1 through November 23, 2006. Interviewers were instructed to interview only individuals. So the sample was taken in 113. A dependent variable that is defined on the non-negative integers and subject to sampling truncation is the result of a truncated count data process. This paper analyzes the effects of determinants on visitors' demand for exhibition using a class of maximum-likelihood regression estimators for count data from truncated samples, The count data and truncated models are used primarily to explain non-negative integer and truncation properties of tourist trips as suggested by the economic valuation literature. The results suggest that the truncated negative binomial model is improved overdispersion problem and more preferred than the other models in the study. This paper is not the same as the others. One thing is that Estimating Value of the Beach in off-season. The other thing is this study emphasizes in particular 'travel cost' that is not only monetary cost but also including opportunity cost of 'travel time'. According to the truncated negative binomial model, estimates the Consumer Surplus(CS) values per trip of about 199,754 Korean won and the total economic value was estimated to be 1,288,680 Korean won.

  • PDF

A bias adjusted ratio-type estimator (편향 보정 비형태추정량에 관한 연구)

  • Oh, Jung-Taek;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.3
    • /
    • pp.397-408
    • /
    • 2018
  • Various methods for accurate parameter estimation have been developed in a sample survey and it is also common to use a ratio estimator or the regression estimator using auxiliary information. The ratio-type estimator has been used in many recent studies and is known to improve the accuracy of estimation by adjusting the ratio estimator. However, various studies are under way to solve it since the ratio-type estimator is biased. In this study, we propose a generalized ratio-type estimator with a new parameter added to the ratio-type estimator to remove the bias. We suggested a method to apply this result to the parameter estimation under the error assumption of heteroscedasticity. Through simulation, we confirmed that the suggested generalized ratio-type estimator gives good results compared to conventional ratio-type estimators.

A Study on Consumer Sensibility of Adult Women's Town Wear (성인여성의 옷차림에 나타난 소비감성에 관한 연구)

  • Lee, Eun-Rung;Lee, Kyoung-Hee
    • Fashion & Textile Research Journal
    • /
    • v.10 no.1
    • /
    • pp.11-21
    • /
    • 2008
  • The purpose of this study was to provide the guidance in more objective and proper clothing design and a strategy of fashion marketing from consumer sensibilities about adult women's town wear in un-limited circumstance. The specific objectives were 1) to investigate relationship between fashion sensibility and consumer sensibility of Good and Bad women's town wear, 2) to compare fashion sensibility with consumer sensibility of Good and Bad women's town wear, and 3) to investigate a dimension of Good and Bad women's town wear in fashion sensibility. Because they can affect estimators, the collected photos at shopping mall, department stores, and churches (S/S, F/W: April 28, 2004~May 1, 2005) were prepared removing face and background and attached on gray board. To investigate consumer sensibilities, the stimulus were 80 photos (40 for 'good', 40 for 'Bad'). The questionnaire consisted of bi-polar 25 pairs adjective scale of consumer sensibility was distributed 600 female (20's~40's) living in Busan (June 28, 2005~July 11, 2005). The data were analyzed by ANOVA, Regression analysis, and MDS. The results of practical study are summarized as follows. Fashion sensibility on the preference in Good and Bad women's town wear is closely related in 'want to buy-do not want to buy' and buying need is 'like-dislike'. For the fashion sensibility dimension at Good women's town wear, X axis showed from Young to Adult and Y axis showed from Hard to Comfortable following positioned design characters. For the fashion sensibilities dimension at Bad women's town wear, X axis showed from Young to Adult, Y axis showed from Normal to Unique, and Z axis showed from Heavy to Light following positioned design characters.