• 제목/요약/키워드: Least squares fit

검색결과 85건 처리시간 0.025초

Estimating quantiles of extreme wind speed using generalized extreme value distribution fitted based on the order statistics

  • Liu, Y.X.;Hong, H.P.
    • Wind and Structures
    • /
    • 제34권6호
    • /
    • pp.469-482
    • /
    • 2022
  • The generalized extreme value distribution (GEVD) is frequently used to fit the block maximum of environmental parameters such as the annual maximum wind speed. There are several methods for estimating the parameters of the GEV distribution, including the least-squares method (LSM). However, the application of the LSM with the expected order statistics has not been reported. This study fills this gap by proposing a fitting method based on the expected order statistics. The study also proposes a plotting position to approximate the expected order statistics; the proposed plotting position depends on the distribution shape parameter. The use of this approximation for distribution fitting is carried out. Simulation analysis results indicate that the developed fitting procedure based on the expected order statistics or its approximation for GEVD is effective for estimating the distribution parameters and quantiles. The values of the probability plotting correlation coefficient that may be used to test the distributional hypothesis are calculated and presented. The developed fitting method is applied to extreme thunderstorm and non-thunderstorm winds for several major cities in Canada. Also, the implication of using the GEVD and Gumbel distribution to model the extreme wind speed on the structural reliability is presented and elaborated.

Simple factor analysis of measured data

  • Kozar, Ivica;Kozar, Danila Lozzi;Malic, Neira Toric
    • Coupled systems mechanics
    • /
    • 제11권1호
    • /
    • pp.33-41
    • /
    • 2022
  • Quite often we have a lot of measurement data and would like to find some relation between them. One common task is to see whether some measured data or a curve of known shape fit into the cumulative measured data. The problem can be visualized since data could generally be presented as curves or planes in Cartesian coordinates where each curve could be represented as a vector. In most cases we have measured the cumulative 'curve', we know shapes of other 'curves' and would like to determine unknown coefficients that multiply the known shapes in order to match the measured cumulative 'curve'. This problem could be presented in more complex variants, e.g., a constant could be added, some missing (unknown) data vector could be added to the measured summary vector, and instead of constant factors we could have polynomials, etc. All of them could be solved with slightly extended version of the procedure presented in the sequel. Solution procedure could be devised by reformulating the problem as a measurement problem and applying the generalized inverse of the measurement matrix. Measurement problem often has some errors involved in the measurement data but the least squares method that is comprised in the formulation quite successfully addresses the problem. Numerical examples illustrate the solution procedure.

Optimal Minimum Bias Designs for Model Discrimination

  • Park, Joong-Yang
    • Communications for Statistical Applications and Methods
    • /
    • 제5권2호
    • /
    • pp.339-351
    • /
    • 1998
  • Designs for discriminating between two linear regression models are studied under $\Lambda$-type optimalities maximizing the measure for the lack of fit for the designs with fixed model inadequacy. The problem of selecting an appropriate $\Lambda$-type optimalities is shown to be closely related to the estimation method. $\Lambda$-type optimalities for the least squares and minimum bias estimation methods are considered. The minimum bias designs are suggested for the designs invariant with respect to the two estimation methods. First order minimum bias designs optimal under $\Lambda$-type optimalities are then derived. Finally for the case where the lack of fit test is significant, an approach to the construction of a second order design accommodating the optimal first order minimum bias design is illustrated.

  • PDF

FUZZY REGRESSION MODEL WITH MONOTONIC RESPONSE FUNCTION

  • Choi, Seung Hoe;Jung, Hye-Young;Lee, Woo-Joo;Yoon, Jin Hee
    • 대한수학회논문집
    • /
    • 제33권3호
    • /
    • pp.973-983
    • /
    • 2018
  • Fuzzy linear regression model has been widely studied with many successful applications but there have been only a few studies on the fuzzy regression model with monotonic response function as a generalization of the linear response function. In this paper, we propose the fuzzy regression model with the monotonic response function and the algorithm to construct the proposed model by using ${\alpha}-level$ set of fuzzy number and the resolution identity theorem. To estimate parameters of the proposed model, the least squares (LS) method and the least absolute deviation (LAD) method have been used in this paper. In addition, to evaluate the performance of the proposed model, two performance measures of goodness of fit are introduced. The numerical examples indicate that the fuzzy regression model with the monotonic response function is preferable to the fuzzy linear regression model when the fuzzy data represent the non-linear pattern.

최우도 모형을 이용한 수위-유량곡선식 개발 (Development of Rating Curves Using a Maximum Likelihood Model)

  • 김경훈;박준일;신찬기
    • 환경위생공학
    • /
    • 제23권4호
    • /
    • pp.83-93
    • /
    • 2008
  • The non-linear least squares model(NLSM) has long been the standard technique used by hydrologists for constructing rating curves. The reasons for its adaptation are vague, and its appropriateness as a method of describing discharge measurement uncertainty has not been well investigated. It is shown in this paper that the classical method of NLSM can model only a very limited class of variance heterogeneity. Furthermore, this lack of flexibility often leads to unaccounted heteroscedasticity, resulting in dubious values for the rating curve parameters and estimated discharge. By introducing a heteroscedastic maximum likelihood model(HMLM), the variance heterogeneity is treated more generally. The maximum likelihood model stabilises the variance better than the NLSM approach, and thus is a more robust and appropriate way to fit a rating curve to a set of discharge measurements.

Geometric Geoid Determination in South Korea using GPS/Levelling Data

  • Yun, Hong-Sic
    • 한국측량학회지
    • /
    • 제13권2호
    • /
    • pp.285-289
    • /
    • 1995
  • 본 논문은 GPS로부터 측정된 타원체고와 Spirit Levelling으로부터 측정한 정표고를 이용하여 기하학적인 방법에 의하여 지오이드를 결정하였다. 기하학적인 지오이드고의 계산을 위하여 88점의 GPS측정데이타를 사용하였으며, 지오이드고는 간단하게 타원체고와 정표고의 차이를 구하고 보간법에 의하여 지오이드면을 결정하였다. 또한 보간법의 정확도를 평가하기 위하여 Minimum Curvature 방법과 Least Square Plane Fit방법을 사용하여 보간한 결과와 실측치를 비교하였으며, FFT를 적용하여 중력학적인 방법으로 결정한 지오이드고와 비교, 분석하였다.

  • PDF

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

  • Kim, Namhyun
    • Communications for Statistical Applications and Methods
    • /
    • 제24권5호
    • /
    • pp.519-531
    • /
    • 2017
  • We consider goodness-of-fit test statistics for Weibull distributions when data are randomly censored and the parameters are unknown. Koziol and Green (Biometrika, 63, 465-474, 1976) proposed the $Cram\acute{e}r$-von Mises statistic's randomly censored version for a simple hypothesis based on the Kaplan-Meier product limit of the distribution function. We apply their idea to the other statistics based on the empirical distribution function such as the Kolmogorov-Smirnov and Liao and Shimokawa (Journal of Statistical Computation and Simulation, 64, 23-48, 1999) statistics. The latter is a hybrid of the Kolmogorov-Smirnov, $Cram\acute{e}r$-von Mises, and Anderson-Darling statistics. These statistics as well as the Koziol-Green statistic are considered as test statistics for randomly censored Weibull distributions with estimated parameters. The null distributions depend on the estimation method since the test statistics are not distribution free when the parameters are estimated. Maximum likelihood estimation and the graphical plotting method with the least squares are considered for parameter estimation. A simulation study enables the Liao-Shimokawa statistic to show a relatively high power in many alternatives; however, the null distribution heavily depends on the parameter estimation. Meanwhile, the Koziol-Green statistic provides moderate power and the null distribution does not significantly change upon the parameter estimation.

Wage Determinants Analysis by Quantile Regression Tree

  • Chang, Young-Jae
    • Communications for Statistical Applications and Methods
    • /
    • 제19권2호
    • /
    • pp.293-301
    • /
    • 2012
  • Quantile regression proposed by Koenker and Bassett (1978) is a statistical technique that estimates conditional quantiles. The advantage of using quantile regression is the robustness in response to large outliers compared to ordinary least squares(OLS) regression. A regression tree approach has been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm that has a negligible selection bias and relatively low computational cost. Quantile regression can be regarded as an analogue of OLS, therefore it can also be applied to GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study(KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data with respect to the quantiles, 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, multiple linear piecewise quantile regression model forms the shortest tree structure, while the piecewise constant quantile regression model has a deeper tree structure with more terminal nodes in general. Age, gender, marriage status, and education seem to be the determinants of the wage level throughout the quantiles; in addition, education experience appears as the important determinant of the wage level in the highly paid group.

선형회귀모형에서 변수 선택에 영향을 미치는 관측점에 관한 연구 (Influential observations on variable selection in linear regression model)

  • 최지훈;구자흥;이재준;전홍석
    • 응용통계연구
    • /
    • 제6권2호
    • /
    • pp.421-433
    • /
    • 1993
  • 회귀분석에서 몇 개의 관측치가 모형선택과정이나 최소제곱방법에의한 모형의 적합에서 지 대한 영향을 끼칠 수 있다. 그러나 그러한 관측치가 회귀분석의 모든 면에 같은 정도의 영 향을 끼치는 것은 아니다. 본 논문에서는 개개의 관측치가 적합성의 총체적 측도에 미치는 영향을 측정할 수 있는 영향측도를 제시하였다. 이 측도는 자료를 모형에 적합시켜 구해진 잔차와 적합치 사이에 만족되는 비상관성의 성질을 이용하여 구해진 것이다. 마지막으로, 본 논문에서 소개된 통계량들과 가장 많이 사용되고 있는 영향측도들을 실제자료의 분석을 통 하여 그 성질과 효율성을 비교하였다.

  • PDF

A New Parameter Estimation Method for a Zipf-like Distribution for Geospatial Data Access

  • Li, Rui;Feng, Wei;Wang, Hao;Wu, Huayi
    • ETRI Journal
    • /
    • 제36권1호
    • /
    • pp.134-140
    • /
    • 2014
  • Many reports have shown that the access pattern for geospatial tiles follows Zipf's law and that its parameter ${\alpha}$ represents the access characteristics. However, visits to geospatial tiles have temporal and spatial popularities, and the ${\alpha}$-value changes as they change. We construct a mathematical model to simulate the user's access behavior by studying the attributes of frequently visited tile objects to determine parameter estimation algorithms. Because the least squares (LS) method in common use cannot obtain an exact ${\alpha}$-value and does not provide a suitable fit to data for frequently visited tiles, we present a new approach, which uses a moment method of estimation to obtain the value of ${\alpha}$ when ${\alpha}$ is close to 1. When ${\alpha}$ is further away from 1, the method uses the associated cache hit ratio for tile access and uses an LS method based on a critical cache size to estimate the value of ${\alpha}$. The decrease in the estimation error is presented and discussed in the section on experiment results. This new method, which provides a more accurate estimate of ${\alpha}$ than earlier methods, promises more effective prediction of requests for frequently accessed tiles for better caching and load balancing.