• Title/Summary/Keyword: 분위수

Search Result 82, Processing Time 0.019 seconds

Generalized Support Vector Quantile Regression (일반화 서포트벡터 분위수회귀에 대한 연구)

  • Lee, Dongju;Choi, Sujin
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.43 no.4
    • /
    • pp.107-115
    • /
    • 2020
  • Support vector regression (SVR) is devised to solve the regression problem by utilizing the excellent predictive power of Support Vector Machine. In particular, the ⲉ-insensitive loss function, which is a loss function often used in SVR, is a function thatdoes not generate penalties if the difference between the actual value and the estimated regression curve is within ⲉ. In most studies, the ⲉ-insensitive loss function is used symmetrically, and it is of interest to determine the value of ⲉ. In SVQR (Support Vector Quantile Regression), the asymmetry of the width of ⲉ and the slope of the penalty was controlled using the parameter p. However, the slope of the penalty is fixed according to the p value that determines the asymmetry of ⲉ. In this study, a new ε-insensitive loss function with p1 and p2 parameters was proposed. A new asymmetric SVR called GSVQR (Generalized Support Vector Quantile Regression) based on the new ε-insensitive loss function can control the asymmetry of the width of ⲉ and the slope of the penalty using the parameters p1 and p2, respectively. Moreover, the figures show that the asymmetry of the width of ⲉ and the slope of the penalty is controlled. Finally, through an experiment on a function, the accuracy of the existing symmetric Soft Margin, asymmetric SVQR, and asymmetric GSVQR was examined, and the characteristics of each were shown through figures.

Properties of alternative VaR for multivariate normal distributions (다변량 정규분포에서 대안적인 VaR의 특성)

  • Hong, Chong Sun;Lee, Gi Pum
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.6
    • /
    • pp.1453-1463
    • /
    • 2016
  • The most useful financial risk measure may be VaR (Value at Risk) which estimates the maximum loss amount statistically. The VaR tends to be estimated in many industries by using transformed univariate risk including variance-covariance matrix and a specific portfolio. Hong et al. (2016) are defined the Vector at Risk based on the multivariate quantile vector. When a specific portfolio is given, one point among Vector at Risk is founded as the best VaR which is called as an alternative VaR (AVaR). In this work, AVaRs have been investigated for multivariate normal distributions with many kinds of variance-covariance matrix and various portfolio weight vectors, and compared with VaRs. It has been found that the AVaR has smaller values than VaR. Some properties of AVaR are derived and discussed with these characteristics.

A study on the determination of substrata using the information of exponential response rate by simulation studies (모의실험을 기반으로 지수형 응답률 보정을 위한 세부 층 결정에 관한 연구)

  • Min, Joo-Won;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.621-636
    • /
    • 2018
  • Research on the application of informative sampling technique has been conducted in order to reduce the influence of non-response. Chung and Shin (Korean Journal of Applied Statistics, 30, 993-1004, 2017) showed that the estimation accuracy improved when using exponential response rate information for the parameter estimation if the distribution of errors included in the super population model follows normal distribution. However this method divides the stratum into equally spaced substrata to obtain the sample weight of the informative sampling technique and shows that the accuracy of the estimation improves as the number of substrata increases. In this study, with the given number of total sample size, the optimal substratum boundary points are calculated using equal space, quantile, and LH algorithm; consequently, the results using those methods are compared through simulation. We also studied the criteria to determine the number of substrata and substratum boundaries that can be used in practice with various types of auxiliary variable distributions.

Splice Length of GFRP Rebars Based on Flexural Tests of Unconfined RC Members (RC 부재 휨 실험에 의한 GFRP 보강근의 이음길이 제안)

  • Choi, Dong-Uk;Chun, Sung-Chul;Ha, Sang-Su
    • Journal of the Korea Concrete Institute
    • /
    • v.21 no.1
    • /
    • pp.65-74
    • /
    • 2009
  • Glass fiber reinforced polymer (GFRP) bars are sometimes used when corrosion of conventional reinforcing steel bar is of concern. In this study, a total of 36 beams and one-way slabs reinforced using GFRP bars were tested in flexure. Four different GFRP bars of 13 mm diameter were used in the test program. In most test specimens, the GFRP bars were lap spliced at center. All beams and slabs were tested under 4-point loads so that the spliced region be subject to constant moment. Test variables were splice lengths, cover thicknesses, and bar spacings. No stirrups were used in the spliced region so that the tests result in conservative bond strengths. Average bond stresses that develop between GFRP bars and concrete were determined through nonlinear analysis of the cross-sections. An average bond stress prediction equation was derived utilizing two-variable linear regression. A splice length equation based on 5% fractile concept was then developed. As a result of this study, a rational equation with which design splice lengths of the GFRP bars can be determined, was proposed.

Intergenerational economic mobility in Korea using a quantile regression analysis (한국의 세대 간 경제적 이동성 - 분위수회귀분석을 중심으로 -)

  • Richey, Jeremiah;Jeong, Kiho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.4
    • /
    • pp.715-725
    • /
    • 2014
  • This study uses a quantile regression analysis to investigate intergenerational economic mobility in Korea. The analysis is based on data from the 1st through 11th waves of the Korean Labor and Income Panel Study (KLIPS) conducted from 1998-2008. The household nature of the data allows us to link parents' incomes to children's incomes at different points in time. Using a quantile regression analysis instead of mean one reveals that the effect of fathers' earnings are different across the conditional distribution of sons' earnings, particularly being larger on the upper quantile than on the lower quantile. After controlling effect of sons' college education by including a dummy variable for the degree, however, the pattern among quantile effects for fathers' earnings is no longer clear. Instead a new pattern emerges that education has a much larger effect on the upper quantiles than on the lower ones. Using nonparametric estimates of conditional density curves based on the quantile regression results, we derive some interesting features in graphical forms, which are not obvious in numerical analysis.

CTE with weighted portfolios (가중 포트폴리오에서의 CTE)

  • Hong, Chong Sun;Shin, Dong Sik;Kim, Jae Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.1
    • /
    • pp.119-130
    • /
    • 2017
  • In many literatures on VaR and CTE for multivariate distribution, these are estimated by using transformed univariate distribution with a specific ratio of many kinds of portfolios. Even though there are lots of works to define quantiles for multivariate distributions, there does not exist a quantile uniquely. Hence, it is not easy to define the VaR and CTE. In this paper, we propose the weighted CTE vectors corresponding to various ratio combinations of many kinds of portfolios by extending the researches on the alternative VaR and integrated multivariate CTE based on multivariate quantiles. We extend relation equations about univariate CTEs to multivariate CTE vectors and discuss their characteristics. The proposed weighted CTEs are explored with some data from multivariate normal distribution and illustrative examples.

Landslide Triggering Rainfall Threshold Based on Landslide Type (사면파괴 유형별 강우 한계선 설정)

  • Lee, Ji-Sung;Kim, Yun-Tae;Song, Young-Karb;Jang, Dae-Heung
    • Journal of the Korean Geotechnical Society
    • /
    • v.30 no.12
    • /
    • pp.5-14
    • /
    • 2014
  • Most of slope failures have taken place between June and September in Korea, which cause a considerable damage to society. Rainfall intensity and duration are very significant triggering factors for landslide. In this paper, landslide-triggering rainfall threshold consisting of rainfall intensity-duration (I-D) was proposed. For this study, total 255 landslides were collected in landslide inventory during 1999 to 2012 from NDMI (National Disaster Management Institute), various reports, newspapers and field survey. And most of the required rainfall data were collected from KMA (Korea Meteorological Administration). The collected landslides were classified into three categories: debris flow, shallow landslide and unconfirmed. A rainfall threshold was proposed based on landslide type using statistical method such as quantile-regression method. Its validation was carried out based on 2013 landslide database. The proposed rainfall threshold was also compared with previous rainfall thresholds. The proposed landslide-triggering rainfall thresholds could be used in landslide early warning system in Korea.

A Graphical Method to Assess Goodness-of-Fit for Inverse Gaussian Distribution (역가우스분포에 대한 적합도 평가를 위한 그래프 방법)

  • Choi, Byungjin
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.1
    • /
    • pp.37-47
    • /
    • 2013
  • A Q-Q plot is an effective and convenient graphical method to assess a distributional assumption of data. The primary step in the construction of a Q-Q plot is to obtain a closed-form expression to represent the relation between observed quantiles and theoretical quantiles to be plotted in order that the points fall near the line y = a + bx. In this paper, we introduce a Q-Q plot to assess goodness-of-fit for inverse Gaussian distribution. The procedure is based on the distributional result that a transformed random variable $Y={\mid}\sqrt{\lambda}(X-{\mu})/{\mu}\sqrt{X}{\mid}$ follows a half-normal distribution with mean 0 and variance 1 when a random variable X has an inverse Gaussian distribution with location parameter ${\mu}$ and scale parameter ${\lambda}$. Simulations are performed to provide a guideline to interpret the pattern of points on the proposed inverse Gaussian Q-Q plot. An illustrative example is provided to show the usefulness of the inverse Gaussian Q-Q plot.

Panel data analysis with regression trees (회귀나무 모형을 이용한 패널데이터 분석)

  • Chang, Youngjae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.6
    • /
    • pp.1253-1262
    • /
    • 2014
  • Regression tree is a tree-structured solution in which a simple regression model is fitted to the data in each node made by recursive partitioning of predictor space. There have been many efforts to apply tree algorithms to various regression problems like logistic regression and quantile regression. Recently, algorithms have been expanded to the panel data analysis such as RE-EM algorithm by Sela and Simonoff (2012), and extension of GUIDE by Loh and Zheng (2013). The algorithms are briefly introduced and prediction accuracy of three methods are compared in this paper. In general, RE-EM shows good prediction accuracy with least MSE's in the simulation study. A RE-EM tree fitted to business survey index (BSI) panel data shows that sales BSI is the main factor which affects business entrepreneurs' economic sentiment. The economic sentiment BSI of non-manufacturing industries is higher than that of manufacturing ones among the relatively high sales group.

Estimating Price Elasticity of Residential Water Demand in Korea Using Panel Quatile Model (패널 분위수회귀 모형을 사용한 우리나라 지방 상수도 생활용수 수요의 가격탄력성 추정)

  • Kim, Hyung-Gun
    • Environmental and Resource Economics Review
    • /
    • v.27 no.1
    • /
    • pp.195-214
    • /
    • 2018
  • This study estimates the price elasticity of residential water demand in Korea. For that, annual panel data from the year of 2010 to 2013 for 161 local water services is estimated by using panel quantile model. As a result, the price elasticities of residental water demand in Korea are estimated to be between -0.156 and -0.189 depending on its quantile. In addition, the study finds that the estimated elasticity of residential water demand by traditional conditional mean regression is relatively more influenced by high demand areas because the distribution of residental water demand in Korea is left-skewed.