• Title/Summary/Keyword: Normal linear regression model

검색결과 86건 처리시간 0.024초

The skew-t censored regression model: parameter estimation via an EM-type algorithm

  • Lachos, Victor H.;Bazan, Jorge L.;Castro, Luis M.;Park, Jiwon
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.333-351
    • /
    • 2022
  • The skew-t distribution is an attractive family of asymmetrical heavy-tailed densities that includes the normal, skew-normal and Student's-t distributions as special cases. In this work, we propose an EM-type algorithm for computing the maximum likelihood estimates for skew-t linear regression models with censored response. In contrast with previous proposals, this algorithm uses analytical expressions at the E-step, as opposed to Monte Carlo simulations. These expressions rely on formulas for the mean and variance of a truncated skew-t distribution, and can be computed using the R library MomTrunc. The standard errors, the prediction of unobserved values of the response and the log-likelihood function are obtained as a by-product. The proposed methodology is illustrated through the analyses of simulated and a real data application on Letter-Name Fluency test in Peruvian students.

HOS 특징 벡터를 이용한 장애 음성 분류 성능의 향상 (Performance Improvement of Classification Between Pathological and Normal Voice Using HOS Parameter)

  • 이지연;정상배;최흥식;한민수
    • 대한음성학회지:말소리
    • /
    • 제66호
    • /
    • pp.61-72
    • /
    • 2008
  • This paper proposes a method to improve pathological and normal voice classification performance by combining multiple features such as auditory-based and higher-order features. Their performances are measured by Gaussian mixture models (GMMs) and linear discriminant analysis (LDA). The combination of multiple features proposed by the frame-based LDA method is shown to be an effective method for pathological and normal voice classification, with a 87.0% classification rate. This is a noticeable improvement of 17.72% compared to the MFCC-based GMM algorithm in terms of error reduction.

  • PDF

로지스틱 회귀모형에서 이변량 정규분포에 근거한 로그-밀도비 (Log-density Ratio with Two Predictors in a Logistic Regression Model)

  • 강명욱;윤재은
    • 응용통계연구
    • /
    • 제26권1호
    • /
    • pp.141-149
    • /
    • 2013
  • 로지스틱회귀모형에서 두 설명변수의 조건부 분포가 모두 이변량 정규분포라고 할 수 있다면 설명변수들의 함수로 표현되는 로그-밀도비를 통해 모형에 포함시켜야하는 항을 알 수 있다. 두개의 이변량 정규분포에서 분산-공분산행렬이 같은 경우에는 이차항과 교차항 없이 일차항만으로 충분하다. 상관계수가 모두 0이면 교차항은 설명변수의 분산과 관계없이 필요하지 않다. 또한 로지스틱회귀모형에서 로그-밀도비를 통해 이차항과 교차항이 필요하지 않게 되는 다른 조건들도 알아본다.

수학교과의 동형고사 문항에서 양호도 향상에 유효한 최적정답율 산정에 관한 연구 (Study on Estimating the Optimal Number-right Score in Two Equivalent Mathematics-test by Linear Score Equating)

  • 홍석강
    • 한국수학교육학회지시리즈A:수학교육
    • /
    • 제37권1호
    • /
    • pp.1-13
    • /
    • 1998
  • In this paper, we have represented the efficient way how to enumerate the optimal number-right scores to adjust the item difficulty and to improve item discrimination. To estimate the optimal number-right scores in two equivalent math-tests by linear score equating a measurement error model was applied to the true scores observed from a pair of equivalent math-tests assumed to measure same trait. The model specification for true scores which is represented by the bivariate model is a simple regression model to inference the optimal number-right scores and we assume again that the two simple regression lines of raw scores and true scores are independent each other in their error models. We enumerated the difference between mean value of $\chi$* and ${\mu}$$\_$$\chi$/ and the difference between the mean value of y*and a+b${\mu}$$\_$$\chi$/ by making an inference the estimates from 2 error variable regression model. Furthermore, so as to distinguish from the original score points, the estimated number-right scores y’$\^$*/ as the estimated regression values of true scores with the same coordinate were moved to center points that were composed of such difference values with result of such parallel score moving procedure as above mentioned. We got the asymptotically normal distribution in Figure 5 that was represented as the optimal distribution of the optimal number-right scores so that we could decide the optimal proportion of number-right score in each item. Also by assumption that equivalence of two tests is closely connected to unidimensionality of a student’s ability. we introduce new definition of trait score to evaluate such ability in each item. In this study there are much limitations in getting the real true scores and in analyzing data of the bivariate error model. However, even with these limitations we believe that this study indicates that the estimation of optimal number right scores by using this enumeration procedure could be easily achieved.

  • PDF

한국 청소년(만 17세) 체격의 시대적 변천에 대한 통계적 모형 추정 -1983년부터 1993년까지- (Statistical Estimated Model of Chronological Change in Physical Growth and Development in Korean Youth(17 Years Old) - From 1983 To 1993 -)

  • 성웅현;윤석옥;윤태영;최중명;박순영
    • 보건교육건강증진학회지
    • /
    • 제12권2호
    • /
    • pp.36-47
    • /
    • 1995
  • This research was obtained from analyzing how the physiques of the 3rd grade students of high school for males and females and developed for the last eleven years(from 1983 to 1993). By the physiques and nutritional index of physical growth and development, Relative Body Weight of 36.62 exceeded the standard, on the other hand females showed lower records than the standard. Relative Chest Girth Index belonged to the normal type of males and females in all, in the comparison of the records between 1983 and 1993, males increased in average 0.29 and females in average 0.55. Relative Chest Girth Index of females was greater than that of females. By the results of Relative Sitting Height Index, growth of the lower body for males and females was greater than that of males. In case of Vervaeck Index, males increased in average 2.04 but females increased in average 1, 20 relatively less than males. These phenomena provided for the evidence of the deficient nutrition in females. In the regression models of body height and body weight within a certain period, statistical regression model types which best indicated chronological average changes of body height and body weight, took 3rd Order Polynomial Regression Model rather than linear regression model. In females, statistical regression model types which best is suitable for chronological average change of body height and body weight, took 4th and 2nd Order Polynomial Regression Model respectively. The prediction value of 1995 by estimated polynomial regression model anticipated that body height of 3rd grade year students of high school of males in 1993 went on increasing from 170.87cm to 171.79cm in average 0.92cm growth and that of females from 158.99cm to 160.79cm in average 1.80cm growth. In addition, body weight of males seemed to increase from 62.58kg to 64.52kg in average 1.94kg growth and that of females seemed to increase from 54.05kg to 54.19kg in average 0.14kg growth. Linear Regression Model was suitable for the regression model of body weight for body height. Prediction on increase of an average body weight for body height was that, according to growth of body height 1cm in males, body weight increased 1.41kg averagely and that of females 0.86kg. For that reason, we came to conclusion that body weight increase for body height 1cm in males was greater than that in females on average.

  • PDF

마할라노비스-다구치 시스템과 로지스틱 회귀의 성능비교 : 사례연구 (Performance Comparison of Mahalanobis-Taguchi System and Logistic Regression : A Case Study)

  • 이승훈;임근
    • 대한산업공학회지
    • /
    • 제39권5호
    • /
    • pp.393-402
    • /
    • 2013
  • The Mahalanobis-Taguchi System (MTS) is a diagnostic and predictive method for multivariate data. In the MTS, the Mahalanobis space (MS) of reference group is obtained using the standardized variables of normal data. The Mahalanobis space can be used for multi-class classification. Once this MS is established, the useful set of variables is identified to assist in the model analysis or diagnosis using orthogonal arrays and signal-to-noise ratios. And other several techniques have already been used for classification, such as linear discriminant analysis and logistic regression, decision trees, neural networks, etc. The goal of this case study is to compare the ability of the Mahalanobis-Taguchi System and logistic regression using a data set.

Electricity Price Forecasting in Ontario Electricity Market Using Wavelet Transform in Artificial Neural Network Based Model

  • Aggarwal, Sanjeev Kumar;Saini, Lalit Mohan;Kumar, Ashwani
    • International Journal of Control, Automation, and Systems
    • /
    • 제6권5호
    • /
    • pp.639-650
    • /
    • 2008
  • Electricity price forecasting has become an integral part of power system operation and control. In this paper, a wavelet transform (WT) based neural network (NN) model to forecast price profile in a deregulated electricity market has been presented. The historical price data has been decomposed into wavelet domain constitutive sub series using WT and then combined with the other time domain variables to form the set of input variables for the proposed forecasting model. The behavior of the wavelet domain constitutive series has been studied based on statistical analysis. It has been observed that forecasting accuracy can be improved by the use of WT in a forecasting model. Multi-scale analysis from one to seven levels of decomposition has been performed and the empirical evidence suggests that accuracy improvement is highest at third level of decomposition. Forecasting performance of the proposed model has been compared with (i) a heuristic technique, (ii) a simulation model used by Ontario's Independent Electricity System Operator (IESO), (iii) a Multiple Linear Regression (MLR) model, (iv) NN model, (v) Auto Regressive Integrated Moving Average (ARIMA) model, (vi) Dynamic Regression (DR) model, and (vii) Transfer Function (TF) model. Forecasting results show that the performance of the proposed WT based NN model is satisfactory and it can be used by the participants to respond properly as it predicts price before closing of window for submission of initial bids.

낙동강 유역의 유역 유출량 산정에 따른 지역별 가뭄 빈도분석 (Regional Drought Frequency Analysis with Estimated Monthly Runoff Series in the Nakdong River Basin)

  • 김성원
    • 한국농공학회지
    • /
    • 제41권5호
    • /
    • pp.53-67
    • /
    • 1999
  • In this study, regional frequency analysis is used to determine each subbasin drought frequency with watershed runoff which is calculated with Tank Model in Nakdong river basin. L-Monments methd which is almost unbiased and nearly normal distribution is applied to estimate paramers of drought frequency analysis of monthly runoff time series. The duration of '76-77 was the most severe drought year than othe rwater years in this study. To decide drought frequency of each subbasin from the main basin, it is calculated by interpolaing runoff from the frequency-druoght runoff relationship. and the linear regression analysis is accomplished between drought frequency of main basin and that of each subbasin. With the results of linear regression analysis, the drought runoff of each subbasin is calculated corresponing to drought frequency 10,20 and 30 years of Nakdong river basin considering safety standards for the design of impounding facilities. As the results of this study, the proposed methodology and procedure of this study can be applied to water budget analysis considering safety standards for the design of impounding facilities in the large-scale river basin. For this purpose, above all, it is recommanded that expansion of reliable observed runoff data is necessary instead of calculated runoff by rainfall-runoff conceptual model.

  • PDF

실측 철도 진동 데이터베이스를 이용한 철도진동 평가 시스템 개발 (Development of Railway Vibration Evaluation System Using Actual Railway Vibration Database)

  • 이현준;서은성;황영섭
    • 정보처리학회논문지:소프트웨어 및 데이터공학
    • /
    • 제8권4호
    • /
    • pp.153-162
    • /
    • 2019
  • 최근 철도소음으로 인해 발생하는 궤도 주변 구조물의 민원 방지와 궤도 주변 산업단지의 초정밀 장비들의 정상적인 운영을 위해 철도 진동을 정량적으로 평가할 수 있는 기술개발이 필요하다. 기존의 해석적인 방법은 매우 복잡한 동적 응답 모델이 요구되며, 요구 모델의 부정확성으로 인한 결과의 신뢰성을 확보하기 어려운 문제가 있다. 따라서, 본 논문에서는 철도 진동에 영향을 주는 요소들을 분류한 국내 철도진동 실측 데이터베이스를 기반으로 Linear Regression, Gradient Descent 기법을 이용해 철도 운행으로부터 발생되는 진동값을 추론하는 철도진동 평가 알고리즘 및 시스템을 제안한다. 제안된 알고리즘으로 얻은 추론결과는 기존의 해석적 방법에 비해 높은 효율성과 정확성을 보인다.

문헌 단위 인용 네트워크 내 인용과 중심성 지수 간 관계 추정에 관한 연구 (Curve Estimation among Citation and Centrality Measures in Article-level Citation Networks)

  • 유소영
    • 정보관리학회지
    • /
    • 제29권2호
    • /
    • pp.193-204
    • /
    • 2012
  • 이 연구에서는 인용 및 동시인용 문헌 네트워크에서의 중심성 지수를 사용한 추론 통계 적용의 첫 번째 단계로써 이들 간 관계의 선형성을 살펴보고자 하였다. 703개의 문헌 동시인용 네트워크를 활용하여 인용 빈도, 연결정도 중심성, 인접 중심성, 매개 중심성 간의 4가지 주요 관계의 패턴을 살펴본 결과, 모든 인용 및 중심성 간 관계가 선형모델보다는 비선형적 모델로 더 잘 설명될 수 있음을 통계적으로 확인되었다. 따라서 이들 간의 인과관계에 대한 다중회귀분석과 같은 추론 통계 분석의 기반이 되는 선형성을 확보하기 위해서는 논리적인 기준에 근거한 데이터 변환이나 실제값을 구간값으로 변환하는 과정이 필요하다고 할 수 있다.