• 제목/요약/키워드: Multiple regression model

검색결과 2,523건 처리시간 0.03초

Pennsylvania주 옥수수 재배 토양의 질소공급능력 평가 (N-supplying Capability Evaluation of Corn Field Soils in Pennsylvania)

  • 홍순달
    • 한국토양비료학회지
    • /
    • 제31권4호
    • /
    • pp.359-367
    • /
    • 1998
  • 미국 Pennsylvania주 옥수수 재배 토양의 질소공급능력을 1986년부터 3년간 수행되었던 47개 토양의 화학성 및 정밀 토양도 속성들과의 회귀분석으로 평가 비교하였다. 질소공급능력과 가장 밀접한 상관을 보인 화학성은 $NO_3-N$ 함량($R^2=0.518$)이었으나 질소공급능력에 대한 표준화 편회귀계수는 년차간 변이를 보이며 0.578로 다른 성질들과 큰 차이를 보이지 않았다. 질소공급능력에 대한 다중선형 회귀분석은 단순 회귀분석에 비하여 양호한 평가를 보였으며 화학성들에 의한 결정계수는 $R^2=0.599$, 화학성과 Ap층 깊이의 정량적 지표들에 의한 계수는 $R^2=0.698$, 정량적 지표들과 정성적 지표들에 의한 계수는 $R^2=0.839$로 증가되었다. 이는 다중선형 회귀모델식이 단순 회귀모델식보다 토양의 질소공급능력을 보다 신뢰성 있게 평가할 수 있는 접근방법임을 보여주었다.

  • PDF

잔차 분산을 이용한 선형회귀모형의 다중전환점 검정 (Testing for a multiple change point residual variance in regression model)

  • 이인석;김종태;이금자
    • Journal of the Korean Data and Information Science Society
    • /
    • 제12권1호
    • /
    • pp.27-40
    • /
    • 2001
  • 본 연구는 시간의 변화에 따라 여러 개의 전환점이 발생하여 선형회귀모형들이 여러번 변화할 때의 변환시점을 Gasser, Stroke와 Jennen-Steinmez의 잔차분산 추정량을 이용하여 검정하고 실제의 몇 가지 모형을 제시하여 Graphic을 통하여 조사한 결과 여기서 제시한 방법이 더 효과적으로 자중전환점을 찾을 수 있었다.

  • PDF

Multiple linear regression and fuzzy linear regression based assessment of postseismic structural damage indices

  • Fani I. Gkountakou;Anaxagoras Elenas;Basil K. Papadopoulos
    • Earthquakes and Structures
    • /
    • 제24권6호
    • /
    • pp.429-437
    • /
    • 2023
  • This paper studied the prediction of structural damage indices to buildings after earthquake occurrence using Multiple Linear Regression (MLR) and Fuzzy Linear Regression (FLR) methods. Particularly, the structural damage degree, represented by the Maximum Inter Story Drift Ratio (MISDR), is an essential factor that ensures the safety of the building. Thus, the seismic response of a steel building was evaluated, utilizing 65 seismic accelerograms as input signals. Among the several response quantities, the focus is on the MISDR, which expresses the postseismic damage status. Using MLR and FLR methods and comparing the outputs with the corresponding evaluated by nonlinear dynamic analyses, it was concluded that the FLR method had the most accurate prediction results in contrast to the MLR method. A blind prediction applying a set of another 10 artificial accelerograms also examined the model's effectiveness. The results revealed that the use of the FLR method had the smallest average percentage error level for every set of applied accelerograms, and thus it is a suitable modeling tool in earthquake engineering.

다항식 회귀분석을 이용한 전자저울의 비선형 특성 개선 연구 (A Study of the Nonlinear Characteristics Improvement for a Electronic Scale using Multiple Regression Analysis)

  • 채규수
    • 융합정보논문지
    • /
    • 제9권6호
    • /
    • pp.1-6
    • /
    • 2019
  • 본 연구에서는 다항식 회귀분석(Polynomial regression analysis) 방법을 이용하여 비선형 특성을 갖는 전자저울의 질량 추정 모델 개발이 이루어 졌다. 전자저울에 사용되는 로드셀의 출력 단자 전압을 기준 질량 추를 사용하여 직접 측정하였고 이 데이터를 이용하여 MS Office 엑셀의 행렬식 계산과 데이터 추세선 분석 기능을 이용하여 다항식 회귀모델을 구하였다. 5kg까지 측정 가능한 로드셀 전자저울을 사용하여 100g단위로 질량을 측정하였고 다항식 회귀분석(Multiple regression analysis) 모델을 구하였으며, 단순(1차), 2차, 3차 다항식 회귀분석에 대한 오차를 구하였다. 각 모델에 대한 회귀 방정식의 적합도 분석을 위해 결정계수(Coefficient of determination)를 제시하여 추정 질량과 측정 데이터와의 상관관계를 나타내었다. 본 연구에서 제안하는 3차 다항식 모델을 이용하여 추정 값의 표준편차가 10g, 결정계수 1.0으로 상당히 정확한 모델을 얻었다. 본 연구에 사용된 선형 회귀 분석 이론을 바탕으로 최근 인공지능 분야에서 많이 사용되고 있는 로지스틱 회귀 분석(Logistic regression analysis)을 활용하여 기상예측, 신약개발, 경제지표 분석 등의 분야에 대한 다양한 연구를 수행할 수 있을 것으로 생각된다.

기계학습을 적용한 자기보고 증상 기반의 어혈 변증 모델 구축 (Machine Learning Approach to Blood Stasis Pattern Identification Based on Self-reported Symptoms)

  • 김현호;양승범;강연석;박영배;김재효
    • Korean Journal of Acupuncture
    • /
    • 제33권3호
    • /
    • pp.102-113
    • /
    • 2016
  • Objectives : This study is aimed at developing and discussing the prediction model of blood stasis pattern of traditional Korean medicine(TKM) using machine learning algorithms: multiple logistic regression and decision tree model. Methods : First, we reviewed the blood stasis(BS) questionnaires of Korean, Chinese, and Japanese version to make a integrated BS questionnaire of patient-reported outcomes. Through a human subject research, patients-reported BS symptoms data were acquired. Next, experts decisions of 5 Korean medicine doctor were also acquired, and supervised learning models were developed using multiple logistic regression and decision tree. Results : Integrated BS questionnaire with 24 items was developed. Multiple logistic regression models with accuracy of 0.92(male) and 0.95(female) validated by 10-folds cross-validation were constructed. By decision tree modeling methods, male model with 8 decision node and female model with 6 decision node were made. In the both models, symptoms of 'recent physical trauma', 'chest pain', 'numbness', and 'menstrual disorder(female only)' were considered as important factors. Conclusions : Because machine learning, especially supervised learning, can reveal and suggest important or essential factors among the very various symptoms making up a pattern identification, it can be a very useful tool in researching diagnostics of TKM. With a proper patient-reported outcomes or well-structured database, it can also be applied to a pre-screening solutions of healthcare system in Mibyoung stage.

Development of a soil total carbon prediction model using a multiple regression analysis method

  • Jun-Hyuk, Yoo;Jwa-Kyoung, Sung;Deogratius, Luyima;Taek-Keun, Oh;Jaesung, Cho
    • 농업과학연구
    • /
    • 제48권4호
    • /
    • pp.891-897
    • /
    • 2021
  • There is a need for a technology that can quickly and accurately analyze soil carbon contents. Existing soil carbon analysis methods are cumbersome in terms of professional manpower requirements, time, and cost. It is against this background that the present study leverages the soil physical properties of color and water content levels to develop a model capable of predicting the carbon content of soil sample. To predict the total carbon content of soil, the RGB values, water content of the soil, and lux levels were analyzed and used as statistical data. However, when R, G, and B with high correlations were all included in a multiple regression analysis as independent variables, a high level of multicollinearity was noted and G was thus excluded from the model. The estimates showed that the estimation coefficients for all independent variables were statistically significant at a significance level of 1%. The elastic values of R and B for the soil carbon content, which are of major interest in this study, were -2.90 and 1.47, respectively, showing that a 1% increase in the R value was correlated with a 2.90% decrease in the carbon content, whereas a 1% increase in the B value tallied with a 1.47% increase in the carbon content. Coefficient of determination (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE) methods were used for regression verification, and calibration samples showed higher accuracy than the validation samples in terms of R2 and MAPE.

DETECTION OF OUTLIERS IN WEIGHTED LEAST SQUARES REGRESSION

  • Shon, Bang-Yong;Kim, Guk-Boh
    • Journal of applied mathematics & informatics
    • /
    • 제4권2호
    • /
    • pp.501-512
    • /
    • 1997
  • In multiple linear regression model we have presupposed assumptions (independence normality variance homogeneity and so on) on error term. When case weights are given because of variance heterogeneity we can estimate efficiently regression parameter using weighted least squares estimator. Unfortunately this estimator is sen-sitive to outliers like ordinary least squares estimator. Thus in this paper we proposed some statistics for detection of outliers in weighted least squares regression.

저류함수모형의 매개변수 보정과 홍수예측 (2) 홍수예측방법의 비교 연구 (Parameter Calibration of Storage Function Model and Flood Forecasting (2) Comparative Study on the Flood Forecasting Methods)

  • 김범준;송재현;김형수;홍일표
    • 대한토목학회논문집
    • /
    • 제26권1B호
    • /
    • pp.39-50
    • /
    • 2006
  • 홍수를 예측하기 위해서 국내 5대강 유역의 홍수통제소는 저류함수모형을 사용하고 있으며 현재까지 홍수예측에 대한 많은 연구가 이루어지고 있다. 이에 본 논문에서는 현재 홍수통제소에서 사용되고 있는 저류함수모형과 과거의 강우-수위 관계를 이용한 회귀분석(regression analysis), 그리고 인공신경망(artificial neural network)을 이용하여 홍수를 예측하고 이를 비교, 분석하고자 하였다. 저류함수모형의 경우는 홍수통제소의 대표매개변수와 보정된 최적(평균)매개변수를 적용하였다. 그리고 회귀분석과 인공신경망은 1995~2001년까지의 홍수사상 중 4개의 홍수사상을 선택하여 회귀계수를 구하고 역전파(backpropagation) 알고리즘을 사용하여 학습을 시켰다. 그 결과 저류함수모형의 경우 최적 매개변수를 이용하였을 때 기존의 홍수통제소에서 사용하고 있는 대표매개변수보다 예측이 개선되었으며, 회귀분석의 방법인 다중회귀분석, Robust 회귀분석, Stepwise 회귀분석을 이용한 홍수예측은 비교적 정확한 결과를 얻을 수 있었다. 역전파 알고리즘을 사용한 인공신경망의 경우도 회귀분석을 이용한 홍수예측보다는 다소 못하였지만 정확한 결과를 얻을 수 있었다.

Evaluation of Sigumjang Aroma by Stepwise Multiple Regression Analysis of Gas Chromatographic Profiles

  • Choi, Ung-Kyu;Kwon, O-Jun;Lee, Eun-Jeong;Son, Dong-Hwa;Cho, Young-Je;Im, Moo-Hyeog;Chung, Yung-Gun
    • Journal of Microbiology and Biotechnology
    • /
    • 제10권4호
    • /
    • pp.476-481
    • /
    • 2000
  • A linear correlation, by the stepwise multiple regression analysis, was found between the sensory test of Sigumjang aroma and the gas chromatographic data which were transformed with logarithm. GC data is the most objective method to evaluate Sigumjang aroma. A multiple correlation coefficient and a determination coefficient of more than 0.9 were obtained at the 9th and 13th steps, respectively. At step 31, the coefficient of determination level of 0.95 was attained. The accuracy of its estimation became higher as the number of the variables entered into the regression model increased. Over 90% of the Sigumjang aroma was explained by 13 compounds indentified on GC. The contributing proportion of the peak 26 was the highest followed by peaks 57 (9.27%), 29 (7.51%), 54 (6.01%), 8 (5.99%), 49 (4.97%), and 13 (4.11%).

  • PDF

주성분 분석과 다중회귀모형을 사용한 자동차 건조 공정의 히트펌프 건조기 소모 전력 분석 (Analyses of Power Consumption of the Heat Pump Dryer in the Automobile Drying Process by using the Principal Component Analysis and Multiple Regression)

  • 이창용;송근수;김진호
    • 산업경영시스템학회지
    • /
    • 제38권1호
    • /
    • pp.143-151
    • /
    • 2015
  • In this paper, we investigate how the power consumption of a heat pump dryer depends on various factors in the drying process by analyzing variables that affect the power consumption. Since there are in general many variables that affect the power consumption, for a feasible analysis, we utilize the principal component analysis to reduce the number of variables (or dimensionality) to two or three. We find that the first component is correlated positively to the entrance temperature of various devices such as compressor, expander, evaporator, and the second, negatively to condenser. We then model the power consumption as a multiple regression with two and/or three transformed variables of the selected principal components. We find that fitted value from the multiple regression explains 80~90% of the observed value of the power consumption. This results can be applied to a more elaborate control of the power consumption in the heat pump dryer.