• Title/Summary/Keyword: Linear and multiple regression


MOISTURE CONTENT MEASUREMENT OF POWDERED FOOD USING RF IMPEDANCE SPECTROSCOPIC METHOD

  • Kim, K. B.;Lee, J. W.;Noh, S. H.;Lee, S. S.
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • 2000.11b
    • /
    • pp.188-195
    • /
    • 2000
  • This study measured the moisture content of powdered food using an RF impedance spectroscopic method. Over a frequency range of 1.0 to 30 ㎒, the impedance (reactance and resistance) of a parallel-plate sample holder filled with wheat flour or red-pepper powder, with moisture contents of 5.93 to 17.07% w.b. and 10.87 to 27.36% w.b., respectively, was characterized using a Q-meter (HP4342). Reactance was a better parameter than resistance for estimating moisture density, defined as the product of moisture content and bulk density, which was used in this study to eliminate the effect of bulk density on the RF spectral data. Multivariate analyses (principal component regression, partial least squares regression and multiple linear regression) were performed to develop a single calibration model, with moisture density and reactance spectral data as parameters, for determining the moisture content of both wheat flour and red-pepper powder. The multiple linear regression model performed best. For unknown powdered-food data, its bias, standard error of prediction and coefficient of determination were 0.179% moisture content, 1.679% moisture content and 0.8849, respectively.
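
The three validation figures quoted in the abstract above (bias, standard error of prediction, coefficient of determination) can be illustrated with a minimal sketch. The formulas below are the conventional chemometrics definitions and are an assumption, since the abstract does not state which definitions the authors used; the function names and the sample values are hypothetical and made up for illustration.

```python
import math

def moisture_density(moisture_content_pct, bulk_density):
    """Moisture density as defined in the abstract: moisture content times bulk density."""
    return moisture_content_pct * bulk_density

def prediction_metrics(actual, predicted):
    """Return (bias, SEP, R^2) for paired reference and predicted values.

    Assumed definitions: bias is the mean error, SEP is the standard
    deviation of the errors around the bias (n - 1 denominator), and
    R^2 is 1 - SS_res / SS_tot.
    """
    n = len(actual)
    errors = [p - a for a, p in zip(actual, predicted)]
    bias = sum(errors) / n
    sep = math.sqrt(sum((e - bias) ** 2 for e in errors) / (n - 1))
    mean_actual = sum(actual) / n
    ss_res = sum(e ** 2 for e in errors)
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    r2 = 1.0 - ss_res / ss_tot
    return bias, sep, r2

# Made-up moisture contents (% w.b.), not data from the paper.
reference = [6.0, 9.0, 12.0, 15.0]
predicted = [6.5, 8.5, 12.5, 15.5]
bias, sep, r2 = prediction_metrics(reference, predicted)
print(f"bias={bias:.3f}, SEP={sep:.3f}, R^2={r2:.4f}")
```

Reporting bias and SEP separately distinguishes a systematic offset from random scatter, which a single error figure would merge.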


A Study on Defect Diagnostics for Health Monitoring of a Turbo-Shaft Engine for SUAV (스마트 무인기용 터보축 엔진의 성능진단을 위한 결함 예측에 관한 연구)

  • Park Juncheol;Roh Taeseong;Choi Dongwhan
    • Proceedings of the Korean Society of Propulsion Engineers Conference
    • /
    • v.y2005m4
    • /
    • pp.248-251
    • /
    • 2005
  • In this paper, a health monitoring technique is studied for performance deterioration caused by defects in a gas turbine. The parameters for performance diagnostics were extracted using the GSP program to model the target engine, and a virtual sensor model for health monitoring was built from those data. The position and magnitude of defects in the engine components were determined using the multiple linear regression technique together with a weight-based method, in order to diagnose single and multiple defects.


Multiple linear regression model-based voltage imbalance estimation for high-power series battery pack (다중선형회귀모델 기반 고출력 직렬 배터리 팩의 전압 불균형 추정)

  • Kim, Seung-Woo;Lee, Pyeong-Yeon;Han, Dong-Ho;Kim, Jong-hoon
    • Journal of IKEEE
    • /
    • v.23 no.1
    • /
    • pp.1-8
    • /
    • 2019
  • In this paper, the electrical characteristics of a high-power series battery pack composed of 18650 cylindrical nickel cobalt aluminum (NCA) lithium-ion cells are tested at various C-rates. The electrical characteristics from a discharge capacity test with a 14S1P battery pack and an electric vehicle (EV) cycle test with a 4S1P battery pack are compared and analyzed across the C-rates. Multiple linear regression is used to estimate the voltage imbalance of the 14S1P and 4S1P battery packs at various C-rates based on the experimental data. The estimation accuracy is evaluated by root mean square error (RMSE) to validate the multiple linear regression. The results indicate that multiple linear regression estimates the voltage imbalance in the discharge capacity test with the 14S1P battery pack better than the voltage imbalance in the EV cycle test with the 4S1P battery pack.
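
As a minimal sketch of the RMSE evaluation mentioned in the abstract above, the snippet below compares measured and estimated voltage-imbalance values. The variable names, the millivolt unit and the numbers are hypothetical; the abstract does not give the data or say how the imbalance was defined.

```python
import math

def rmse(measured, estimated):
    """Root mean square error between paired measured and estimated values."""
    if len(measured) != len(estimated):
        raise ValueError("measured and estimated must have the same length")
    squared = [(m - e) ** 2 for m, e in zip(measured, estimated)]
    return math.sqrt(sum(squared) / len(squared))

# Made-up voltage imbalance values in mV, for illustration only.
measured_imbalance_mv = [10.0, 20.0, 30.0, 40.0]
estimated_imbalance_mv = [12.0, 18.0, 33.0, 39.0]
error_mv = rmse(measured_imbalance_mv, estimated_imbalance_mv)
print(f"RMSE = {error_mv:.3f} mV")
```

A lower RMSE for one test condition than for another is the kind of comparison the abstract draws between the two pack configurations.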

Machine learning-based regression analysis for estimating Cerchar abrasivity index

  • Kwak, No-Sang;Ko, Tae Young
    • Geomechanics and Engineering
    • /
    • v.29 no.3
    • /
    • pp.219-228
    • /
    • 2022
  • The most widely used parameter to represent rock abrasiveness is the Cerchar abrasivity index (CAI). The CAI value can be applied to predict wear in TBM cutters. It has been extensively demonstrated that the CAI is affected significantly by cementation degree, strength, and amount of abrasive minerals, i.e., the quartz content or equivalent quartz content in rocks. The relationship between the properties of rocks and the CAI is investigated in this study. A database comprising 223 observations that includes rock types, uniaxial compressive strengths, Brazilian tensile strengths, equivalent quartz contents, quartz contents, brittleness indices, and CAIs is constructed. A linear model is developed by selecting independent variables while considering multicollinearity after performing multiple regression analyses. Machine learning-based regression methods including support vector regression, regression tree regression, k-nearest neighbors regression, random forest regression, and artificial neural network regression are used in addition to multiple linear regression. The results of the random forest regression model show that it yields the best prediction performance.
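
The multicollinearity check mentioned in the abstract above can be sketched with the variance inflation factor (VIF). This is a simplified illustration and not the authors' procedure: it covers only the two-predictor case, where the VIF reduces to 1 / (1 - r^2) with r the Pearson correlation between the two predictors. The rock-property values are made up.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def vif_two_predictors(x1, x2):
    """VIF when the model has exactly two predictors: 1 / (1 - r^2)."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)

# Made-up values: quartz content (%), equivalent quartz content (%), UCS (MPa).
quartz = [10.0, 20.0, 30.0, 40.0]
equivalent_quartz = [10.0, 20.0, 30.0, 50.0]
ucs = [120.0, 80.0, 80.0, 120.0]

vif_quartz_pair = vif_two_predictors(quartz, equivalent_quartz)
vif_quartz_ucs = vif_two_predictors(quartz, ucs)
print(f"quartz vs equivalent quartz: VIF = {vif_quartz_pair:.2f}")
print(f"quartz vs UCS: VIF = {vif_quartz_ucs:.2f}")
```

A common rule of thumb treats a VIF above about 10 as a sign that one of the two predictors should be dropped. With more than two predictors, each VIF requires regressing that predictor on all the others.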

MULTIPLE OUTLIER DETECTION IN LOGISTIC REGRESSION BY USING INFLUENCE MATRIX

  • Lee, Gwi-Hyun;Park, Sung-Hyun
    • Journal of the Korean Statistical Society
    • /
    • v.36 no.4
    • /
    • pp.457-469
    • /
    • 2007
  • Many procedures are available to identify a single outlier or an isolated influential point in linear regression and logistic regression. However, detecting multiple influential points or multiple outliers is more difficult, owing to masking and swamping problems. Multiple outlier detection methods for logistic regression have not yet been studied from the standpoint of direct procedures. In this paper we consider direct methods for logistic regression by extending the influence matrix algorithm of Peña and Yohai (1995). We define the influence matrix in logistic regression using Cook's distance for logistic regression, and test for multiple outliers using the mean shift model. To show the accuracy of the proposed multiple outlier detection algorithm, we simulate artificial data that include multiple outliers with masking and swamping.

Orographic Precipitation Analysis with Regional Frequency Analysis and Multiple Linear Regression (지역빈도해석 및 다중회귀분석을 이용한 산악형 강수해석)

  • Yun, Hye-Seon;Um, Myoung-Jin;Cho, Won-Cheol;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.6
    • /
    • pp.465-480
    • /
    • 2009
  • In this study, simple and multiple linear regression models were used to derive the relationship between precipitation and altitude, latitude and longitude in Jejudo. The simple linear regression analysis examined whether an orographic effect exists in Jejudo, using annual average precipitation; the multiple linear regression analysis examined whether the orographic effect applies to the quantiles for each duration and return period obtained from regional frequency analysis with the index flood method. The regression results show that the relationship between altitude and precipitation becomes more strongly linear as the duration and return period increase. The multiple linear regression precipitation estimates (which used altitude, latitude and longitude information) were found to be more reasonable than estimates obtained using altitude only, altitude and latitude, or altitude and longitude. In particular, spatial distribution analysis by kriging in GIS gave realistic precipitation estimates, with precipitation concentrated in the southeast region, consistent with the actual climate of Jejudo. However, the accuracy of the regression model decreased for short-duration precipitation and for precipitation at high altitudes, even for long durations. Consequently, other factors that cause the orographic effect need to be considered to improve the accuracy of precipitation estimates.
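
A multiple linear regression of precipitation on altitude, latitude and longitude, as described in the abstract above, can be sketched with ordinary least squares solved through the normal equations. Everything specific here is an assumption for illustration: the station coordinates are synthetic offsets from an arbitrary reference point, the precipitation values are generated from made-up coefficients, and the function names are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system; predictors may be collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x

def fit_mlr(predictors, response):
    """Least-squares coefficients [intercept, b1, b2, ...] via (X^T X) beta = X^T y."""
    rows = [[1.0] + list(p) for p in predictors]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, response)) for i in range(k)]
    return solve_linear_system(xtx, xty)

def predict(beta, predictor):
    """Evaluate the fitted plane at one (altitude, north, east) point."""
    return beta[0] + sum(b * v for b, v in zip(beta[1:], predictor))

# Synthetic stations: (altitude in 100 m, north offset, east offset).
stations = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0),
            (0.0, 0.0, 1.0), (1.0, 1.0, 1.0), (2.0, 1.0, 0.0)]
true_beta = [1500.0, 200.0, 10.0, 30.0]  # made-up coefficients
precip = [predict(true_beta, s) for s in stations]

beta = fit_mlr(stations, precip)
print([round(b, 6) for b in beta])
```

Using offsets from a reference point instead of raw latitude and longitude keeps the normal equations well conditioned. For real data a library least-squares routine would usually be preferable to hand-written elimination.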

Particle size distributions and concentrations above radiators in indoor environments: Exploratory results from Xi'an, China

  • Chen, Xi;Li, Angui
    • Environmental Engineering Research
    • /
    • v.20 no.3
    • /
    • pp.237-245
    • /
    • 2015
  • Particulate matter in indoor environments has caused public concerns in recent years. The objective of this research is to explore the influence of radiators on particle size distributions and concentrations. The particle size distributions as well as concentrations above radiators and in the adjacent indoor air are monitored in forty-two indoor environments in Xi'an, China. The temperatures, relative humidity and air velocities are also measured. The particle size distributions above radiators at ten locations are analyzed. The results show that the functional difference of indoor environments has little impact on the particle size distributions above radiators. Then the effects of the environmental parameters (particle concentrations in the adjacent indoor air, temperatures, relative humidities and air velocities) on particle concentrations above radiators are assessed by applying multiple linear regression analysis. Three multiple linear regression models are established to predict the concentrations of $PM_{10}$, $PM_{2.5}$ and $PM_1$ above radiators.

The health effects of low blood lead level in oxidative stress as a marker, serum gamma-glutamyl transpeptidase level, in male steelworkers

  • Su-Yeon Lee;Yong-Jin Lee;Young-Sun Min;Eun-Chul Jang;Soon-Chan Kwon;Inho Lee
    • Annals of Occupational and Environmental Medicine
    • /
    • v.34
    • /
    • pp.34.1-34.13
    • /
    • 2022
  • Background: This study aimed to investigate the association between lead exposure and serum gamma-glutamyl transpeptidase (γGT) levels as an oxidative stress marker in male steelworkers. Methods: Data were collected during the annual health examination of workers in 2020. A total of 1,654 steelworkers were selected, and the variables for adjustment included the workers' general characteristics, lifestyle, and occupational characteristics. The association between the blood lead level (BLL) and serum γGT level was investigated by multiple linear and logistic regression analyses. Natural-log-transformed BLL and serum γGT values were used in the multiple linear regression analysis, and the tertile of BLL was used in the logistic regression analysis. Results: The geometric means of the participants' BLLs and serum γGT levels were 1.36 ㎍/dL and 27.72 IU/L, respectively. Their BLLs differed depending on age, body mass index (BMI), smoking status, drinking status, shift work, and working period, while their serum γGT levels differed depending on age, BMI, smoking status, drinking status, physical activity, and working period. In multiple linear regression analysis, the difference in models 1, 2, and 3 was significant, obtaining 0.326, 0.176, and 0.172 (all: p < 0.001), respectively. In the multiple linear regression analysis stratified according to drinking status, BMI, and age, BLLs were positively associated with serum γGT levels. Regarding the logistic regression analysis, the odds ratio of the third BLL tertile in models 1, 2, and 3 (for having an elevated serum γGT level, with the first tertile as the reference) was 2.74, 1.83, and 1.81, respectively. Conclusions: BLL was positively associated with serum γGT levels in male steelworkers even at low lead concentrations (< 5 ㎍/dL).
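
The tertile grouping and odds ratios in the abstract above can be illustrated with a minimal sketch. It computes a crude (unadjusted) odds ratio from a 2x2 table, which is a simplification: the study reports odds ratios from adjusted logistic regression models. All counts, values and function names below are hypothetical.

```python
import math

def tertile_labels(values):
    """Label each value 1, 2 or 3 by rank, giving three near-equal groups."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    labels = [0] * len(values)
    for rank, idx in enumerate(order):
        labels[idx] = 1 + (3 * rank) // len(values)
    return labels

def crude_odds_ratio(exposed_cases, exposed_noncases, ref_cases, ref_noncases):
    """Odds of the outcome in the exposed group divided by odds in the reference group."""
    return (exposed_cases * ref_noncases) / (exposed_noncases * ref_cases)

def coefficient_to_odds_ratio(beta):
    """A logistic regression coefficient maps to an odds ratio via exp(beta)."""
    return math.exp(beta)

# Made-up blood lead levels, for illustration only.
labels = tertile_labels([1.0, 0.5, 2.0, 1.5, 3.0, 0.8])

# Made-up counts: third tertile (exposed) versus first tertile (reference).
or_third_vs_first = crude_odds_ratio(30, 70, 15, 85)
print(labels, round(or_third_vs_first, 3))
```

An odds ratio above 1 for the third tertile means elevated γGT is more likely in the highest-exposure group than in the reference group, which is the direction of the reported results.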

Multivariate statistical analysis of the comparative antioxidant activity of the total phenolics and tannins in the water and ethanol extracts of dried goji berry (Lycium chinense) fruits

  • Kim, Joo-Shin;Kimm, Haklin Alex
    • Korean Journal of Food Science and Technology
    • /
    • v.51 no.3
    • /
    • pp.227-236
    • /
    • 2019
  • Antioxidant activity in water and ethanol extracts of dried Lycium chinense fruit, as a result of the total phenolic and tannin content, was measured using a number of chemical and biochemical assays for radical scavenging and inhibition of lipid peroxidation, and the analysis was extended by applying a bootstrapping statistical method. Previous statistical analyses mostly provided linear correlation and regression analyses between antioxidant activity and increasing concentrations of phenolics and tannins in a concentration-dependent mode. The present study showed that multivariate analysis, applying multiple regression analysis or regression planes, was more informative than linear regression analysis of the relationship between the concentration of individual components and antioxidant activity. In this paper, we present a multivariate analysis of the antioxidant activities of the combined phenolic and tannin contents in the water and ethanol extracts, which revealed observations that were not evident from the linear statistical analysis.