• 제목/요약/키워드: REGRESSION ANALYSIS

Search Result 24,158, Processing Time 0.047 seconds

Cloud Removal Using Gaussian Process Regression for Optical Image Reconstruction

  • Park, Soyeon;Park, No-Wook
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.4
    • /
    • pp.327-341
    • /
    • 2022
  • Cloud removal is often required to construct time-series sets of optical images for environmental monitoring. In regression-based cloud removal, the selection of an appropriate regression model and the impact analysis of the input images significantly affect the prediction performance. This study evaluates the potential of Gaussian process (GP) regression for cloud removal and also analyzes the effects of cloud-free optical images and spectral bands on prediction performance. Unlike other machine learning-based regression models, GP regression provides uncertainty information and automatically optimizes hyperparameters. An experiment using Sentinel-2 multi-spectral images was conducted for cloud removal in the two agricultural regions. The prediction performance of GP regression was compared with that of random forest (RF) regression. Various combinations of input images and multi-spectral bands were considered for quantitative evaluations. The experimental results showed that using multi-temporal images with multi-spectral bands as inputs achieved the best prediction accuracy. Highly correlated adjacent multi-spectral bands and temporally correlated multi-temporal images resulted in an improved prediction accuracy. The prediction performance of GP regression was significantly improved in predicting the near-infrared band compared to that of RF regression. Estimating the distribution function of input data in GP regression could reflect the variations in the considered spectral band with a broader range. In particular, GP regression was superior to RF regression for reproducing structural patterns at both sites in terms of structural similarity. In addition, uncertainty information provided by GP regression showed a reasonable similarity to prediction errors for some sub-areas, indicating that uncertainty estimates may be used to measure the prediction result quality. These findings suggest that GP regression could be beneficial for cloud removal and optical image reconstruction. In addition, the impact analysis results of the input images provide guidelines for selecting optimal images for regression-based cloud removal.

Application of Crossover Analysis-logistic Regression in the Assessment of Gene- environmental Interactions for Colorectal Cancer

  • Wu, Ya-Zhou;Yang, Huan;Zhang, Ling;Zhang, Yan-Qi;Liu, Ling;Yi, Dong;Cao, Jia
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.5
    • /
    • pp.2031-2037
    • /
    • 2012
  • Background: Analysis of gene-gene and gene-environment interactions for complex multifactorial human disease faces challenges regarding statistical methodology. One major difficulty is partly due to the limitations of parametric-statistical methods for detection of gene effects that are dependent solely or partially on interactions with other genes or environmental exposures. Based on our previous case-control study in Chongqing of China, we have found increased risk of colorectal cancer exists in individuals carrying a novel homozygous TT at locus rs1329149 and known homozygous AA at locus rs671. Methods: In this study, we proposed statistical method-crossover analysis in combination with logistic regression model, to further analyze our data and focus on assessing gene-environmental interactions for colorectal cancer. Results: The results of the crossover analysis showed that there are possible multiplicative interactions between loci rs671 and rs1329149 with alcohol consumption. Multifactorial logistic regression analysis also validated that loci rs671 and rs1329149 both exhibited a multiplicative interaction with alcohol consumption. Moreover, we also found additive interactions between any pair of two factors (among the four risk factors: gene loci rs671, rs1329149, age and alcohol consumption) through the crossover analysis, which was not evident on logistic regression. Conclusions: In conclusion, the method based on crossover analysis-logistic regression is successful in assessing additive and multiplicative gene-environment interactions, and in revealing synergistic effects of gene loci rs671 and rs1329149 with alcohol consumption in the pathogenesis and development of colorectal cancer.

The disparity profile of working conditions by the type of employment according to the economic sectors and occupations (임금근로자의 고용형태별 유해요인 노출 격차의 업종별 직종별 분포 특성)

  • Rhee, Kyung-Yong;Kim, Ki-Sik;Yoon, Young-Shik
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.4
    • /
    • pp.197-207
    • /
    • 2013
  • OSHA(Occupational Safety and Health Act) generally regulates employer's business principles in the workplace to maintain safety environment. This act has the fundamental purpose to protect employee's safety and health in the workplace by reducing industrial accidents. Authors tried to investigate the correlation between 'occupational injuries and illnesses' and level of regulation compliance using Survey on Current Status of Occupational Safety & Health data by the various statistical methods, such as generalized regression analysis, logistic regression analysis and poison regression analysis in order to compare the results of those methods. The results have shown that the significant affecting compliance factors were different among those statistical methods. This means that specific interpretation should be considered based on each statistical method. In the future, relevant statistical technique will be developed considering the distribution type of occupational injuries.

A Study on the Weight Estimation Model of Floating Offshore Structures using the Non-linear Regression Analysis (비선형 회귀 분석을 이용한 부유식 해양 구조물의 중량 추정 모델 연구)

  • Seo, Seong-Ho;Roh, Myung-Il;Shin, Hyunkyoung
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.51 no.6
    • /
    • pp.530-538
    • /
    • 2014
  • The weight estimation of floating offshore structures such as FPSO, TLP, semi-Submersibles, Floating Offshore Wind Turbines etc. in the preliminary design, is one of important measures of both construction cost and basic performance. Through both literature investigation and internet search, the weight data of floating offshore structures such as FPSO and TLP was collected. In this study, the weight estimation model was suggested for FPSO. The weight estimation model using non-linear regression analysis was established by fixing independent variables based on this data and the multiple regression analysis was introduced into the weight estimation model. Its reliability was within 4% of error rate.

Simplification of PMV through Multiple Regression Analysis (다중회귀분석을 통한 PMV 모델의 단순화)

  • Moon, Yong-Jun;Noh, Kwang-Chul;Oh, Myung-Do
    • Korean Journal of Air-Conditioning and Refrigeration Engineering
    • /
    • v.19 no.11
    • /
    • pp.761-769
    • /
    • 2007
  • The purpose of this study is to present a simplified model of predicted mean vote (PMV) using multiple regression analysis. We performed the experiments and the numerical calculations in the lecture room during summer and winter to simplify PMV. And the multiple regression analysis on PMV was conducted to estimate the contribution of independent variables toward PMV. From the multiple regression analysis, we found that the effect of independent variables on PMV followed in order, clo value>air temperatur>air velocity>mean radiant temperature>relative humidity. And the simplified PMV was proposed through a few assumptions and then was compared with original PMV. They had a good agreement with each other. Additionally, we compared the simplified PMV with EDT. We expected that the simplified PMV can be more useful than EDT to evaluate the thermal comfort in the place, where radiation is dominant. But the comfort range of the simplified PMV should be adjusted to predict the exact thermal comfort in the future.

Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances (순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측)

  • Lee, Bom-Sock;Kim, Sung-Young
    • Journal of the Korean Institute of Gas
    • /
    • v.11 no.3
    • /
    • pp.13-18
    • /
    • 2007
  • The multivariate statistical analysis, using the multiple linear regression(MLR), have been applied to analyze and predict the flash points of binary systems. Prediction for the flash points of flammable substances is important for the examination of the fire and explosion hazards in the chemical process design. In this paper, the flash points are predicted by MLR based on the physical properties of pure substances and the experimental flash points data. The results of regression and prediction by MLR are compared with the values calculated by Raoult's law and Van Laar equation.

  • PDF

Implementation of a Regression Analysis System for the Control of Supplying Halibuts (넙치 공급량 조절을 위한 회귀분석 시스템 구현)

  • Ahn, Jinhyun;Kang, Jungwoon;Kim, Mincheol;Park, So-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.2
    • /
    • pp.321-324
    • /
    • 2022
  • The Korean halibut farming industry suffer from price instability and demand decrease due to various environmental and social issues. It is urgent to predict the appropriate amount of halibut production. However, it is not easy for employments working in the halibut farming industry to handle statistical tools in order to perform the prediction. In this paper, we implemented a Excel-based regression analysis tool that allows users to get a regression analysis result by just entering historical data in a sheet. Our tool will reduce workloads of employments working in the halibut farming industry by enabling them to perform a regression analysis with Excel on-the-fly. This study expect that by using the tool the halibut farming industry cope actively with the real-time change in the industry.

Hybrid Fuzzy Least Squares Support Vector Machine Regression for Crisp Input and Fuzzy Output

  • Shim, Joo-Yong;Seok, Kyung-Ha;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.2
    • /
    • pp.141-151
    • /
    • 2010
  • Hybrid fuzzy regression analysis is used for integrating randomness and fuzziness into a regression model. Least squares support vector machine(LS-SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate hybrid fuzzy linear and nonlinear regression models with crisp inputs and fuzzy output using weighted fuzzy arithmetic(WFA) and LS-SVM. LS-SVM allows us to perform fuzzy nonlinear regression analysis by constructing a fuzzy linear regression function in a high dimensional feature space. The proposed method is not computationally expensive since its solution is obtained from a simple linear equation system. In particular, this method is a very attractive approach to modeling nonlinear data, and is nonparametric method in the sense that we do not have to assume the underlying model function for fuzzy nonlinear regression model with crisp inputs and fuzzy output. Experimental results are then presented which indicate the performance of this method.

Development of Medical Cost Prediction Model Based on the Machine Learning Algorithm (머신러닝 알고리즘 기반의 의료비 예측 모델 개발)

  • Han Bi KIM;Dong Hoon HAN
    • Journal of Korea Artificial Intelligence Association
    • /
    • v.1 no.1
    • /
    • pp.11-16
    • /
    • 2023
  • Accurate hospital case modeling and prediction are crucial for efficient healthcare. In this study, we demonstrate the implementation of regression analysis methods in machine learning systems utilizing mathematical statics and machine learning techniques. The developed machine learning model includes Bayesian linear, artificial neural network, decision tree, decision forest, and linear regression analysis models. Through the application of these algorithms, corresponding regression models were constructed and analyzed. The results suggest the potential of leveraging machine learning systems for medical research. The experiment aimed to create an Azure Machine Learning Studio tool for the speedy evaluation of multiple regression models. The tool faciliates the comparision of 5 types of regression models in a unified experiment and presents assessment results with performance metrics. Evaluation of regression machine learning models highlighted the advantages of boosted decision tree regression, and decision forest regression in hospital case prediction. These findings could lay the groundwork for the deliberate development of new directions in medical data processing and decision making. Furthermore, potential avenues for future research may include exploring methods such as clustering, classification, and anomaly detection in healthcare systems.

Research on the thermal deformation model ins using by regression analysis (회귀분석을 이용한 열변형 오차 모델링에 관한 연구)

  • 김희술;고태조;김선호;김형식;정종운
    • Proceedings of the Korean Society of Machine Tool Engineers Conference
    • /
    • 2002.10a
    • /
    • pp.47-52
    • /
    • 2002
  • There are many factors in machine tool error. These are thermal deformation, geometric error, machine's part assembly error, error caused by tool bending. Among them thermal error is 70% of total error of machine tool . Prediction of thermal error is very difficult. because of nonlinear tendency of machine tool deformation. In this study, we tried thermal error prediction by using multi regression analysis.

  • PDF