• Title/Summary/Keyword: Gaussian process regression

Search Result 76, Processing Time 0.019 seconds

Cumulative Sums of Residuals in GLMM and Its Implementation

  • Choi, DoYeon;Jeong, KwangMo
    • Communications for Statistical Applications and Methods
    • /
    • v.21 no.5
    • /
    • pp.423-433
    • /
    • 2014
  • Test statistics using cumulative sums of residuals have been widely used in various regression models including generalized linear models(GLM). Recently, Pan and Lin (2005) extended this testing procedure to the generalized linear mixed models(GLMM) having random effects, in which we encounter difficulties in computing the marginal likelihood that is expressed as an integral of random effects distribution. The Gaussian quadrature algorithm is commonly used to approximate the marginal likelihood. Many commercial statistical packages provide an option to apply this type of goodness-of-fit test in GLMs but available programs are very rare for GLMMs. We suggest a computational algorithm to implement the testing procedure in GLMMs by a freely accessible R package, and also illustrate through practical examples.

Bayesian Optimization Analysis of Containment-Venting Operation in a Boiling Water Reactor Severe Accident

  • Zheng, Xiaoyu;Ishikawa, Jun;Sugiyama, Tomoyuki;Maruyama, Yu
    • Nuclear Engineering and Technology
    • /
    • v.49 no.2
    • /
    • pp.434-441
    • /
    • 2017
  • Containment venting is one of several essential measures to protect the integrity of the final barrier of a nuclear reactor during severe accidents, by which the uncontrollable release of fission products can be avoided. The authors seek to develop an optimization approach to venting operations, from a simulation-based perspective, using an integrated severe accident code, THALES2/KICHE. The effectiveness of the containment-venting strategies needs to be verified via numerical simulations based on various settings of the venting conditions. The number of iterations, however, needs to be controlled to avoid cumbersome computational burden of integrated codes. Bayesian optimization is an efficient global optimization approach. By using a Gaussian process regression, a surrogate model of the "black-box" code is constructed. It can be updated simultaneously whenever new simulation results are acquired. With predictions via the surrogate model, upcoming locations of the most probable optimum can be revealed. The sampling procedure is adaptive. Compared with the case of pure random searches, the number of code queries is largely reduced for the optimum finding. One typical severe accident scenario of a boiling water reactor is chosen as an example. The research demonstrates the applicability of the Bayesian optimization approach to the design and establishment of containment-venting strategies during severe accidents.

Autism Spectrum Disorder Detection in Children using the Efficacy of Machine Learning Approaches

  • Tariq Rafiq;Zafar Iqbal;Tahreem Saeed;Yawar Abbas Abid;Muneeb Tariq;Urooj Majeed;Akasha
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.4
    • /
    • pp.179-186
    • /
    • 2023
  • For the future prosperity of any society, the sound growth of children is essential. Autism Spectrum Disorder (ASD) is a neurobehavioral disorder which has an impact on social interaction of autistic child and has an undesirable effect on his learning, speaking, and responding skills. These children have over or under sensitivity issues of touching, smelling, and hearing. Its symptoms usually appear in the child of 4- to 11-year-old but parents did not pay attention to it and could not detect it at early stages. The process to diagnose in recent time is clinical sessions that are very time consuming and expensive. To complement the conventional method, machine learning techniques are being used. In this way, it improves the required time and precision for diagnosis. We have applied TFLite model on image based dataset to predict the autism based on facial features of child. Afterwards, various machine learning techniques were trained that includes Logistic Regression, KNN, Gaussian Naïve Bayes, Random Forest and Multi-Layer Perceptron using Autism Spectrum Quotient (AQ) dataset to improve the accuracy of the ASD detection. On image based dataset, TFLite model shows 80% accuracy and based on AQ dataset, we have achieved 100% accuracy from Logistic Regression and MLP models.

Advance Neuro-Fuzzy Modeling Using a New Clustering Algorithm (새로운 클러스터링 알고리듬을 적용한 향상된 뉴로-퍼지 모델링)

  • 김승석;김성수;유정웅
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.7
    • /
    • pp.536-543
    • /
    • 2004
  • In this paper, we proposed a new method of modeling a neuro-fuzzy system using a hybrid clustering algorithm. The initial parameters and the number of clusters of the proposed system are optimally chosen simultaneously with respect to the process of regression, which is a unique characteristics of the proposed system. The proposed algorithm presented in this work improves the overall performance of the proposed a neuro-fuzzy system by choosing a proper number of clusters adaptively according the characteristics of given data. The process of clustering is performed by deciding on the number of classes, which yields the property of convergence of the system. In experiments, the superiority of the proposed neuro-fuzzy system is demonstrated, especially the process of optimizing parameters and clustering of learning speed.

TGC-based Fish Growth Estimation Model using Gaussian Process Regression Approach (가우시안 프로세스 회귀를 통한 열 성장 계수 기반의 어류 성장 예측 모델)

  • Juhyoung Sung;Sungyoon Cho;Da-Eun Jung;Jongwon Kim;Jeonghwan Park;Kiwon Kwon;Young Myoung Ko
    • Journal of Internet Computing and Services
    • /
    • v.24 no.1
    • /
    • pp.61-69
    • /
    • 2023
  • Recently, as the fishery resources are depleted, expectations for productivity improvement by 'rearing fishery' in land farms are greatly rising. In the case of land farms, unlike ocean environments, it is easy to control and manage environmental and breeding factors, and has the advantage of being able to adjust production according to the production plan. On the other hand, unlike in the natural environment, there is a disadvantage in that operation costs may significantly increase due to the artificial management for fish growth. Therefore, profit maximization can be pursued by efficiently operating the farm in accordance with the planned target shipment. In order to operate such an efficient farm and nurture fish, an accurate growth prediction model according to the target fish species is absolutely required. Most of the growth prediction models are mainly numerical results based on statistical analysis using farm data. In this paper, we present a growth prediction model from a stochastic point of view to overcome the difficulties in securing data and the difficulty in providing quantitative expected values for inaccuracies that existing growth prediction models from a statistical point of view may have. For a stochastic approach, modeling is performed by introducing a Gaussian process regression method based on water temperature, which is the most important factor in positive growth. From the corresponding results, it is expected that it will be able to provide reference values for more efficient farm operation by simultaneously providing the average value of the predicted growth value at a specific point in time and the confidence interval for that value.

Design of Self-Organizing Networks with Competitive Fuzzy Polynomial Neuron (경쟁적 퍼지 다항식 뉴론을 가진 자기 구성 네트워크의 설계)

  • Park, Ho-Sung;Oh, Sung-Kwun;Kim, Hyun-Ki
    • Proceedings of the KIEE Conference
    • /
    • 2000.11d
    • /
    • pp.800-802
    • /
    • 2000
  • In this paper, we propose the Self-Organizing Networks(SON) based on competitive Fuzzy Polynomial Neuron(FPN) for the optimal design of nonlinear process system. The SON architectures consist of layers with activation nodes based on fuzzy inference rules. Here each activation node is presented as FPN which includes either the simplified or regression Polynomial fuzzy inference rules. The proposed SON is a network resulting from the fusion of the Polynomial Neural Networks(PNN) and a fuzzy inference system. The conclusion part of the rules, especially the regression polynomial uses several types of high-order polynomials such as liner, quadratic and modified quadratic. As the premise part of the rules, both triangular and Gaussian-like membership functions are studied. Chaotic time series data used to evaluate the performance of our proposed model.

  • PDF

Bayesian ordinal probit semiparametric regression models: KNHANES 2016 data analysis of the relationship between smoking behavior and coffee intake (베이지안 순서형 프로빗 준모수 회귀 모형 : 국민건강영양조사 2016 자료를 통한 흡연양태와 커피섭취 간의 관계 분석)

  • Lee, Dasom;Lee, Eunji;Jo, Seogil;Choi, Taeryeon
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.25-46
    • /
    • 2020
  • This paper presents ordinal probit semiparametric regression models using Bayesian Spectral Analysis Regression (BSAR) method. Ordinal probit regression is a way of modeling ordinal responses - usually more than two categories - by connecting the probability of falling into each category explained by a combination of available covariates using a probit (an inverse function of normal cumulative distribution function) link. The Bayesian probit model facilitates posterior sampling by bringing a latent variable following normal distribution, therefore, the responses are categorized by the cut-off points according to values of latent variables. In this paper, we extend the latent variable approach to a semiparametric model for the Bayesian ordinal probit regression with nonparametric functions using a spectral representation of Gaussian processes based BSAR method. The latent variable is decomposed into a parametric component and a nonparametric component with or without a shape constraint for modeling ordinal responses and predicting outcomes more flexibly. We illustrate the proposed methods with simulation studies in comparison with existing methods and real data analysis applied to a Korean National Health and Nutrition Examination Survey (KNHANES) 2016 for investigating nonparametric relationship between smoking behavior and coffee intake.

Evaluation of a Thermal Conductivity Prediction Model for Compacted Clay Based on a Machine Learning Method (기계학습법을 통한 압축 벤토나이트의 열전도도 추정 모델 평가)

  • Yoon, Seok;Bang, Hyun-Tae;Kim, Geon-Young;Jeon, Haemin
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.41 no.2
    • /
    • pp.123-131
    • /
    • 2021
  • The buffer is a key component of an engineered barrier system that safeguards the disposal of high-level radioactive waste. Buffers are located between disposal canisters and host rock, and they can restrain the release of radionuclides and protect canisters from the inflow of ground water. Since considerable heat is released from a disposal canister to the surrounding buffer, the thermal conductivity of the buffer is a very important parameter in the entire disposal safety. For this reason, a lot of research has been conducted on thermal conductivity prediction models that consider various factors. In this study, the thermal conductivity of a buffer is estimated using the machine learning methods of: linear regression, decision tree, support vector machine (SVM), ensemble, Gaussian process regression (GPR), neural network, deep belief network, and genetic programming. In the results, the machine learning methods such as ensemble, genetic programming, SVM with cubic parameter, and GPR showed better performance compared with the regression model, with the ensemble with XGBoost and Gaussian process regression models showing best performance.

Development of On-line Wrinkle Measurement System Using Machine Vision (머신 비젼을 이용한 실시간 링클 측정 시스템 개발)

  • Shin, Dong-Keun;To, Hoang-Minh;Ko, Sung-Lim
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.32 no.3
    • /
    • pp.274-279
    • /
    • 2008
  • Roll to roll (R2R) manufacturing process, also known as 'web processing', has been tried for producing electronic devices on a flexible plastic or metal foil. To increase the performance and productivity the R2R process, effective control and on-line supervision for web quality becomes very important. Wrinkle is one of the defects, which is incurred due to compressive stresses. A system for on-line measurement of wrinkle is developed using area scan camera and machine vision laser. The 2D image, obtained by area scan camera, is produced by Gaussian regression method to characterize the wrinkle on a transparent web. The experiment proves that 0.3mm wrinkle height can be measured successfully with 74fps.

Machine learning models for predicting the compressive strength of concrete containing nano silica

  • Garg, Aman;Aggarwal, Paratibha;Aggarwal, Yogesh;Belarbi, M.O.;Chalak, H.D.;Tounsi, Abdelouahed;Gulia, Reeta
    • Computers and Concrete
    • /
    • v.30 no.1
    • /
    • pp.33-42
    • /
    • 2022
  • Experimentally predicting the compressive strength (CS) of concrete (for a mix design) is a time-consuming and laborious process. The present study aims to propose surrogate models based on Support Vector Machine (SVM) and Gaussian Process Regression (GPR) machine learning techniques, which can predict the CS of concrete containing nano-silica. Content of cement, aggregates, nano-silica and its fineness, water-binder ratio, and the days at which strength has to be predicted are the input variables. The efficiency of the models is compared in terms of Correlation Coefficient (CC), Root Mean Square Error (RMSE), Variance Account For (VAF), Nash-Sutcliffe Efficiency (NSE), and RMSE to observation's standard deviation ratio (RSR). It has been observed that the SVM outperforms GPR in predicting the CS of the concrete containing nano-silica.