• Title/Summary/Keyword: linear Bayes method

Search Result 32, Processing Time 0.021 seconds

Maritime radar display unit based on PC for safe ship navigation

  • Bae, Jin-Ho;Lee, Chong-Hyun;Hwang, Chang-Ku
    • International Journal of Ocean System Engineering
    • /
    • v.1 no.1
    • /
    • pp.52-59
    • /
    • 2011
  • A prototype radar display unit was implemented using inexpensive off-the-shelf components, including a nonlinear estimation algorithm for the target tracking in a clutter environment. Two custom designed boards; an analog signal processing board and a DSP board, can be plugged into an expansion slot of a personal computer (PC) to form a maritime radar display unit. Our system provided all the functionality specified in the International Maritime Organization (IMO) resolution A422(XI). The analog signal processing board was used for A/D conversion as well as rain and sea clutter suppression. The main functions of the DSP board were scan conversion and video overlay operations. A host PC was used to run the tracking algorithm of targets in clutter, using the discrete-time Bayes optimal (nonlinear, and non-Gaussian) estimation method, and the graphic user interface (GUI) software for Automatic Radar Plotting Aid (ARPA). The proposed tracking method recursively found the entire probability density function of the target position and velocity by converting into linear convolution operations.

Development of Evaluation Model for Black Spot Improvement Priorities by using Emperical Bayes Method (EB기법을 이용한 사고잦은 곳 개선사업 우선순위 판정기법 개발)

  • Jeong, Seong-Bong;Hwang, Bo-Hui;Seong, Nak-Mun;Lee, Seon-Ha
    • Journal of Korean Society of Transportation
    • /
    • v.27 no.3
    • /
    • pp.81-90
    • /
    • 2009
  • The safety management of a road network comprises four basic inter-related components:identification of sites(black spot) requiring safety investigation, diagnosis of safety problems, selection of feasible treatments for potential treatment candidates, and prioritization of treatments given limited budgets(Persaud, 2001). Identification process of selecting black spot is very important for efficient investigation of sites. In this study, the accident prediction model for EB method was developed by using accident data and geometric conditions of black spots selected from four-leg signalized intersections in In-cheon City for three years (2004-2006). In addition, by comparing the rank nomination technique using EB method to that by using accident counts, we managed to show the problems which the existing method have and the necessity for developing rational prediction model. As a result, in terms of total number of accidents, both the counts predicted by existing non-linear regression model and that by EB method have high good of fitness, but EB method, considering both the accident counts by sites and total number of accident, has better good of fitness than non-linear poison model. According to the result of the comparison of ranks nominated for treatment between two methods, the rank for treatment of almost sites does not change but SeoHae intersection and a few other intersections have significant changes in their rank. This shows that, with the technique proposed in the study, the RTM problem caused by using real accident counts can be overcome.

On a Bayesian Estimation of Multivariate Regression Models with Constrained Coefficient Matrix

  • Kim, Hea-Jung
    • Journal of Korean Society for Quality Management
    • /
    • v.26 no.4
    • /
    • pp.151-165
    • /
    • 1998
  • Consider the linear multivariate regression model $Y=X_1B_1+X_2B_2+U$, where Vec(U)~N(0, $\sum \bigotimes I_N$). This paper is concerned with Bayes infreence of the model when it is suspected that the elements of $B_2$ are constrained in the form of intervals. The use of the Gibbs sampler as a method for calculating Bayesian marginal posterior desnities of the parameters under a generalized conjugate prior is developed. It is shown that the a, pp.oach is straightforward to specify distributionally and to implement computationally, with output readily adopted for required inference summaries. The method developed is a, pp.ied to a real problem.

  • PDF

Effective Computation for Odds Ratio Estimation in Nonparametric Logistic Regression

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.4
    • /
    • pp.713-722
    • /
    • 2009
  • The estimation of odds ratio and corresponding confidence intervals for case-control data have been done by traditional generalized linear models which assumed that the logarithm of odds ratio is linearly related to risk factors. We adapt a lower-dimensional approximation of Gu and Kim (2002) to provide a faster computation in nonparametric method for the estimation of odds ratio by allowing flexibility of the estimating function and its Bayesian confidence interval under the Bayes model for the lower-dimensional approximations. Simulation studies showed that taking larger samples with the lower-dimensional approximations help to improve the smoothing spline estimates of odds ratio in this settings. The proposed method can be used to analyze case-control data in medical studies.

Bayesian forecasting approach for structure response prediction and load effect separation of a revolving auditorium

  • Ma, Zhi;Yun, Chung-Bang;Shen, Yan-Bin;Yu, Feng;Wan, Hua-Ping;Luo, Yao-Zhi
    • Smart Structures and Systems
    • /
    • v.24 no.4
    • /
    • pp.507-524
    • /
    • 2019
  • A Bayesian dynamic linear model (BDLM) is presented for a data-driven analysis for response prediction and load effect separation of a revolving auditorium structure, where the main loads are self-weight and dead loads, temperature load, and audience load. Analyses are carried out based on the long-term monitoring data for static strains on several key members of the structure. Three improvements are introduced to the ordinary regression BDLM, which are a classificatory regression term to address the temporary audience load effect, improved inference for the variance of observation noise to be updated continuously, and component discount factors for effective load effect separation. The effects of those improvements are evaluated regarding the root mean square errors, standard deviations, and 95% confidence intervals of the predictions. Bayes factors are used for evaluating the probability distributions of the predictions, which are essential to structural condition assessments, such as outlier identification and reliability analysis. The performance of the present BDLM has been successfully verified based on the simulated data and the real data obtained from the structural health monitoring system installed on the revolving structure.

A Bayesian test for the first-order autocorrelations in regression analysis (회귀모형 오차항의 1차 자기상관에 대한 베이즈 검정법)

  • 김혜중;한성실
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.1
    • /
    • pp.97-111
    • /
    • 1998
  • This paper suggests a Bayesian method for testing first-order markov correlation among linear regression disturbances. As a Bayesian test criterion, Bayes factor is derived in the form of generalized Savage-Dickey density ratio that is easily estimated by means of posterior simulation via Gibbs sampling scheme. Performance of the Bayesian test is evaluated and examined based upon a Monte Carlo experiment and an empirical data analysis. Efficiency of the posterior simulation is also examined.

  • PDF

Bayesian curve-fitting with radial basis functions under functional measurement error model

  • Hwang, Jinseub;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.749-754
    • /
    • 2015
  • This article presents Bayesian approach to regression splines with knots on a grid of equally spaced sample quantiles of the independent variables under functional measurement error model.We consider small area model by using penalized splines of non-linear pattern. Specifically, in a basis functions of the regression spline, we use radial basis functions. To fit the model and estimate parameters we suggest a hierarchical Bayesian framework using Markov Chain Monte Carlo methodology. Furthermore, we illustrate the method in an application data. We check the convergence by a potential scale reduction factor and we use the posterior predictive p-value and the mean logarithmic conditional predictive ordinate to compar models.

A Halal Food Classification Framework Using Machine Learning Method for Enhancing Muslim Tourists (무슬림 관광객 증대를 위한 머신러닝 기반의 할랄푸드 분류 프레임워크)

  • Kim, Sun-A;Kim, Jeong-Won;Won, Dong-Yeon;Choi, Yerim
    • The Journal of Information Systems
    • /
    • v.26 no.3
    • /
    • pp.273-293
    • /
    • 2017
  • Purpose The purpose of this study is to introduce a framework that helps Muslims to determine whether a food can be consumed. It can complement existing Halal food classification services having a difficulty of constructing Halal food database. Design/methodology/approach The proposed framework includes two components. First, OCR(Optical Character Recognition) technique is utilized to read the food additive information. Second, machine learning methods were used to trained and predicted to determine whether a food can be consumed using the provided information. Findings Among the compared machine learning methods, SVM(Support Vector Machine), DT(Decision Tree), and NB(Naive Bayes), SVM with linear kernel and DT had excellent performance in the Halal food classification. The framework which adopting the proposed framework will enhance the tourism experiences of Muslim tourists who consider keeping the Islamic law most importantly. Furthermore, it can eventually contribute to the enhancement of smart tourism ecosystem.

Socioeconomic Predictors of Diabetes Mortality in Japan: An Ecological Study Using Municipality-specific Data

  • Okui, Tasuku
    • Journal of Preventive Medicine and Public Health
    • /
    • v.54 no.5
    • /
    • pp.352-359
    • /
    • 2021
  • Objectives: The aim of this study was to examine the geographic distribution of diabetes mortality in Japan and identify socioeconomic factors affecting differences in municipality-specific diabetes mortality. Methods: Diabetes mortality data by year and municipality from 2013 to 2017 were extracted from Japanese Vital Statistics, and the socioeconomic characteristics of municipalities were obtained from government statistics. We calculated the standardized mortality ratio (SMR) of diabetes for each municipality using the empirical Bayes method and represented geographic differences in SMRs in a map of Japan. Multiple linear regression was conducted to identify the socioeconomic factors affecting differences in SMR. Statistically significant socioeconomic factors were further assessed by calculating the relative risk of mortality of quintiles of municipalities classified according to the degree of each socioeconomic factor using Poisson regression analysis. Results: The geographic distribution of diabetes mortality differed by gender. Of the municipality-specific socioeconomic factors, high rates of single-person households and unemployment and a high number of hospital beds were associated with a high SMR for men. High rates of fatherless households and blue-collar workers were associated with a high SMR for women, while high taxable income per-capita income and total population were associated with low SMR for women. Quintile analysis revealed a complex relationship between taxable income and mortality for women. The mortality risk of quintiles with the highest and lowest taxable per-capita income was significantly lower than that of the middle-income quintile. Conclusions: Socioeconomic factors of municipalities in Japan were found to affect geographic differences in diabetes mortality.

The Effect of Meta-Features of Multiclass Datasets on the Performance of Classification Algorithms (다중 클래스 데이터셋의 메타특징이 판별 알고리즘의 성능에 미치는 영향 연구)

  • Kim, Jeonghun;Kim, Min Yong;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.1
    • /
    • pp.23-45
    • /
    • 2020
  • Big data is creating in a wide variety of fields such as medical care, manufacturing, logistics, sales site, SNS, and the dataset characteristics are also diverse. In order to secure the competitiveness of companies, it is necessary to improve decision-making capacity using a classification algorithm. However, most of them do not have sufficient knowledge on what kind of classification algorithm is appropriate for a specific problem area. In other words, determining which classification algorithm is appropriate depending on the characteristics of the dataset was has been a task that required expertise and effort. This is because the relationship between the characteristics of datasets (called meta-features) and the performance of classification algorithms has not been fully understood. Moreover, there has been little research on meta-features reflecting the characteristics of multi-class. Therefore, the purpose of this study is to empirically analyze whether meta-features of multi-class datasets have a significant effect on the performance of classification algorithms. In this study, meta-features of multi-class datasets were identified into two factors, (the data structure and the data complexity,) and seven representative meta-features were selected. Among those, we included the Herfindahl-Hirschman Index (HHI), originally a market concentration measurement index, in the meta-features to replace IR(Imbalanced Ratio). Also, we developed a new index called Reverse ReLU Silhouette Score into the meta-feature set. Among the UCI Machine Learning Repository data, six representative datasets (Balance Scale, PageBlocks, Car Evaluation, User Knowledge-Modeling, Wine Quality(red), Contraceptive Method Choice) were selected. The class of each dataset was classified by using the classification algorithms (KNN, Logistic Regression, Nave Bayes, Random Forest, and SVM) selected in the study. For each dataset, we applied 10-fold cross validation method. 10% to 100% oversampling method is applied for each fold and meta-features of the dataset is measured. The meta-features selected are HHI, Number of Classes, Number of Features, Entropy, Reverse ReLU Silhouette Score, Nonlinearity of Linear Classifier, Hub Score. F1-score was selected as the dependent variable. As a result, the results of this study showed that the six meta-features including Reverse ReLU Silhouette Score and HHI proposed in this study have a significant effect on the classification performance. (1) The meta-features HHI proposed in this study was significant in the classification performance. (2) The number of variables has a significant effect on the classification performance, unlike the number of classes, but it has a positive effect. (3) The number of classes has a negative effect on the performance of classification. (4) Entropy has a significant effect on the performance of classification. (5) The Reverse ReLU Silhouette Score also significantly affects the classification performance at a significant level of 0.01. (6) The nonlinearity of linear classifiers has a significant negative effect on classification performance. In addition, the results of the analysis by the classification algorithms were also consistent. In the regression analysis by classification algorithm, Naïve Bayes algorithm does not have a significant effect on the number of variables unlike other classification algorithms. This study has two theoretical contributions: (1) two new meta-features (HHI, Reverse ReLU Silhouette score) was proved to be significant. (2) The effects of data characteristics on the performance of classification were investigated using meta-features. The practical contribution points (1) can be utilized in the development of classification algorithm recommendation system according to the characteristics of datasets. (2) Many data scientists are often testing by adjusting the parameters of the algorithm to find the optimal algorithm for the situation because the characteristics of the data are different. In this process, excessive waste of resources occurs due to hardware, cost, time, and manpower. This study is expected to be useful for machine learning, data mining researchers, practitioners, and machine learning-based system developers. The composition of this study consists of introduction, related research, research model, experiment, conclusion and discussion.