• Title/Summary/Keyword: Bayesian Method

Search Result 1,146, Processing Time 0.035 seconds

A Method for Spam Message Filtering Based on Lifelong Machine Learning (Lifelong Machine Learning 기반 스팸 메시지 필터링 방법)

  • Ahn, Yeon-Sun;Jeong, Ok-Ran
    • Journal of IKEEE
    • /
    • v.23 no.4
    • /
    • pp.1393-1399
    • /
    • 2019
  • With the rapid growth of the Internet, millions of indiscriminate advertising SMS are sent every day because of the convenience of sending and receiving data. Although we still use methods to block spam words manually, we have been actively researching how to filter spam in a various ways as machine learning emerged. However, spam words and patterns are constantly changing to avoid being filtered, so existing machine learning mechanisms cannot detect or adapt to new words and patterns. Recently, the concept of Lifelong Learning emerged to overcome these limitations, using existing knowledge to keep learning new knowledge continuously. In this paper, we propose a method of spam filtering system using ensemble techniques of naive bayesian which is most commonly used in document classification and LLML(Lifelong Machine Learning). We validate the performance of lifelong learning by applying the model ELLA and the Naive Bayes most commonly used in existing spam filters.

Complex Segregation Analysis of Total Milk Yield in Churra Dairy Ewes

  • Ilahi, Houcine;Othmane, M. Houcine
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.24 no.3
    • /
    • pp.330-335
    • /
    • 2011
  • The mode of inheritance of total milk yield and its genetic parameters were investigated in Churra dairy sheep through segregation analyses using a Monte Carlo Markov Chains (MCMC) method. Data which consisted of 7,126 lactations belonging to 5,154 ewes were collected between 1999 and 2002 from 15 Spanish Churra dairy flocks. A postulated major gene was assumed to be additive and priors used for variance components were uniform. Based on 50 000 Gibbs samples from ten replicates chains of 100,000 cycles, the estimated marginal posterior means${\pm}$posterior standard deviations of variance components of milk yield were $23.17{\pm}18.42$, $65.20{\pm}25.05$, $120.40{\pm}42.12$ and $420.83{\pm}40.26$ for major gene variance ($\sigma_G^2$), polygenic variance ($\sigma_u^2$), permanent environmental variance ($\sigma_{pe}^2$) and error variance ($\sigma_e^2$), respectively. The results of this study showed the postulated major locus was not significant, and the 95% highest posterior density regions ($HPDs_{95%}$) of most major gene parameters included 0, and particularly for the major gene variance. The estimated transmission probabilities for the 95% highest posterior density regions ($HPDs_{95%}$) were overlapped. These results indicated that segregation of a major gene was unlikely and that the mode of inheritance of total milk yield in Churra dairy sheep is purely polygenic. Based on 50,000 Gibbs samples from ten replicates chains of 100,000 cycles, the estimated polygenic heritability and repeatability were $h^2=0.20{\pm}0.05$ and r=$0.34{\pm}0.06$, respectively.

Optimal Facial Emotion Feature Analysis Method based on ASM-LK Optical Flow (ASM-LK Optical Flow 기반 최적 얼굴정서 특징분석 기법)

  • Ko, Kwang-Eun;Park, Seung-Min;Park, Jun-Heong;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.4
    • /
    • pp.512-517
    • /
    • 2011
  • In this paper, we propose an Active Shape Model (ASM) and Lucas-Kanade (LK) optical flow-based feature extraction and analysis method for analyzing the emotional features from facial images. Considering the facial emotion feature regions are described by Facial Action Coding System, we construct the feature-related shape models based on the combination of landmarks and extract the LK optical flow vectors at each landmarks based on the centre pixels of motion vector window. The facial emotion features are modelled by the combination of the optical flow vectors and the emotional states of facial image can be estimated by the probabilistic estimation technique, such as Bayesian classifier. Also, we extract the optimal emotional features that are considered the high correlation between feature points and emotional states by using common spatial pattern (CSP) analysis in order to improvise the operational efficiency and accuracy of emotional feature extraction process.

Robust multiple imputation method for missings with boundary and outliers (한계와 이상치가 있는 결측치의 로버스트 다중대체 방법)

  • Park, Yousung;Oh, Do Young;Kwon, Tae Yeon
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.6
    • /
    • pp.889-898
    • /
    • 2019
  • The problem of missing value imputation for variables in surveys that include item missing becomes complicated if outliers and logical boundary conditions between other survey items cannot be ignored. If there are outliers and boundaries in a variable including missing values, imputed values based on previous regression-based imputation methods are likely to be biased and not meet boundary conditions. In this paper, we approach these difficulties in imputation by combining various robust regression models and multiple imputation methods. Through a simulation study on various scenarios of outliers and boundaries, we find and discuss the optimal combination of robust regression and multiple imputation method.

Fast Depth Video Coding with Intra Prediction on VVC

  • Wei, Hongan;Zhou, Binqian;Fang, Ying;Xu, Yiwen;Zhao, Tiesong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.7
    • /
    • pp.3018-3038
    • /
    • 2020
  • In the stereoscopic or multiview display, the depth video illustrates visual distances between objects and camera. To promote the computational efficiency of depth video encoder, we exploit the intra prediction of depth videos under Versatile Video Coding (VVC) and observe a diverse distribution of intra prediction modes with different coding unit sizes. We propose a hybrid scheme to further boost fast depth video coding. In the first stage, we adaptively predict the HADamard (HAD) costs of intra prediction modes and initialize a candidate list according to the HAD costs. Then, the candidate list is further improved by considering the probability distribution of candidate modes with different CU sizes. Finally, early termination of CU splitting is performed at each CU depth level based on the Bayesian theorem. Our proposed method is incorporated into VVC intra prediction for fast coding of depth videos. Experiments with 7 standard sequences and 4 Quantization parameters (Qps) validate the efficiency of our method.

Ensemble Downscaling of Soil Moisture Data Using BMA and ATPRK

  • Youn, Youjeong;Kim, Kwangjin;Chung, Chu-Yong;Park, No-Wook;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.4
    • /
    • pp.587-607
    • /
    • 2020
  • Soil moisture is essential information for meteorological and hydrological analyses. To date, many efforts have been made to achieve the two goals for soil moisture data, i.e., the improvement of accuracy and resolution, which is very challenging. We presented an ensemble downscaling method for quality improvement of gridded soil moisture data in terms of the accuracy and the spatial resolution by the integration of BMA (Bayesian model averaging) and ATPRK (area-to-point regression kriging). In the experiments, the BMA ensemble showed a 22% better accuracy than the data sets from ESA CCI (European Space Agency-Climate Change Initiative), ERA5 (ECMWF Reanalysis 5), and GLDAS (Global Land Data Assimilation System) in terms of RMSE (root mean square error). Also, the ATPRK downscaling could enhance the spatial resolution from 0.25° to 0.05° while preserving the improved accuracy and the spatial pattern of the BMA ensemble, without under- or over-estimation. The quality-improved data sets can contribute to a variety of local and regional applications related to soil moisture, such as agriculture, forest, hydrology, and meteorology. Because the ensemble downscaling method can be applied to the other land surface variables such as temperature, humidity, precipitation, and evapotranspiration, it can be a viable option to complement the accuracy and the spatial resolution of satellite images and numerical models.

A spatial analysis of Neyman-Scott rectangular pulses model using an approximate likelihood function (근사적 우도함수를 이용한 Neyman-Scott 구형펄스모형의 공간구조 분석)

  • Lee, Jeongjin;Kim, Yongku
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.5
    • /
    • pp.1119-1131
    • /
    • 2016
  • The Neyman-Scott Rectangular Pulses Model (NSRPM) is mainly used to construct hourly rainfall series. This model uses a modest number of parameters to represent the rainfall processes and underlying physical phenomena, such as the arrival of storms or rain cells. In NSRPM, the method of moments has often been used because it is difficult to know the distribution of rainfall intensity. Recently, approximated likelihood function for NSRPM has been introduced. In this paper, we propose a hierarchical model for applying a spatial structure to the NSRPM parameters using the approximated likelihood function. The proposed method is applied to summer hourly precipitation data observed at 59 weather stations (Korea Meteorological Administration) from 1973 to 2011.

Malicious Traffic Detection Using K-means (K-평균 클러스터링을 이용한 네트워크 유해트래픽 탐지)

  • Shin, Dong Hyuk;An, Kwang Kue;Choi, Sung Chune;Choi, Hyoung-Kee
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.41 no.2
    • /
    • pp.277-284
    • /
    • 2016
  • Various network attacks such as DDoS(Distributed Denial of service) and orm are one of the biggest problems in the modern society. These attacks reduce the quality of internet service and caused the cyber crime. To solve the above problem, signature based IDS(Intrusion Detection System) has been developed by network vendors. It has a high detection rate by using database of previous attack signatures or known malicious traffic pattern. However, signature based IDS have the fatal weakness that the new types of attacks can not be detected. The reason is signature depend on previous attack signatures. In this paper, we propose a k-means clustering based malicious traffic detection method to complement the problem of signature IDS. In order to demonstrate efficiency of the proposed method, we apply the bayesian theorem.

A Study on Combined DoA Estimation Algorithm using LCMV and Maximum Posterior on Uniform Linear Array Antenna (균일 선형 배열 안테나에서 선형구속최소분산 방법과 사후 추정 확률을 결합한 도래 방향 추정 알고리즘 연구)

  • Lee, Kwan-Hyeong;Park, Sung-Kon;Jeong, Youn-Seo
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.9 no.3
    • /
    • pp.291-297
    • /
    • 2016
  • In this paper, we are comparative analysis of exit algorithm and proposal algorithm for desired target direction of arrival estimation in correlation signal system. Proposed algorithm in this paper is to decrease target direction of arrival an estimation error probability using bayesian, maximum posterior, and MUSIC algorithm in order to decrease direction of arrival error probability as optimize and use linear constrained minimum variance to update weight value. Through simulation, we were comparative analysis proposed algorithm and exit MUSIC algorithm. In case SNR is 10dB and antenna element arrays are 9 and 12, We show the superior performance of the proposed method relative to the class method to decrease of signal estimation error probability about 11% and 13%, respectively.

Development and Evaluation of Drought Outlook method Using Climate Prediction with Bayesian method (기후예측정보와 베이지안 기법을 활용한 가뭄전망기술 개발 및 평가)

  • Son, Kyung-Hwan;Bae, Deg-Hyo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2015.05a
    • /
    • pp.22-22
    • /
    • 2015
  • 가뭄은 적시에 경보해야 하는 홍수와 달리 진행속도가 느리고 시간적으로 대처할 여유가 있어 진행중일지라도 미리 감지만 한다면 그 피해를 최소화할 수 있다. 이로 인해 미국 등 수문기상 선진국에서는 수문기상 장기예보자료로부터 가뭄전망정보 생산기술을 개발하였으며, 특히 가뭄전망의 정확도 향상을 위해 여러 통계적 보정기법을 적용하고 있다. 국내의 경우 기상청에서 가뭄전망을 목적으로 2011년에 수치예보모델을 이용하여 가뭄전망정보를 생산한바 있으나, 전망정보의 불확실성 문제로 가뭄예보에 활용하는데 한계가 있어 이를 개선할 수 있는 기술개발이 요구되는 실정이다. 본 연구에서는 기후예측자료를 이용하여 가뭄전망정보 생산기술을 개발하고 정확도 개선을 위해 베이지안 기법을 연계하였다. GloSea5 (Global Seasonal forecast model 5) 장기예보자료를 이용하였으며, 베이지안 기법을 통해 과거 관측자료에 대한 사전분포, 모델의 전망정보로부터 우도함수를 유도하여 최종 사후분포를 추정하였다. 베이지안 기법 적용 전 후에 따른 가뭄지수를 산정하였다. 관측자료 기반의 가뭄지수와의 비교분석을 통해 선행기간 및 계절별 가뭄예측 성능을 평가하였으며, 실제 가뭄기간 동안에 가뭄의 재현성을 지역별로 분석하였다. 장기예보자료만을 활용한 기존 가뭄전망에서는 관측 자료가 포함된 1개월 전망에서도 불확실성이 매우 높았지만 베이지안 기법 적용으로 가뭄전망의 정확도가 크게 개선되었다. 특히, 1, 2개월 전망의 시계열 가뭄지수가 관측기반의 가뭄지수의 거동과 매우 유사하게 나타났으며, 지역별로도 베이지안 기법 적용시 실제 가뭄피해 상황을 적절히 재현하는 것으로 나타났다. 국내 가뭄예보에 있어 기후예측정보를 단순활용하기 보다는 베이지안과 같은 통계적 보정기법을 이용하여 가뭄전망정보를 생산하는 것이 바람직하며, 본 연구에서는 가뭄예보업무에 활용될 수 있도록 베이지안 기법에 대한 검증 및 평가를 지속적으로 수행할 계획이다.

  • PDF