• Title/Summary/Keyword: outlier detection method

Search Result 128, Processing Time 0.021 seconds

Outlier Detection Based on Discrete Wavelet Transform with Application to Saudi Stock Market Closed Price Series

  • RASHEDI, Khudhayr A.;ISMAIL, Mohd T.;WADI, S. Al;SERROUKH, Abdeslam
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.12
    • /
    • pp.1-10
    • /
    • 2020
  • This study investigates the problem of outlier detection based on discrete wavelet transform in the context of time series data where the identification and treatment of outliers constitute an important component. An outlier is defined as a data point that deviates so much from the rest of observations within a data sample. In this work we focus on the application of the traditional method suggested by Tukey (1977) for detecting outliers in the closed price series of the Saudi Arabia stock market (Tadawul) between Oct. 2011 and Dec. 2019. The method is applied to the details obtained from the MODWT (Maximal-Overlap Discrete Wavelet Transform) of the original series. The result show that the suggested methodology was successful in detecting all of the outliers in the series. The findings of this study suggest that we can model and forecast the volatility of returns from the reconstructed series without outliers using GARCH models. The estimated GARCH volatility model was compared to other asymmetric GARCH models using standard forecast error metrics. It is found that the performance of the standard GARCH model were as good as that of the gjrGARCH model over the out-of-sample forecasts for returns among other GARCH specifications.

Development of a WPAN-based Self-positioning System for Indoor Flying Robots (실내 비행 로봇을 위한 WPAN 기반 자가 측위 시스템 개발)

  • Lim, Jeong-Min;Jeong, Won-Min;Sung, Tae-Kyung
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.21 no.5
    • /
    • pp.490-495
    • /
    • 2015
  • As flying robots are becoming popular, there are increased needs to use themforsuch purposes as parcel delivery, serving in restaurants, and stage performances. To control flying robots such as quad copters, localization is essential. In order to properly position flying robots, many techniques are in development, including IR (infra-red)-based systemswhich catch markers on a flying robot in order that it can position itself. However, this technique demonstrates only short coverage. Furthermore, localization from inertial sensors diverges as time passes. For this reason, this paper suggests a TWR (two-way ranging) based positioning technique. Despite the weaknesses in currently available TWR system, this paper suggests a self-positioning and outlier detection technique in order to provide reliable position information with a faster update rate. The self-positioning system sends a shorter message which reduces wireless traffic. By detecting and removing outlier measurements, a positioning result with better accuracy is acquired. Finally, this paper shows that the suggesting system detects outlierssequentially from less than half the number of anchors in localization system according to the degree of outlier in measurement and the noise level. By performing an outlier algorithm, better positioning accuracy is acquired as shown in the experimental result.

Voronoi Diagram-based USBL Outlier Rejection for AUV Localization

  • Hyeonmin Sim;Hangil Joe
    • Journal of Ocean Engineering and Technology
    • /
    • v.38 no.3
    • /
    • pp.115-123
    • /
    • 2024
  • USBL systems are essential for providing accurate positions of autonomous underwater vehicles (AUVs). On the other hand, the accuracy can be degraded by outliers because of the environmental conditions. A failure to address these outliers can significantly impact the reliability of underwater localization and navigation systems. This paper proposes a novel outlier rejection algorithm for AUV localization using Voronoi diagrams and query point calculation. The Voronoi diagram divides data space into Voronoi cells that center on ultra-short baseline (USBL) data, and the calculated query point determines if the corresponding USBL data is an inlier. This study conducted experiments acquiring GPS and USBL data simultaneously and optimized the algorithm empirically based on the acquired data. In addition, the proposed method was applied to a sensor fusion algorithm to verify its effectiveness, resulting in improved pose estimations. The proposed method can be applied to various sensor fusion algorithms as a preprocess and could be used for outlier rejection for other 2D-based location sensors.

Outlier Detection and Treatment for the Conversion of Chemical Oxygen Demand to Total Organic Carbon (화학적산소요구량의 총유기탄소 변환을 위한 이상자료의 탐지와 처리)

  • Cho, Beom Jun;Cho, Hong Yeon;Kim, Sung
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.26 no.4
    • /
    • pp.207-216
    • /
    • 2014
  • Total organic carbon (TOC) is an important indicator used as an direct biological index in the research field of the marine carbon cycle. It is possible to produce the sufficient TOC estimation data by using the Chemical Oxygen Demand(COD) data because the available TOC data is relatively poor than the COD data. The outlier detection and treatment (removal) should be carried out reasonably and objectively because the equation for a COD-TOC conversion is directly affected the TOC estimation. In this study, it aims to suggest the optimal regression model using the available salinity, COD, and TOC data observed in the Korean coastal zone. The optimal regression model is selected by the comparison and analysis on the changes of data numbers before and after removal, variation coefficients and root mean square (RMS) error of the diverse detection methods of the outlier and influential observations. According to research result, it is shown that a diagnostic case combining SIQR (Semi - Inter-Quartile Range) boxplot and Cook's distance method is most suitable for the outlier detection. The optimal regression function is estimated as the TOC(mg/L) = $0.44{\cdot}COD(mg/L)+1.53$, then determination coefficient is showed a value of 0.47 and RMS error is 0.85 mg/L. The RMS error and the variation coefficients of the leverage values are greatly reduced to the 31% and 80% of the value before the outlier removal condition. The method suggested in this study can provide more appropriate regression curve because the excessive impacts of the outlier frequently included in the COD and TOC monitoring data is removed.

A sequential outlier detecting method using a clustering algorithm (군집 알고리즘을 이용한 순차적 이상치 탐지법)

  • Seo, Han Son;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.4
    • /
    • pp.699-706
    • /
    • 2016
  • Outlier detection methods without performing a test often do not succeed in detecting multiple outliers because they are structurally vulnerable to a masking effect or a swamping effect. This paper considers testing procedures supplemented to a clustering-based method of identifying the group with a minority of the observations as outliers. One of general steps is performing a variety of t-test on individual outlier-candidates. This paper proposes a sequential procedure for searching for outliers by changing cutoff values on a cluster tree and performing a test on a set of outlier-candidates. The proposed method is illustrated and compared to existing methods by an example and Monte Carlo studies.

A Generalized Likelihood Ratio Test in Outlier Detection (이상점 탐지를 위한 일반화 우도비 검정)

  • Jang Sun Baek
    • The Korean Journal of Applied Statistics
    • /
    • v.7 no.2
    • /
    • pp.225-237
    • /
    • 1994
  • A generalized likelihood ratio test is developed to detect an outlier associated with monitoring nuclear proliferation. While the classical outlier detection methods consider continuous variables only, our approach allows both continuous and discrete variables or a mixture of continuous and discrete variables to be used. In addition, our method is free of the normality assumption, which is the key assumption in most of the classical methods. The proposed test is constructed by applying the bootstrap to a generalized likelihood ratio. We investigate the performance of the test by studying the power with simulations.

  • PDF

A study on the outlier data estimation method for anomaly detection of photovoltaic system (태양광 발전 이상감지를 위한 아웃라이어 추정 방법에 대한 연구)

  • Seo, Jong Kwan;Lee, Tae Il;Lee, Whee Sung;Park, Jeom Bae
    • Journal of IKEEE
    • /
    • v.24 no.2
    • /
    • pp.403-408
    • /
    • 2020
  • Photovoltaic (PV) has both intermittent and uncertainty in nature, so it is difficult to accurately predict. Thus anomaly detection technology is important to diagnose real time PV generation. This paper identifies a correlation between various parameters and classifies the PV data applying k-nearest neighbor and dynamic time warpping. Results for the two classifications showed that an outlier detection by a fault of some facilities, and a temporary power loss by partial shading and overall shading occurring during the short period. Based on 100kW plant data, machine learning analysis and test results verified actual outliers and candidates of outlier.

Outlier detection of main engine data of a ship using ensemble method (앙상블 기법을 이용한 선박 메인엔진 빅데이터의 이상치 탐지)

  • KIM, Dong-Hyun;LEE, Ji-Hwan;LEE, Sang-Bong;JUNG, Bong-Kyu
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.56 no.4
    • /
    • pp.384-394
    • /
    • 2020
  • This paper proposes an outlier detection model based on machine learning that can diagnose the presence or absence of major engine parts through unsupervised learning analysis of main engine big data of a ship. Engine big data of the ship was collected for more than seven months, and expert knowledge and correlation analysis were performed to select features that are closely related to the operation of the main engine. For unsupervised learning analysis, ensemble model wherein many predictive models are strategically combined to increase the model performance, is used for anomaly detection. As a result, the proposed model successfully detected the anomalous engine status from the normal status. To validate our approach, clustering analysis was conducted to find out the different patterns of anomalies the anomalous point. By examining distribution of each cluster, we could successfully find the patterns of anomalies.

Bayesian Outlier Detection in Regression Model

  • Younshik Chung;Kim, Hyungsoon
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.3
    • /
    • pp.311-324
    • /
    • 1999
  • The problem of 'outliers', observations which look suspicious in some way, has long been one of the most concern in the statistical structure to experimenters and data analysts. We propose a model for an outlier problem and also analyze it in linear regression model using a Bayesian approach. Then we use the mean-shift model and SSVS(George and McCulloch, 1993)'s idea which is based on the data augmentation method. The advantage of proposed method is to find a subset of data which is most suspicious in the given model by the posterior probability. The MCMC method(Gibbs sampler) can be used to overcome the complicated Bayesian computation. Finally, a proposed method is applied to a simulated data and a real data.

  • PDF

A novel transmissibility concept based on wavelet transform for structural damage detection

  • Fan, Zhe;Feng, Xin;Zhou, Jing
    • Smart Structures and Systems
    • /
    • v.12 no.3_4
    • /
    • pp.291-308
    • /
    • 2013
  • A novel concept of transmissibility based on a wavelet transform for structural damage detection is presented in this paper. The main objective of the research was the development of a method for detecting slight damage at the incipient stage. As a vibration-based approach, the concept of transmissibility has attracted considerable interest because of its advantages and effectiveness in damage detection. However, like other vibration-based methods, transmissibility-based approaches suffer from insensitivity to slight local damage because of the regularity of the traditional Fourier transform. Therefore, the powerful signal processing techniques must be found to solve this problem. Wavelet transform that is able to capture subtle information in measured signals has received extensive attention in the field of damage detection in recent decades. In this paper, we first propose a novel transmissibility concept based on the wavelet transform. Outlier analysis was adopted to construct a damage detection algorithm with wavelet-based transmissibility. The feasibility of the proposed method was numerically investigated with a typical six-degrees-of-freedom spring-mass system, and comparative investigations were performed with a conventional transmissibility approach. The results demonstrate that the proposed transmissibility is more sensitive than conventional transmissibility, and the former is a promising tool for structural damage detection at the incipient stage.