Spatio-temporal outlier detection algorithms based on computing behavioral outlierness factor

Authors:

Highlights:

Abstract

A major task in spatio-temporal outlier detection is to identify objects that exhibit abnormal behavior spatially, temporally, or both. Only a few algorithms have been proposed for detecting spatial and/or temporal outliers. One example is Local Density-Based Spatial Clustering of Applications with Noise (LDBSCAN). Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is mainly a clustering algorithm; it only tells us whether an object belongs to a cluster or is an outlier. The Local Outlier Factor (LOF) assigns each object a quantitative measure of outlierness, where a high LOF score means the object is potentially an outlier. The LDBSCAN algorithm, which combines these two notions, considers only the spatial context. Furthermore, it defeats the notion of a cluster (i.e., LDBSCAN may report clusters with fewer than the minimum required number of points), and some outliers may go undetected because of limitations in the conditions LDBSCAN uses. In this paper, we propose two algorithms, namely Spatio-Temporal Behavioral Density-based Clustering of Applications with Noise (ST-BDBCAN) and Approx-ST-BDBCAN. The ST-BDBCAN algorithm adopts a newly proposed concept called the Spatio-Temporal Behavioral Outlier Factor (ST-BOF), a spatio-temporal extension of LOF. It also uses spatial and temporal attributes simultaneously to define the context, so that the relative importance of spatial continuity and temporal continuity can be set to suit the application at hand. The Approx-ST-BDBCAN algorithm improves scalability, with minimal loss of detection accuracy, by partitioning data points for parallel processing. Experimental results on synthetic and buoy datasets suggest that the proposed algorithms are accurate and computationally efficient. Additionally, new Outlier Association with Hurricane Intensity Index (OAHII) measures are introduced for quantitative evaluation of the results on the buoy dataset.
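
To make the LOF idea mentioned in the abstract concrete, the following is a minimal sketch of the standard LOF computation applied to (x, y, t) points. It is not the paper's ST-BOF or ST-BDBCAN, whose definitions are not given here. The function names `st_distance` and `lof_scores`, the weights `w_space` and `w_time`, and the weighted-sum distance are all assumptions chosen for illustration; the weights only hint at how the relative importance of spatial and temporal closeness might be tuned.

```python
import math

def st_distance(a, b, w_space=1.0, w_time=1.0):
    """Illustrative weighted distance between two (x, y, t) points.

    Assumption: a weighted sum of spatial and temporal gaps.
    This is NOT the distance defined in the paper.
    """
    (x1, y1, t1), (x2, y2, t2) = a, b
    return w_space * math.hypot(x1 - x2, y1 - y2) + w_time * abs(t1 - t2)

def lof_scores(points, k, dist=st_distance):
    """Standard Local Outlier Factor; assumes all points are distinct."""
    n = len(points)
    d = [[dist(points[i], points[j]) for j in range(n)] for i in range(n)]

    # k-distance and k-neighbourhood (ties at the k-distance are included)
    k_dist, neighbors = [], []
    for i in range(n):
        order = sorted((j for j in range(n) if j != i), key=lambda j: d[i][j])
        kd = d[i][order[k - 1]]
        k_dist.append(kd)
        neighbors.append([j for j in order if d[i][j] <= kd])

    # local reachability density: inverse of the mean reachability distance
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], d[i][j]) for j in neighbors[i]]
        lrd.append(len(reach) / sum(reach))

    # LOF: mean ratio of the neighbours' density to the point's own density
    return [
        sum(lrd[j] for j in neighbors[i]) / (len(neighbors[i]) * lrd[i])
        for i in range(n)
    ]

# Five points close together in space and time, plus one distant point
points = [(0, 0, 0), (0, 1, 0), (1, 0, 0), (1, 1, 0), (0.5, 0.5, 0), (10, 10, 5)]
scores = lof_scores(points, k=2)
most_outlying = max(range(len(points)), key=lambda i: scores[i])
```

Points inside the tight group receive scores near 1, while the distant point receives a much larger score, which is the sense in which a high LOF marks a potential outlier.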

Keywords: Spatio-temporal outliers, Algorithms, Behavioral outlierness factor, Cluster, Hurricane, Efficiency

Review history: Received 9 April 2017, Revised 10 November 2017, Accepted 13 December 2017, Available online 18 December 2017, Version of Record 25 July 2019.

Paper URL: https://doi.org/10.1016/j.datak.2017.12.001