Data leakage detection algorithm based on task sequences and probabilities

作者:

Highlights:

摘要

In this paper we propose a novel algorithm to detect anomalous user behaviour in computer sessions. We first identify the behavioural profile of each authorized user from the computational tasks they usually carry out on the files of the information system. A new session is then codified as 2-length sequences and an algorithm based on the probability of those sequences is applied. The activities classified as possible anomalies are double-checked by applying Markov chains. The procedure has been proved efficient in terms of high detection accuracy and low false positive rate. It has been validate on a real database provided by a governmental institution of Ecuador and also on a public dataset of Unix commands. Besides, the algorithm has been shown efficient regarding computational time and the overhead of this monitoring software is low.

论文关键词:Anomaly detection,Computer user behaviour,Markov chains,Data leakage,Knowledge-based decision system

论文评审过程:Received 11 May 2016, Revised 4 January 2017, Accepted 5 January 2017, Available online 6 January 2017, Version of Record 15 February 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.01.009