Computing the fault tolerance of multi-agent deployment

Authors:


Abstract

A deployment of a multi-agent system on a network is a placement of one or more copies of each agent on network hosts such that the memory constraints of each node are satisfied. Finding the deployment that is most likely to tolerate faults (i.e., to keep at least one copy of each agent functioning and in communication with the other agents) is a challenge. In this paper, we address the problem of computing the survival probability of a deployment (i.e., the probability that the deployment will tolerate faults), under the assumption that node failures are independent. We show that computing the survival probability of a deployment is at least NP-hard and, moreover, hard to approximate. We present two algorithms that compute the survival probability exactly; as expected, they take exponential time. We also present five heuristic algorithms that estimate survival probabilities in acceptable time frames. We report on a detailed set of experiments that determine the conditions under which some of these algorithms perform better than others.
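To make the notion of survival probability concrete, the following is a minimal brute-force sketch and not one of the paper's algorithms. It assumes a simplified model in which surviving nodes can always communicate, so a deployment survives exactly when every agent has at least one copy on a node that is still up. The function name `survival_probability`, the dictionary layout, and the example failure probabilities are all hypothetical, chosen only for illustration. The enumeration visits every combination of node states, which is why its running time is exponential in the number of nodes.

```python
from itertools import product


def survival_probability(deployment, fail_prob):
    """Exact survival probability under independent node failures.

    deployment: dict mapping agent name -> set of nodes hosting a copy.
    fail_prob:  dict mapping node name -> probability that the node fails.
    Assumption (not from the paper): surviving nodes can always communicate.
    """
    nodes = sorted(fail_prob)
    total = 0.0
    # Enumerate all 2^n up/down assignments of the nodes.
    for states in product([True, False], repeat=len(nodes)):
        alive = {n for n, up in zip(nodes, states) if up}
        # Probability of this exact assignment (independence assumption).
        p = 1.0
        for n, up in zip(nodes, states):
            p *= (1.0 - fail_prob[n]) if up else fail_prob[n]
        # The deployment survives if every agent has a copy on a live node.
        if all(hosts & alive for hosts in deployment.values()):
            total += p
    return total


# Hypothetical example: two agents replicated across three nodes.
deployment = {"a": {"n1", "n2"}, "b": {"n2", "n3"}}
fail_prob = {"n1": 0.1, "n2": 0.2, "n3": 0.3}
print(survival_probability(deployment, fail_prob))
```

In this example the deployment survives if n2 is up, or if n2 is down while both n1 and n3 are up, giving 0.8 + 0.2 × 0.9 × 0.7 = 0.926 up to floating-point rounding. Modelling communication between agents, as the abstract requires, would add a connectivity check over the surviving nodes.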

Keywords: Multi-agent deployment, Fault tolerance, Algorithms, Replication

Review history: Received 11 June 2007, Revised 12 November 2008, Accepted 14 November 2008, Available online 25 November 2008.

Official paper URL: https://doi.org/10.1016/j.artint.2008.11.007