Robotic manipulation of multiple objects as a POMDP

作者：

摘要

This paper investigates manipulation of multiple unknown objects in a crowded environment. Because of incomplete knowledge due to unknown objects and occlusions in visual observations, object observations are imperfect and action success is uncertain, making planning challenging. We model the problem as a partially observable Markov decision process (POMDP), which allows a general reward based optimization objective and takes uncertainty in temporal evolution and partial observations into account. In addition to occlusion dependent observation and action success probabilities, our POMDP model also automatically adapts object specific action success probabilities. To cope with the changing system dynamics and performance constraints, we present a new online POMDP method based on particle filtering that produces compact policies. The approach is validated both in simulation and in physical experiments in a scenario of moving dirty dishes into a dishwasher. The results indicate that: 1) a greedy heuristic manipulation approach is not sufficient, multi-object manipulation requires multi-step POMDP planning, and 2) on-line planning is beneficial since it allows the adaptation of the system dynamics model based on actual experience.

论文关键词：POMDP,Planning under uncertainty,Task planning,Manipulation,Unknown objects,Multiple objects,Cluttered environment

论文评审过程：Revised 15 February 2015, Accepted 1 April 2015, Available online 3 April 2015, Version of Record 25 April 2017.

论文官网地址：https://doi.org/10.1016/j.artint.2015.04.001