Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation


Abstract

Aleatoric uncertainty estimation, based on the observed training data, is applied to detect conflicts in a demonstration data set. The particular focus of this paper is the resolution of conflicting data arising from scenarios with equivalent action choices, such as obstacle avoidance, path planning, or multiple joint configurations. The proposed algorithm aims to decrease this otherwise irreducible uncertainty by directly altering the accrued data set, providing data that a policy-learning neural network can fit appropriately. The algorithm was validated in real robot scenarios involving learning from inconsistent demonstrations, where the resulting policies consistently achieved their prescribed objectives. A video showing our method and experiments can be found at: https://youtu.be/oGYnzlW9Ncw.
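The following is a minimal sketch of the general technique the abstract names, aleatoric uncertainty estimation via a heteroscedastic Gaussian policy head trained with a negative log-likelihood loss, used to flag conflicting demonstrations. It is not the authors' implementation; the network architecture, the toy data, the threshold, and all names (HeteroscedasticPolicy, nll_loss) are illustrative assumptions.

```python
# Hypothetical sketch: a policy network that predicts an action mean and a
# per-dimension log-variance. Trained with Gaussian NLL, the predicted variance
# captures aleatoric (data) uncertainty, so states with equivalent but
# conflicting demonstrated actions receive high variance and can be flagged.
import torch
import torch.nn as nn

class HeteroscedasticPolicy(nn.Module):
    def __init__(self, state_dim, action_dim, hidden=64):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(state_dim, hidden), nn.ReLU(),
                                  nn.Linear(hidden, hidden), nn.ReLU())
        self.mean = nn.Linear(hidden, action_dim)     # predicted action
        self.log_var = nn.Linear(hidden, action_dim)  # predicted aleatoric variance (log)

    def forward(self, s):
        h = self.body(s)
        return self.mean(h), self.log_var(h)

def nll_loss(mean, log_var, target):
    # Gaussian negative log-likelihood (up to constants); the variance term lets
    # the network "explain away" conflicting targets instead of averaging them.
    return 0.5 * (log_var + (target - mean) ** 2 / log_var.exp()).mean()

# Toy conflicting demonstrations: for every state the "expert" randomly steers
# left (-1) or right (+1), as with two equivalent obstacle-avoidance choices.
states = torch.rand(512, 1) * 2 - 1
actions = (torch.rand(512, 1) < 0.5).float() * 2 - 1

policy = HeteroscedasticPolicy(state_dim=1, action_dim=1)
opt = torch.optim.Adam(policy.parameters(), lr=1e-3)
for _ in range(2000):
    mean, log_var = policy(states)
    loss = nll_loss(mean, log_var, actions)
    opt.zero_grad()
    loss.backward()
    opt.step()

# Flag high-aleatoric-uncertainty samples as candidates for data manipulation,
# e.g. keeping only one of the equivalent action choices before retraining.
with torch.no_grad():
    _, log_var = policy(states)
variance = log_var.exp().mean(dim=1)
conflicting = variance > variance.quantile(0.9)
print(f"flagged {conflicting.sum().item()} of {len(states)} demonstrations")
```

In this sketch the flagged subset would then be resolved (for instance by retaining a single consistent action mode) and the policy retrained, which is the spirit of the data manipulation step described in the abstract.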
