Deep Reinforcement Learning With Dynamic Graphs for Adaptive Informative Path Planning

Vashisth, Apoorva; Ruckin, Julius; Magistri, Federico; Stachniss, Cyrill; Popovic, Marija

Journal article (2024)

Authors

Apoorva Vashisth Indian Institute of Technology Kharagpur

Julius Ruckin Universität Bonn

Federico Magistri Universität Bonn

Cyrill Stachniss Universität Bonn, University of Oxford

Marija Popovic Universität Bonn, Control & Simulation

Research Group

Control & Simulation

Reinforcement learning; Motion and path planning; Robotics and automation in agriculture and forestry

To reference this document use:

http://resolver.tudelft.nl/uuid:6f5b1c4a-51c9-425f-a5f6-453b926bf8c4

Published Date

2024

Language

English

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Autonomous robots are often employed for data collection due to their efficiency and low labour costs. A key task in robotic data acquisition is planning paths through an initially unknown environment to collect observations given platform-specific resource constraints, such as limited battery life. Adaptive online path planning in 3D environments is challenging due to the large set of valid actions and the presence of unknown occlusions. To address these issues, we propose a novel deep reinforcement learning approach for adaptively replanning robot paths to map targets of interest in unknown 3D environments. A key aspect of our approach is a dynamically constructed graph that restricts planning actions to the robot's local neighbourhood, allowing us to react to newly discovered static obstacles and targets of interest. For replanning, we propose a new reward function that balances exploring the unknown environment against exploiting targets of interest discovered online. Our experiments show that our method enables more efficient target discovery than state-of-the-art learning and non-learning baselines. We also showcase our approach for orchard monitoring using an unmanned aerial vehicle in a photorealistic simulator.
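
The two ideas named in the abstract, a locally rebuilt action graph and a reward that trades exploration against exploitation, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the integer voxel lattice, the Euclidean travel-cost check, the linear reward, and every name here (`build_local_graph`, `step_reward`, `alpha`, `budget`) are assumptions made purely for illustration.

```python
import itertools
import math


def build_local_graph(robot, occupied, radius=1, budget=math.inf):
    """Return candidate waypoints on a voxel lattice around the robot.

    Hypothetical sketch: `robot` is an integer (x, y, z) voxel, `occupied`
    is the set of voxels known so far to contain obstacles, and `budget`
    is the remaining travel budget in voxel units.
    """
    nodes = []
    for offset in itertools.product(range(-radius, radius + 1), repeat=3):
        if offset == (0, 0, 0):
            continue  # staying in place is not a planning action here
        cand = tuple(r + o for r, o in zip(robot, offset))
        if cand in occupied:
            continue  # skip waypoints inside known obstacles
        if math.dist(robot, cand) > budget:
            continue  # skip waypoints the remaining budget cannot reach
        nodes.append(cand)
    return nodes


def step_reward(new_explored_cells, new_target_cells, alpha=0.5):
    """Assumed linear trade-off between exploration and exploitation."""
    return alpha * new_explored_cells + (1.0 - alpha) * new_target_cells


# The graph is rebuilt after each observation, so an obstacle discovered
# online simply drops out of the next action set.
known_obstacles = {(1, 0, 0)}
actions_before = build_local_graph((0, 0, 0), known_obstacles)
known_obstacles.add((0, 1, 0))
actions_after = build_local_graph((0, 0, 0), known_obstacles)
```

Rebuilding the graph at every step keeps the action set small and consistent with the current map, which is the property the abstract attributes to the dynamic graph. The weighting `alpha` is a stand-in for whatever balance the paper's reward function actually uses.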

Files

Deep_Reinforcement_Learning_Wi... (pdf)

(pdf | 1.5 Mb)

- Embargo expired on 06-01-2025

License info not available