Jelena Luketina

Conference paper (1)

1 records found

Transient non-stationarity and generalisation in deep reinforcement learning

Conference paper (2021) - Maximilian Igl (author) , Gregory Farquhar (author) , Jelena Luketina (author) , J.W. Böhmer (author) , Shimon Whiteson (author)

Non-stationarity can arise in Reinforcement Learning (RL) even in stationary environments. For example, most RL algorithms collect new data throughout training, using a non-stationary behaviour policy. Due to the transience of this non-stationarity, it is often not explicitly add ...