Jordi Smit

2 records found

PEBL: Pessimistic Ensembles for Offline Deep Reinforcement Learning

Conference paper (2021) - Jordi Smit (author) , C.T. Ponnambalam (author) , Matthijs Spaan (author) , Frans A Oliehoek (author)

Offline reinforcement learning (RL), or learning from a fixed data set, is an attractive alternative to online RL. Offline RL promises to address the cost and safety implications of taking numerous random or bad actions online, a crucial aspect of traditional RL that makes it d ...

OffSide: Learning to Identify Mistakes in Boundary Conditions

Conference paper (2020) - Jón Arnar Briem (author) , Jordi Smit (author) , Hendrig Sellik (author) , Pavel Rapoport (author) , Georgios Gousios (author) , Maurício Aniche (author)

Mistakes in boundary conditions are the cause of many bugs in software. These mistakes happen when, e.g., developers make use of '<' or '>' in cases where they should have used '<=' or '>='. Mistakes in boundary conditions are often hard to find and manually detecting ...