Hv
Herke van Hoof
3 records found
1
Going Beyond Popularity and Positivity Bias
Correcting for Multifactorial Bias in Recommender Systems
Two typical forms of bias in user interaction data with recommender systems (RSs) are popularity bias and positivity bias, which manifest themselves as the over-representation of interactions with popular items or items that users prefer, respectively. Debiasing methods aim to mi
...
Mitigating Exposure Bias in Online Learning to Rank Recommendation
A Novel Reward Model for Cascading Bandits
Exposure bias is a well-known issue in recommender systems where items and suppliers are not equally represented in the recommendation results. This bias becomes particularly problematic over time as a few items are repeatedly over-represented in recommendation lists, leading to
...
MDP homomorphic networks
Group symmetries in reinforcement learning
This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit k
...