Reinforcement Learning from Human Feedback (RLHF) offers a powerful approach to training agents in environments where defining an explicit reward function is challenging, by learning instead from human feedback provided in various forms. This research evaluates three common feedback types within RLHF: scalar feedback, binary comparison feedback, and binary comparison with a preference-strength margin. Synthetic feedback is used in place of real human feedback to address cost and time constraints. Simplified RLHF setups using Q-learning are first implemented in a grid environment to validate the methods; subsequent experiments are conducted in more complex environments using the Imitation library and PPO from Stable Baselines3. Our findings demonstrate the efficacy of the different feedback types and highlight the trade-off between ease of use for human feedback providers and the amount of information conveyed. This comparative analysis provides insights into optimizing RLHF systems for improved agent performance. Full code is available online as supplementary material at https://github.com/navimakarov/rlhf-feedback-variety.
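As a rough illustration of the three feedback types compared in this work, the sketch below generates synthetic feedback from ground-truth segment returns. The function names, the Bradley-Terry-style noise model, and the margin thresholds are illustrative assumptions for exposition, not the exact implementation in the linked repository.

```python
import numpy as np

rng = np.random.default_rng(0)

def scalar_feedback(segment_return, low, high):
    """Scalar feedback: the synthetic rater reports a bounded rating
    proportional to the segment's ground-truth return."""
    return np.clip((segment_return - low) / (high - low), 0.0, 1.0)

def binary_comparison(return_a, return_b, beta=1.0):
    """Binary comparison: sample the preferred segment (0 = A, 1 = B)
    from a Bradley-Terry model over the ground-truth returns."""
    p_a = 1.0 / (1.0 + np.exp(-beta * (return_a - return_b)))
    return 0 if rng.random() < p_a else 1

def comparison_with_margin(return_a, return_b, weak=1.0, strong=5.0):
    """Binary comparison with a preference-strength margin: report the
    preferred segment plus how strongly it is preferred, based on the
    return gap (thresholds here are assumed values)."""
    preferred = 0 if return_a >= return_b else 1
    gap = abs(return_a - return_b)
    if gap < weak:
        strength = "weak"
    elif gap < strong:
        strength = "moderate"
    else:
        strength = "strong"
    return preferred, strength

# Example: two trajectory segments with known returns.
ret_a, ret_b = 7.5, 3.0
print(scalar_feedback(ret_a, low=0.0, high=10.0))   # 0.75
print(binary_comparison(ret_a, ret_b))              # usually 0 (A preferred)
print(comparison_with_margin(ret_a, ret_b))         # (0, 'moderate')
```

The sketch makes the trade-off concrete: scalar feedback carries the most information per query but demands a calibrated rating from the provider, plain comparisons are the easiest to give but convey only one bit, and the margin variant sits in between.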