Shi Yuan Tang

Conference paper (1)

Journal article (1)

2 records found

Teacher-apprentices RL (TARL)

Leveraging complex policy distribution through generative adversarial hypernetwork in reinforcement learning

Journal article (2023) - Shi Yuan Tang (author), Athirai A. Irissappane (author), Frans A. Oliehoek (author), F.A. Oliehoek (author), Jie Zhang (author)

Typically, a Reinforcement Learning (RL) algorithm focuses in learning a single deployable policy as the end product. Depending on the initialization methods and seed randomization, learning a single policy could possibly leads to convergence to different local optima across diff ...

Learning Complex Policy Distribution with CEM Guided Adversarial Hypernetwork

Conference paper (2021) - Shi Yuan Tang (author), F.A. Oliehoek (author), Frans A. Oliehoek (author), Athirai A. Irissappane (author), Jie Zhang (author)

Cross-Entropy Method (CEM) is a gradient-free direct policy search method, which has greater stability and is insensitive to hyperparameter tuning. CEM bears similarity to population-based evolutionary methods, but, rather than using a population it uses a distribution over candi ...