The in-sample softmax for offline reinforcement learning | C Xiao, H Wang, Y Pan, A White, M White | arXiv preprint arXiv:2302.14372, 2023 | 19 | 2023 |
Investigating the properties of neural network representations in reinforcement learning | H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ... | Artificial Intelligence, 104100, 2024 | 15 | 2024 |
Measuring and mitigating interference in reinforcement learning | V Liu, H Wang, RY Tao, K Javed, A White, M White | Conference on Lifelong Learning Agents, 781-795, 2023 | 3 | 2023 |
No more pesky hyperparameters: Offline hyperparameter tuning for RL | H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... | arXiv preprint arXiv:2205.08716, 2022 | 3 | 2022 |
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay | H Zhang, C Xiao, H Wang, J Jin, M Müller | The Eleventh International Conference on Learning Representations, 2022 | 1 | 2022 |