Prabhat Nagarajan
PhD Student | The University of Alberta
Verified email at ualberta.ca
Title
Cited by
Year
Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations
D Brown, W Goo, P Nagarajan, S Niekum
International Conference on Machine Learning, 783-792, 2019
Cited by: 398 | Year: 2019
ChainerRL: A Deep Reinforcement Learning Library
Y Fujita, P Nagarajan, T Kataoka, T Ishikawa
Journal of Machine Learning Research 22 (77), 1-14, 2021
Cited by: 140 | Year: 2021
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
AAAI 2019 Workshop on Reproducible AI, 2019
Cited by: 67 | Year: 2019
The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden, 2018
Cited by: 35 | Year: 2018
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Y Fujita, K Uenishi, A Ummadisingu, P Nagarajan, S Masuda, MY Castro
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020
Cited by: 24 | Year: 2020
Learning Latent State Spaces for Planning through Reward Prediction
A Havens, Y Ouyang, P Nagarajan, Y Fujita
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
Cited by: 7 | Year: 2019
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
ZW Hong, P Nagarajan, G Maeda
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by: 4 | Year: 2021
Reconnaissance for Reinforcement Learning with Safety Constraints
S Maeda, H Watahiki, Y Ouyang, S Okada, M Koyama, P Nagarajan
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by: 3 | Year: 2021
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
arXiv preprint arXiv:2312.02355, 2023
Cited by: 1 | Year: 2023
Swarm-inspired Reinforcement Learning via Collaborative Inter-agent Knowledge Distillation
ZW Hong, P Nagarajan, G Maeda
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
Year: 2019
Nondeterminism as a Reproducibility Challenge for Deep Reinforcement Learning
PM Nagarajan
The University of Texas at Austin, 2018
Year: 2018
Articles 1–11