Folgen
Steven Kapturowski
Steven Kapturowski
DeepMind
Bestätigte E-Mail-Adresse bei google.com
Titel
Zitiert von
Zitiert von
Jahr
Agent57: Outperforming the atari human benchmark
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ...
International conference on machine learning, 507-517, 2020
6192020
Recurrent experience replay in distributed reinforcement learning
S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney
International conference on learning representations, 2018
5292018
Never give up: Learning directed exploration strategies
AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ...
arXiv preprint arXiv:2002.06038, 2020
3182020
The DeepMind JAX Ecosystem, 2020
I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/deepmind 5, 2010
942010
Making efficient use of demonstrations to solve hard exploration problems
TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ...
arXiv preprint arXiv:1909.01387, 2019
882019
The DeepMind JAX Ecosystem
I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ...
URL http://github. com/deepmind 24, 25, 2020
602020
Human-level Atari 200x faster
S Kapturowski, V Campos, R Jiang, N Rakićević, H van Hasselt, ...
arXiv preprint arXiv:2209.07550, 2022
252022
Beyond fine-tuning: Transferring behavior in reinforcement learning
V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ...
arXiv preprint arXiv:2102.13515, 2021
212021
Revisiting Peng’s Q() for Modern Reinforcement Learning
T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ...
International Conference on Machine Learning, 5794-5804, 2021
182021
Value-driven hindsight modelling
A Guez, F Viola, T Weber, L Buesing, S Kapturowski, D Precup, D Silver, ...
Advances in Neural Information Processing Systems 33, 12499-12509, 2020
172020
Temporal difference uncertainties as a signal for exploration
S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ...
arXiv preprint arXiv:2010.02255, 2020
152020
Coverage as a principle for discovering transferable behavior in reinforcement learning
V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...
92020
RLax: Reinforcement Learning in JAX, 2020
D Budden, M Hessel, J Quan, S Kapturowski, K Baumli, S Bhupatiraju, ...
URL http://github. com/deepmind/rlax, 0
8
Never give up: Learning directed exploration strategies
A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ...
arXiv e-prints, arXiv: 2002.06038, 2020
62020
Agent57: Outperforming the Atari Human Benchmark. arXiv e-prints, page
AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ...
arXiv preprint arXiv:2003.13350, 2020
52020
Unlocking the power of representations in long-term novelty-based exploration
A Saade, S Kapturowski, D Calandriello, C Blundell, P Sprechmann, ...
arXiv preprint arXiv:2305.01521, 2023
42023
Offline Actor-Critic Reinforcement Learning Scales to Large Models
JT Springenberg, A Abdolmaleki, J Zhang, O Groth, M Bloesch, T Lampe, ...
arXiv preprint arXiv:2402.05546, 2024
12024
Transformers need glasses! Information over-squashing in language tasks
F Barbero, A Banino, S Kapturowski, D Kumaran, JGM Araújo, A Vitvitskyi, ...
arXiv e-prints, arXiv: 2406.04267, 2024
2024
Jointly learning exploratory and non-exploratory action selection policies
AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ...
US Patent App. 18/334,112, 2024
2024
Unlocking the Power of Representations in Long-term Novelty-based Exploration
S Kapturowski, A Saade, D Calandriello, C Blundell, P Sprechmann, ...
Second Agent Learning in Open-Endedness Workshop, 2023
2023
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20