Steven Kapturowski

Zitiert von

	Alle	Seit 2019
Zitate	1925	1923
h-index	12	12
i10-index	12	12

560

280

140

420

20192020202120222023202433 189 397 435 552 310

Folgen

Steven Kapturowski

DeepMind

Bestätigte E-Mail-Adresse bei google.com

Reinforcement Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020	644	2020
Recurrent experience replay in distributed reinforcement learning S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney International conference on learning representations, 2018	539	2018
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	329	2020
The DeepMind JAX Ecosystem, 2020 I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/deepmind 18, 2010	97	2010
Making efficient use of demonstrations to solve hard exploration problems TL Paine, C Gulcehre, B Shahriari, M Denil, M Hoffman, H Soyer, ... arXiv preprint arXiv:1909.01387, 2019	91	2019
The DeepMind JAX Ecosystem I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/deepmind 24, 25, 2020	62	2020
Human-level Atari 200x faster S Kapturowski, V Campos, R Jiang, N Rakićević, H van Hasselt, ... arXiv preprint arXiv:2209.07550, 2022	26	2022
Beyond fine-tuning: Transferring behavior in reinforcement learning V Campos, P Sprechmann, S Hansen, A Barreto, S Kapturowski, ... arXiv preprint arXiv:2102.13515, 2021	23	2021
Revisiting Peng’s Q() for Modern Reinforcement Learning T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ... International Conference on Machine Learning, 5794-5804, 2021	22	2021
The DeepMind JAX Ecosystem, 2020 IB DeepMind, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/google-deepmind, 0	21
Value-driven hindsight modelling A Guez, F Viola, T Weber, L Buesing, S Kapturowski, D Precup, D Silver, ... Advances in Neural Information Processing Systems 33, 12499-12509, 2020	18	2020
Temporal difference uncertainties as a signal for exploration S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ... arXiv preprint arXiv:2010.02255, 2020	15	2020
Coverage as a principle for discovering transferable behavior in reinforcement learning V Campos, P Sprechmann, SS Hansen, A Barreto, C Blundell, A Vitvitskyi, ...	9	2021
RLax: Reinforcement Learning in JAX, 2020 D Budden, M Hessel, J Quan, S Kapturowski, K Baumli, S Bhupatiraju, ... URL http://github. com/deepmind/rlax, 0	8
Never give up: Learning directed exploration strategies A Puigdomènech Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, ... arXiv e-prints, arXiv: 2002.06038, 2020	7	2020
Agent57: Outperforming the Atari Human Benchmark. arXiv e-prints, page AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, D Guo, ... arXiv preprint arXiv:2003.13350, 2020	5	2020
Unlocking the power of representations in long-term novelty-based exploration A Saade, S Kapturowski, D Calandriello, C Blundell, P Sprechmann, ... arXiv preprint arXiv:2305.01521, 2023	4	2023
Transformers need glasses! Information over-squashing in language tasks F Barbero, A Banino, S Kapturowski, D Kumaran, JGM Araújo, A Vitvitskyi, ... arXiv preprint arXiv:2406.04267, 2024	3	2024
Offline actor-critic reinforcement learning scales to large models JT Springenberg, A Abdolmaleki, J Zhang, O Groth, M Bloesch, T Lampe, ... arXiv preprint arXiv:2402.05546, 2024	2	2024
Jointly learning exploratory and non-exploratory action selection policies AP Badia, P Sprechmann, A Vitvitskyi, Z Guo, B Piot, SJ Kapturowski, ... US Patent App. 18/334,112, 2024		2024

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von