Folgen
Qiyang Li
Titel
Zitiert von
Zitiert von
Jahr
Offline reinforcement learning as one big sequence modeling problem
M Janner, Q Li, S Levine
Advances in neural information processing systems 34, 1273-1286, 2021
5542021
Timbretron: A wavenet (cyclegan (cqt (audio))) pipeline for musical timbre transfer
S Huang, Q Li, C Anil, X Bao, S Oore, RB Grosse
arXiv preprint arXiv:1811.09620, 2018
1212018
Deep neural networks for improved, impromptu trajectory tracking of quadrotors
Q Li, J Qian, Z Zhu, X Bao, MK Helwa, AP Schoellig
2017 IEEE International Conference on Robotics and Automation (ICRA), 5183-5189, 2017
1062017
Preventing gradient attenuation in lipschitz constrained convolutional networks
Q Li, S Haque, C Anil, J Lucas, RB Grosse, JH Jacobsen
Advances in neural information processing systems 32, 2019
992019
Building a winning self-driving car in six months
K Burnett, A Schimpe, S Samavi, M Gridseth, CW Liu, Q Li, Z Kroeze, ...
2019 International Conference on Robotics and Automation (ICRA), 9583-9589, 2019
202019
Efficient deep reinforcement learning requires regulating overfitting
Q Li, A Kumar, I Kostrikov, S Levine
arXiv preprint arXiv:2304.10466, 2023
192023
Understanding the complexity gains of single-task rl with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
122023
Learning of coordination policies for robotic swarms
Q Li, X Du, Y Huang, Q Sykora, AP Schoellig
arXiv preprint arXiv:1709.06620, 2017
92017
REFACTOR: Learning to Extract Theorems from Proofs
JP Zhou, Y Wu, Q Li, R Grosse
arXiv preprint arXiv:2402.17032, 2024
32024
Accelerating exploration with unlabeled prior data
Q Li, J Zhang, D Ghosh, A Zhang, S Levine
Advances in Neural Information Processing Systems 36, 2024
32024
AdaCat: Adaptive categorical discretization for autoregressive models
Q Li, A Jain, P Abbeel
Uncertainty in Artificial Intelligence, 1188-1198, 2022
12022
R-LAtte: Attention Module for Visual Control via Reinforcement Learning
M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel
2020
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–12