Folgen
Qiyang Li
Titel
Zitiert von
Zitiert von
Jahr
Offline reinforcement learning as one big sequence modeling problem
M Janner, Q Li, S Levine
Advances in neural information processing systems 34, 1273-1286, 2021
7502021
Timbretron: A wavenet (cyclegan (cqt (audio))) pipeline for musical timbre transfer
S Huang, Q Li, C Anil, X Bao, S Oore, RB Grosse
arXiv preprint arXiv:1811.09620, 2018
1402018
Deep neural networks for improved, impromptu trajectory tracking of quadrotors
Q Li, J Qian, Z Zhu, X Bao, MK Helwa, AP Schoellig
2017 IEEE International Conference on Robotics and Automation (ICRA), 5183-5189, 2017
1132017
Preventing gradient attenuation in lipschitz constrained convolutional networks
Q Li, S Haque, C Anil, J Lucas, RB Grosse, JH Jacobsen
Advances in neural information processing systems 32, 2019
1122019
Openeqa: Embodied question answering in the era of foundation models
A Majumdar, A Ajay, X Zhang, P Putta, S Yenamandra, M Henaff, S Silwal, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
652024
Efficient deep reinforcement learning requires regulating overfitting
Q Li, A Kumar, I Kostrikov, S Levine
arXiv preprint arXiv:2304.10466, 2023
312023
Learning Visuotactile Skills with Two Multifingered Hands
T Lin, Y Zhang, Q Li, H Qi, B Yi, S Levine, J Malik
arXiv preprint arXiv:2404.16823, 2024
272024
Building a winning self-driving car in six months
K Burnett, A Schimpe, S Samavi, M Gridseth, CW Liu, Q Li, Z Kroeze, ...
2019 International Conference on Robotics and Automation (ICRA), 9583-9589, 2019
232019
Understanding the complexity gains of single-task rl with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
152023
Learning of coordination policies for robotic swarms
Q Li, X Du, Y Huang, Q Sykora, AP Schoellig
arXiv preprint arXiv:1709.06620, 2017
102017
Accelerating exploration with unlabeled prior data
Q Li, J Zhang, D Ghosh, A Zhang, S Levine
Advances in Neural Information Processing Systems 36, 67434-67458, 2023
82023
REFACTOR: Learning to Extract Theorems from Proofs
JP Zhou, Y Wu, Q Li, R Grosse
arXiv preprint arXiv:2402.17032, 2024
42024
AdaCat: Adaptive categorical discretization for autoregressive models
Q Li, A Jain, P Abbeel
Uncertainty in Artificial Intelligence, 1188-1198, 2022
22022
R-LAtte: Attention Module for Visual Control via Reinforcement Learning
M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–14