Yuexiang Zhai

Zitiert von

	Alle	Seit 2019
Zitate	524	523
h-index	12	12
i10-index	13	13

240

120

180

2019202020212022202320247 25 60 52 140 236

Öffentlicher Zugriff

Alle anzeigen

5 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Yi Ma (马毅)Professor of EECS, UC Berkeley; Director of IDS & Head of CS, University of Hong KongBestätigte E-Mail-Adresse bei eecs.berkeley.edu
Shengbang TongNYU CourantBestätigte E-Mail-Adresse bei berkeley.edu
Sergey LevineUC Berkeley, Physical IntelligenceBestätigte E-Mail-Adresse bei eecs.berkeley.edu
Qing QuAssistant Professor, Dept. of EECS, University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Zhihui ZhuAssistant Professor, Ohio State UniversityBestätigte E-Mail-Adresse bei osu.edu
Saining XieAssistant Professor at the Courant Institute, New York UniversityBestätigte E-Mail-Adresse bei nyu.edu
Xiao Li (李虓)Ph.D. candidate at University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Zhengyuan ZhouDept of Technology, Operations and Statistics at NYU SternBestätigte E-Mail-Adresse bei stern.nyu.edu
John WrightElectrical Engineering, Columbia UniversityBestätigte E-Mail-Adresse bei columbia.edu
Yann LeCunChief AI Scientist at Facebook & Silver Professor at the Courant Institute, New York UniversityBestätigte E-Mail-Adresse bei cs.nyu.edu
Yong Jae LeeAssociate Professor of Computer Sciences, UW-MadisonBestätigte E-Mail-Adresse bei wisc.edu
Mu CaiCS Ph.D. Student, University of Wisconsin-MadisonBestätigte E-Mail-Adresse bei cs.wisc.edu
Xiao LiThe Chinese University of Hong Kong, ShenzhenBestätigte E-Mail-Adresse bei cuhk.edu.cn
Yuqian ZhangAssistant Professor, Rutgers UniversityBestätigte E-Mail-Adresse bei rutgers.edu
Zitong YangStanford UniversityBestätigte E-Mail-Adresse bei stanford.edu
Zhenyu LiaoApplied Scientist in Amazon Inc.Bestätigte E-Mail-Adresse bei amazon.com
Li-Yi WeiAdobe ResearchBestätigte E-Mail-Adresse bei adobe.com
Haozhi QiUC BerkeleyBestätigte E-Mail-Adresse bei berkeley.edu
Yichao ZhouUC BerkeleyBestätigte E-Mail-Adresse bei berkeley.edu
Qi SunNew York UniversityBestätigte E-Mail-Adresse bei nyu.edu

Folgen

Yuexiang Zhai

Sonstige NamenSimon Zhai

UC Berkeley

Bestätigte E-Mail-Adresse bei berkeley.edu - Startseite

Artificial Intelligence Machine Learning Reinforcement Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Complete dictionary learning via l4-norm maximization over the orthogonal group Y Zhai, Z Yang, Z Liao, J Wright, Y Ma Journal of Machine Learning Research 21 (165), 1-68, 2020	68	2020
Learning to Reconstruct 3D Manhattan Wireframes from a Single Image Y Zhou, H Qi, Y Zhai, Q Sun, Z Chen, LY Wei, Y Ma International Conference on Computer Vision (ICCV), 2019, 2019	66	2019
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma Conference on Parsimony and Learning, 202-227, 2024	64	2024
Eyes wide shut? exploring the visual shortcomings of multimodal llms S Tong, Z Liu, Y Zhai, Y Ma, Y LeCun, S Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	64	2024
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning M Nakamoto, Y Zhai, A Singh, M Sobol Mark, Y Ma, C Finn, A Kumar, ... Advances in Neural Information Processing Systems 36, 2024	61	2024
Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity A Gupta, A Pacchiano, Y Zhai, S Kakade, S Levine Advances in Neural Information Processing Systems 35, 15281-15295, 2022	39	2022
Geometric analysis of nonconvex optimization landscapes for overcomplete learning Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu International Conference on Learning Representations, 2020	31	2020
Convolutional normalization: Improving deep convolutional network robustness and training S Liu, X Li, Y Zhai, C You, Z Zhu, C Fernandez-Granda, Q Qu Advances in neural information processing systems 34, 28919-28928, 2021	25	2021
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness Y Zhai, H Mehta, Z Zhou, Y Ma International Conference on Learning Representations (ICLR), 2020, 2020	21	2020
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning Y Zhai, C Baek, Z Zhou, J Jiao, Y Ma Journal of Artificial Intelligence Research 73, 847-896, 2022	20	2022
Analysis of the optimization landscapes for overcomplete representation learning Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu arXiv preprint arXiv:1912.02427, 2019	17	2019
Lmrl gym: Benchmarks for multi-turn reinforcement learning with language models M Abdulhai, I White, CV Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine	15	2023
Understanding the complexity gains of single-task rl with a curriculum Q Li, Y Zhai, Y Ma, S Levine International Conference on Machine Learning, 20412-20451, 2023	12	2023
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Y Zhai, H Bai, Z Lin, J Pan, S Tong, Y Zhou, A Suhr, S Xie, Y LeCun, Y Ma, ... arXiv preprint arXiv:2405.10292, 2024	5	2024
RLIF: Interactive Imitation Learning as Reinforcement Learning J Luo, P Dong, Y Zhai, Y Ma, S Levine The Twelfth International Conference on Learning Representations (ICLR), 2024	4	2024
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning. M Nakamoto, Y Zhai, A Singh, MS Mark, Y Ma, C Finn, A Kumar, S Levine arXiv preprint arXiv:2303.05479, 2023	4	2023
Closed-Loop Transcription via Convolutional Sparse Coding X Dai, K Chen, S Tong, J Zhang, X Gao, M Li, D Pai, Y Zhai, XI Yuan, ... Conference on Parsimony and Learning. PMLR, 2024., 2024	3	2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? Y Yu, S Buchanan, D Pai, T Chu, Z Wu, S Tong, H Bai, Y Zhai, ... arXiv preprint arXiv:2311.13110, 2023	3	2023
Complete dictionary learning via l4-norm maximization over the orthogonal group Y Zhai, Z Yang, Z Liao, J Wright, Y Ma arXiv preprint arXiv:1906.02435, 2019	2	2019
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement R Zhang, Y Zhai, A Zanette arXiv preprint arXiv:2402.15703, 2024		2024

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren