Shun Zhang

Zitiert von

	Alle	Seit 2019
Zitate	506	409
h-index	10	10
i10-index	10	10

140

105

201420152016201720182019202020212022202320243 5 16 32 39 23 41 41 49 121 134

Öffentlicher Zugriff

Alle anzeigen

9 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Peter StoneProfessor of Computer Science, The University of Texas at AustinBestätigte E-Mail-Adresse bei cs.utexas.edu
Chuang GanUMass Amherst | MIT-IBM Watson AI LabBestätigte E-Mail-Adresse bei csail.mit.edu
Tsz-Chiu AuUlsan National Institute of Science and TechnologyBestätigte E-Mail-Adresse bei cs.utexas.edu
Edmund DurfeeProfessor Emeritus of Computer Science and Engineering, University of MichiganBestätigte E-Mail-Adresse bei umich.edu
Satinder SinghGoogle DeepMind / U. of MichiganBestätigte E-Mail-Adresse bei umich.edu
Dana BallardProfessor of Computer Science, University of Texas at AustinBestätigte E-Mail-Adresse bei cs.utexas.edu
Mary HayhoeProfessor of Psychology, University of Texas AustinBestätigte E-Mail-Adresse bei utexas.edu
Matthew TongIBM ResearchBestätigte E-Mail-Adresse bei alumni.ucsd.edu
Xin ZhangIBM Thomas J. Watson Research Center / Columbia UniversityBestätigte E-Mail-Adresse bei us.ibm.com
Ruohan ZhangStanford UniversityBestätigte E-Mail-Adresse bei stanford.edu

Folgen

Shun Zhang

MIT-IBM Watson AI Lab

Bestätigte E-Mail-Adresse bei ibm.com - Startseite

reinforcement learning human-agent interaction value alignment AI safety


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Autonomous intersection management for semi-autonomous vehicles TC Au, S Zhang, P Stone Routledge Handbook of Transportation, 88-104, 2015	147	2015
Prompting Decision Transformer for Few-Shot Policy Generalization M Xu, Y Shen, S Zhang, Y Lu, D Zhao, JB Tenenbaum, C Gan International Conference on Machine Learning, 2022	98	2022
Planning with large language models for code generation S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan arXiv preprint arXiv:2303.05510, 2023	71	2023
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes. S Zhang, EH Durfee, S Singh IJCAI, 4867-4873, 2018	45	2018
Determining placements of influencing agents in a flock K Genter, S Zhang, P Stone Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	30	2015
Hyper-decision transformer for efficient online policy adaptation M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan arXiv preprint arXiv:2304.08487, 2023	29	2023
Semi-autonomous intersection management. TC Au, S Zhang, P Stone AAMAS, 1451-1452, 2014	28	2014
Modeling sensory-motor decisions in natural behavior R Zhang, S Zhang, MH Tong, Y Cui, CA Rothkopf, DH Ballard, ... PLoS computational biology 14 (10), e1006518, 2018	13	2018
Querying to find a safe policy under uncertain safety constraints in markov decision processes S Zhang, E Durfee, S Singh Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2552-2559, 2020	11	2020
From specification to topology: Automatic power converter design via reinforcement learning S Fan, N Cao, S Zhang, J Li, X Guo, X Zhang 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021	10	2021
Approximately-optimal queries for planning in reward-uncertain Markov decision processes S Zhang, E Durfee, S Singh Proceedings of the International Conference on Automated Planning and …, 2017	9	2017
Modeling Task Control of Gaze M Tong, S Zhang, L Johnson, D Ballard, M Hayhoe Journal of Vision 15 (12), 784-784, 2015	4	2015
Improving reinforcement learning from human feedback with efficient reward model ensemble S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan arXiv preprint arXiv:2401.16635, 2024	3	2024
Adaptive Online Replanning with Diffusion Models S Zhou, Y Du, S Zhang, M Xu, Y Shen, W Xiao, DY Yeung, C Gan Advances in Neural Information Processing Systems 36, 2023	3	2023
Power Converter Circuit Design Automation using Parallel Monte Carlo Tree Search S Fan, S Zhang, J Liu, N Cao, X Guo, J Li, X Zhang ACM Transactions on Design Automation of Electronic Systems (TODAES), 2022	2	2022
Modeling Sensorimotor Behavior through Modular Inverse Reinforcement Learning with Discount Factors R Zhang, S Zhang, MH Tong, MM Hayhoe, DH Ballard Journal of Vision 17 (10), 1267-1267, 2017	1	2017
Parameterized modular inverse reinforcement learning S Zhang	1	2015
Intersection Management With Constraint-Based Reservation Systems TC Au, S Zhang, P Stone Autonomous Robots and Multirobot Systems (ARMS), 2014	1	2014
LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits CC Chang, Y Shen, S Fan, J Li, S Zhang, N Cao, Y Chen, X Zhang Forty-first International Conference on Machine Learning, 2024		2024
Efficiently Finding Approximately-Optimal Queries for Improving Policies and Guaranteeing Safety S Zhang		2020

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren