Folgen
Shun Zhang
Shun Zhang
MIT-IBM Watson AI Lab
Bestätigte E-Mail-Adresse bei ibm.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Autonomous intersection management for semi-autonomous vehicles
TC Au, S Zhang, P Stone
Routledge Handbook of Transportation, 88-104, 2015
1422015
Prompting Decision Transformer for Few-Shot Policy Generalization
M Xu, Y Shen, S Zhang, Y Lu, D Zhao, JB Tenenbaum, C Gan
International Conference on Machine Learning, 2022
812022
Planning with large language models for code generation
S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan
arXiv preprint arXiv:2303.05510, 2023
532023
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes.
S Zhang, EH Durfee, S Singh
IJCAI, 4867-4873, 2018
442018
Determining placements of influencing agents in a flock
K Genter, S Zhang, P Stone
Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015
302015
Semi-autonomous intersection management.
TC Au, S Zhang, P Stone
AAMAS, 1451-1452, 2014
292014
Hyper-decision transformer for efficient online policy adaptation
M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan
arXiv preprint arXiv:2304.08487, 2023
172023
Modeling sensory-motor decisions in natural behavior
R Zhang, S Zhang, MH Tong, Y Cui, CA Rothkopf, DH Ballard, ...
PLoS computational biology 14 (10), e1006518, 2018
132018
Querying to find a safe policy under uncertain safety constraints in markov decision processes
S Zhang, E Durfee, S Singh
Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2552-2559, 2020
102020
Approximately-optimal queries for planning in reward-uncertain Markov decision processes
S Zhang, E Durfee, S Singh
Proceedings of the International Conference on Automated Planning and …, 2017
92017
From specification to topology: Automatic power converter design via reinforcement learning
S Fan, N Cao, S Zhang, J Li, X Guo, X Zhang
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021
82021
Modeling Task Control of Gaze
M Tong, S Zhang, L Johnson, D Ballard, M Hayhoe
Journal of Vision 15 (12), 784-784, 2015
42015
Adaptive Online Replanning with Diffusion Models
S Zhou, Y Du, S Zhang, M Xu, Y Shen, W Xiao, DY Yeung, C Gan
Advances in Neural Information Processing Systems 36, 2023
32023
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan
arXiv preprint arXiv:2401.16635, 2024
12024
Power Converter Circuit Design Automation using Parallel Monte Carlo Tree Search
S Fan, S Zhang, J Liu, N Cao, X Guo, J Li, X Zhang
ACM Transactions on Design Automation of Electronic Systems (TODAES), 2022
12022
Modeling Sensorimotor Behavior through Modular Inverse Reinforcement Learning with Discount Factors
R Zhang, S Zhang, MH Tong, MM Hayhoe, DH Ballard
Journal of Vision 17 (10), 1267-1267, 2017
12017
Parameterized modular inverse reinforcement learning
S Zhang
12015
Intersection Management With Constraint-Based Reservation Systems
TC Au, S Zhang, P Stone
Autonomous Robots and Multirobot Systems (ARMS), 2014
12014
Efficiently Finding Approximately-Optimal Queries for Improving Policies and Guaranteeing Safety
S Zhang
2020
On Querying for Safe Optimality in Factored Markov Decision Processes
S Zhang, EH Durfee, S Singh
Proceedings of the 17th International Conference on Autonomous Agents and …, 2018
2018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20