Folgen
Sam Toyer
Sam Toyer
OpenAI
Bestätigte E-Mail-Adresse bei berkeley.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Variational discriminator bottleneck: Improving imitation learning, inverse RL, and GANs by constraining information flow
XB Peng, A Kanazawa, S Toyer, P Abbeel, S Levine
ICLR 2019, 2018
2752018
Openai o1 system card
A Jaech, A Kalai, A Lerer, A Richardson, A El-Kishky, A Low, A Helyar, ...
arXiv preprint arXiv:2412.16720, 2024
1402024
Action Schema Networks: Generalised Policies with Deep Learning
S Toyer, F Trevizan, S Thiebaux, L Xie
AAAI Conference on Artificial Intelligence (AAAI), 2018
1302018
Asnets: Deep learning for generalised planning
S Toyer, S Thiébaux, F Trevizan, L Xie
Journal of Artificial Intelligence Research 68, 1-68, 2020
1022020
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
S Toyer, O Watkins, EA Mendes, J Svegliato, L Bailey, T Wang, I Ong, ...
ICLR 2024, 2023
85*2023
A strongreject for empty jailbreaks
A Souly, Q Lu, D Bowen, T Trinh, E Hsieh, S Pandey, P Abbeel, ...
arXiv preprint arXiv:2402.10260, 2024
69*2024
imitation: Clean imitation learning implementations
A Gleave, M Taufeeque, J Rocamonde, E Jenner, SH Wang, S Toyer, ...
arXiv preprint arXiv:2211.11972, 2022
682022
Human pose forecasting via deep Markov models
S Toyer, A Cherian, T Han, S Gould
International Conference on Digital Image Computing: Techniques and …, 2017
582017
The MAGICAL Benchmark for Robust Imitation
S Toyer, R Shah, A Critch, S Russell
NeurIPS 2020, 2020
542020
An Empirical Investigation of Representation Learning for Imitation
X Chen, S Toyer, C Wild, S Emmons, I Fischer, KH Lee, N Alex, SH Wang, ...
NeurIPS 2021, Datasets and Benchmarks Track, 2021
332021
Deliberative alignment: Reasoning enables safer language models
MY Guan, M Joglekar, E Wallace, S Jain, B Barak, A Helyar, R Dias, ...
arXiv preprint arXiv:2412.16339, 2024
282024
A primer on maximum causal entropy inverse reinforcement learning
A Gleave, S Toyer
arXiv preprint arXiv:2203.11409, 2022
242022
The imitation library for imitation learning and inverse reinforcement learning
S Wang, S Toyer, A Gleave, S Emmons
242020
Guiding search with generalized policies for probabilistic planning
W Shen, F Trevizan, S Toyer, S Thiébaux, L Xie
Proceedings of the international symposium on combinatorial search 10 (1 …, 2019
192019
Publishing and Using Earth Observation Data with the RDF Data Cube and the Discrete Global Grid System
D Brizhinev, S Toyer, K Taylor
https://www.w3.org/TR/eo-qb/, 2017
192017
Derail: Diagnostic environments for reward and imitation learning
P Freire, A Gleave, S Toyer, S Russell
arXiv preprint arXiv:2012.01365, 2020
102020
seals: Suite of environments for algorithms that learn specifications
A Gleave, P Freire, S Wang, S Toyer
92020
Computer vision training using paired image data
S Gould, S Toyer, D Reiner
US Patent App. 16/360,954, 2019
92019
Trading inference-time compute for adversarial robustness
W Zaremba, E Nitishinskaya, B Barak, S Lin, S Toyer, Y Yu, R Dias, ...
arXiv preprint arXiv:2501.18841, 2025
72025
Variational discriminator bottleneck: improving imitation learning
XB Peng, A Kanazawa, S Toyer, P Abbeel, S Levine
Inverse RL, and GANs by Constraining Information Flow.[(accessed on 29 …, 2018
62018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20