Multi-agent actor-critic for mixed cooperative-competitive environments R Lowe, YI Wu, A Tamar, J Harb, OAI Pieter Abbeel, I Mordatch Advances in neural information processing systems 30, 2017 | 4674 | 2017 |
Decision transformer: Reinforcement learning via sequence modeling L Chen, K Lu, A Rajeswaran, K Lee, A Grover, M Laskin, P Abbeel, ... Advances in neural information processing systems 34, 15084-15097, 2021 | 1185 | 2021 |
Palm-e: An embodied multimodal language model D Driess, F Xia, MSM Sajjadi, C Lynch, A Chowdhery, B Ichter, A Wahid, ... arXiv preprint arXiv:2303.03378, 2023 | 841 | 2023 |
Emergent tool use from multi-agent autocurricula B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ... arXiv preprint arXiv:1909.07528, 2019 | 779 | 2019 |
Emergence of grounded compositional language in multi-agent populations I Mordatch, P Abbeel Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 746 | 2018 |
Language models as zero-shot planners: Extracting actionable knowledge for embodied agents W Huang, P Abbeel, D Pathak, I Mordatch International Conference on Machine Learning, 9118-9147, 2022 | 636 | 2022 |
Implicit generation and modeling with energy based models Y Du, I Mordatch Advances in Neural Information Processing Systems 32, 2019 | 630 | 2019 |
Learning with opponent-learning awareness JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch arXiv preprint arXiv:1709.04326, 2017 | 575 | 2017 |
Discovery of complex behaviors through contact-invariant optimization I Mordatch, E Todorov, Z Popović ACM Transactions on Graphics (ToG) 31 (4), 1-8, 2012 | 563 | 2012 |
Inner monologue: Embodied reasoning through planning with language models W Huang, F Xia, T Xiao, H Chan, J Liang, P Florence, A Zeng, J Tompson, ... arXiv preprint arXiv:2207.05608, 2022 | 521 | 2022 |
Emergent complexity via multi-agent competition T Bansal, J Pachocki, S Sidor, I Sutskever, I Mordatch arXiv preprint arXiv:1710.03748, 2017 | 458 | 2017 |
Rt-1: Robotics transformer for real-world control at scale A Brohan, N Brown, J Carbajal, Y Chebotar, J Dabis, C Finn, ... arXiv preprint arXiv:2212.06817, 2022 | 435 | 2022 |
Continuous adaptation via meta-learning in nonstationary and competitive environments M Al-Shedivat, T Bansal, Y Burda, I Sutskever, I Mordatch, P Abbeel arXiv preprint arXiv:1710.03641, 2017 | 394 | 2017 |
Feature-based locomotion controllers M De Lasa, I Mordatch, A Hertzmann ACM transactions on graphics (TOG) 29 (4), 1-10, 2010 | 325 | 2010 |
Rt-2: Vision-language-action models transfer web knowledge to robotic control A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ... arXiv preprint arXiv:2307.15818, 2023 | 298 | 2023 |
Frozen pretrained transformers as universal computation engines K Lu, A Grover, P Abbeel, I Mordatch Proceedings of the AAAI conference on artificial intelligence 36 (7), 7628-7636, 2022 | 258 | 2022 |
Transfer from simulation to real world through learning deep inverse dynamics model P Christiano, Z Shah, I Mordatch, J Schneider, T Blackwell, J Tobin, ... arXiv preprint arXiv:1610.03518, 2016 | 257 | 2016 |
Robust physics-based locomotion using low-dimensional planning I Mordatch, M De Lasa, A Hertzmann ACM SIGGRAPH 2010 papers, 1-8, 2010 | 240 | 2010 |
Implicit behavioral cloning P Florence, C Lynch, A Zeng, OA Ramirez, A Wahid, L Downs, A Wong, ... Conference on Robot Learning, 158-168, 2022 | 238 | 2022 |
Plan online, learn offline: Efficient learning and exploration via model-based control K Lowrey, A Rajeswaran, S Kakade, E Todorov, I Mordatch arXiv preprint arXiv:1811.01848, 2018 | 232 | 2018 |