Folgen
Hexiang (Frank) Hu
Hexiang (Frank) Hu
Sonstige NamenHexiang Hu, Frank Hu
xAI
Bestätigte E-Mail-Adresse bei x.ai - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
Technical Report, 2023
25172023
Few-shot learning via embedding adaptation with set-to-set functions
HJ Ye, H Hu, DC Zhan, F Sha
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
980*2020
Compressed Video Action Recognition
CY Wu, M Zaheer, H Hu, R Manmatha, AJ Smola, P Krähenbühl
Computer Vision and Pattern Recognition (CVPR), 2018 Proceedings of …, 2017
4162017
Structure inference machines: Recurrent neural networks for analyzing relations in group activity recognition
Z Deng, A Vahdat, H Hu, G Mori
Computer Vision and Pattern Recognition (CVPR), 2016 Proceedings of IEEE …, 2016
3032016
Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
R Vuorio, SH Sun, H Hu, JJ Lim
Advances in Neural Information Processing Systems (NeurIPS) 2019, 2019
302*2019
Learning the best pooling strategy for visual semantic embedding
J Chen, H Hu, H Wu, Y Jiang, C Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
2712021
Pix2Struct: Screenshot parsing as pretraining for visual language understanding
K Lee, M Joshi, I Turc, H Hu, F Liu, J Eisenschlos, U Khandelwal, P Shaw, ...
ICML 2023, 2023
2582023
Engaging image captioning via personality
K Shuster, S Humeau, H Hu, A Bordes, J Weston
Computer Vision and Pattern Recognition (CVPR), 2019 Proceedings of IEEE …, 2018
2012018
Learning structured inference neural networks with label relations
H Hu, GT Zhou, Z Deng, Z Liao, G Mori
Computer Vision and Pattern Recognition (CVPR), 2016 Proceedings of IEEE …, 2016
183*2016
On Scaling Up a Multilingual Vision and Language Model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
167*2024
Cross-Modal and Hierarchical Modeling of Video and Text
B Zhang, H Hu, F Sha
Proceedings of the European Conference on Computer Vision (ECCV), 2018
1672018
Re-imagen: Retrieval-augmented text-to-image generator
W Chen, H Hu, C Saharia, WW Cohen
ICLR 2023, 2022
1652022
Subject-driven text-to-image generation via apprenticeship learning
W Chen, H Hu, Y Li, N Ruiz, X Jia, MW Chang, WW Cohen
NeurIPS 2024 36, 2024
1642024
MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text
W Chen, H Hu, X Chen, P Verga, WW Cohen
EMNLP 2022, 2022
1232022
Learning Adaptive Classifiers Synthesis for Generalized Few-Shot Learning
HJ Ye, H Hu, DC Zhan
International Journal of Computer Vision, 2021
842021
Multi-Task Learning for Sequence Tagging: An Empirical Study
S Changpinyo, H Hu, F Sha
Proceedings of the International Conference on Computational Linguistics …, 2018
842018
BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps
W Zhu, H Hu, J Chen, Z Deng, V Jain, E Ie, F Sha
ACL 2020, 2020
812020
From pixels to ui actions: Learning to follow instructions via graphical user interfaces
P Shaw, M Joshi, J Cohan, J Berant, P Pasupat, H Hu, U Khandelwal, ...
NeurIPS 2024 36, 2024
672024
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Y Chen, H Hu, Y Luan, H Sun, S Changpinyo, A Ritter, MW Chang
EMNLP 2023, 2023
662023
Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets
WL Chao, H Hu, F Sha
The North American Chapter of the Association for Computational Linguistics …, 2018
61*2018
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20