Folgen
Tengda Han
Tengda Han
Visual Geometry Group, University of Oxford
Bestätigte E-Mail-Adresse bei robots.ox.ac.uk - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Flamingo: a visual language model for few-shot learning
JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ...
Advances in Neural Information Processing Systems 35, 23716-23736, 2022
10992022
Video representation learning by dense predictive coding
T Han, W Xie, A Zisserman
Workshop on Large-scale Holistic Video Understanding, ICCV, 2019
3862019
Self-supervised Co-training for Video Representation Learning
T Han, W Xie, A Zisserman
Conference on Neural Information Processing Systems (NeurIPS), 2020
3662020
Memory-augmented Dense Predictive Coding for Video Representation Learning
T Han, W Xie, A Zisserman
European Conference on Computer Vision (ECCV), 2020, 2020
2312020
Prompting visual-language models for efficient video understanding
C Ju, T Han, K Zheng, Y Zhang, W Xie
European Conference on Computer Vision, 105-124, 2022
1762022
Human pose forecasting via deep markov models
S Toyer, A Cherian, T Han, S Gould
2017 International Conference on Digital Image Computing: Techniques and …, 2017
502017
Temporal Alignment Networks for Long-term Video
T Han, W Xie, A Zisserman
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 2022
422022
WhisperX: Time-accurate speech transcription of long-form audio
M Bain, J Huh, T Han, A Zisserman
arXiv preprint arXiv:2303.00747, 2023
282023
Human action forecasting by learning task grammars
T Han, J Wang, A Cherian, S Gould
arXiv preprint arXiv:1709.06391, 2017
172017
AutoAD: Movie Description in Context
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
82023
Prompt generation networks for efficient adaptation of frozen vision transformers
J Loedeman, MC Stol, T Han, YM Asano
arXiv preprint arXiv:2210.06466, 2022
82022
AutoAD II: The Sequel-Who, When, and What in Movie Audio Description
T Han, M Bain, A Nagrani, G Varol, W Xie, A Zisserman
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
22023
Turbo training with token dropout
T Han, W Xie, A Zisserman
arXiv preprint arXiv:2210.04889, 2022
22022
Semantic Counting from Self-Collages
L Knobel, T Han, YM Asano
arXiv preprint arXiv:2307.08727, 2023
2023
Open-world text-specified object counting
N Amini-Naieni, K Amini-Naieni, T Han, A Zisserman
British Machine Vision Association, 2023
2023
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–15