Cvt: Introducing convolutions to vision transformers H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang Proceedings of the IEEE/CVF international conference on computer vision, 22-31, 2021 | 2265 | 2021 |
Cswin transformer: A general vision transformer backbone with cross-shaped windows X Dong, J Bao, D Chen, W Zhang, N Yu, L Yuan, D Chen, B Guo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 1137 | 2022 |
Dynamic convolution: Attention over convolution kernels Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 1098 | 2020 |
Grounded language-image pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 1055 | 2022 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 893 | 2021 |
Image completion with structure propagation J Sun, L Yuan, J Jia, HY Shum ACM Siggraph 2005 Papers, 861-868, 2005 | 889 | 2005 |
Image deblurring with blurred/noisy image pairs L Yuan, J Sun, L Quan, HY Shum ACM SIGGRAPH 2007 papers, 1-es, 2007 | 876 | 2007 |
Deep feature flow for video recognition X Zhu, Y Xiong, J Dai, L Yuan, Y Wei Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 856 | 2017 |
Flow-guided feature aggregation for video object detection X Zhu, Y Wang, J Dai, L Yuan, Y Wei Proceedings of the IEEE international conference on computer vision, 408-417, 2017 | 823 | 2017 |
Gated context aggregation network for image dehazing and deraining D Chen, M He, Q Fan, J Liao, L Zhang, D Hou, L Yuan, G Hua 2019 IEEE winter conference on applications of computer vision (WACV), 1375-1383, 2019 | 808 | 2019 |
Vector quantized diffusion model for text-to-image synthesis S Gu, D Chen, J Bao, F Wen, B Zhang, D Chen, L Yuan, B Guo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 795 | 2022 |
Bidirectional learning for domain adaptation of semantic segmentation Y Li, L Yuan, N Vasconcelos Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 760 | 2019 |
Rethinking classification and localization for object detection Y Wu, Y Chen, L Yuan, Z Liu, L Wang, H Li, Y Fu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 724 | 2020 |
Dynamic head: Unifying object detection heads with attentions X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 677 | 2021 |
Stylebank: An explicit representation for neural image style transfer D Chen, L Yuan, J Liao, N Yu, G Hua Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 582 | 2017 |
Mobile-former: Bridging mobilenet and transformer Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 580 | 2022 |
Visual attribute transfer through deep image analogy J Liao, Y Yao, L Yuan, G Hua, SB Kang arXiv preprint arXiv:1705.01088, 2017 | 547 | 2017 |
Regionclip: Region-based language-image pretraining Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 544 | 2022 |
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... arXiv preprint arXiv:2404.14219, 2024 | 541 | 2024 |
Focal self-attention for local-global interactions in vision transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao arXiv preprint arXiv:2107.00641, 2021 | 483 | 2021 |