Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ... European Conference on Computer Vision (ECCV), 2020 | 938* | 2020 |
Large Scale Incremental Learning YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 | 629* | 2019 |
VinVL: Making Visual Representations Matter in Vision-Language Models P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao CVPR2021, 2021 | 444* | 2021 |
Refining of segmental boundaries in speech waveforms using contextual-dependent models Y Zhao, M Chu, JL Zhou, L Wang US Patent 7,496,512, 2009 | 334 | 2009 |
Rethinking Classification and Localization for Object Detection YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 294 | 2020 |
Handwriting-based user interface for correction of speech recognition errors L Wang, FKP Soong US Patent App. 12/042,344, 2009 | 275 | 2009 |
End-to-End Human Pose and Mesh Reconstruction with Transformers K Lin, L Wang, Z Liu CVPR2021, 2020 | 237 | 2020 |
Real-time Animation for an Expressive Avatar N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou US Patent App. 12/950,801, 2012 | 201 | 2012 |
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 182 | 2021 |
End-to-end semi-supervised object detection with soft teacher M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 167 | 2021 |
Speech and text driven HMM-based body animation synthesis L Wang, L Ma, FKP Soong US Patent 8,224,652, 2012 | 162 | 2012 |
PHOTO-REAL TALKING HEAD WITH DEEP BIDIRECTIONAL LSTM B Fan, L Wang, FK Soong, L Xie ICASSP, 2015 | 121 | 2015 |
Incremental classifier learning with generative adversarial networks Y Wu, Y Chen, L Wang, Y Ye, Z Liu, Y Guo, Z Zhang, Y Fu arXiv preprint arXiv:1802.00853, 2018 | 103 | 2018 |
SEED: Self-supervised Distillation For Visual Representation Z Fang, J Wang, L Wang, L Zhang, Y Yang, Z Liu ICLR 2021, 2021 | 95 | 2021 |
Grounded language-image pre-training LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 84 | 2022 |
An empirical study of training end-to-end vision-and-language transformers ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 80 | 2022 |
Mesh graphormer K Lin, L Wang, Z Liu Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 74 | 2021 |
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption Z Yang, Y Lu, J Wang, X Yin, D Florencio, L Wang, C Zhang, L Zhang, ... CVPR2021, 2020 | 70 | 2020 |
Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation MTS Kevin Lin, Lijuan Wang, Kun Luo, Yinpeng Chen, Zicheng Liu IEEE Transactions on Circuits and Systems for Video Technology, 2020 | 61 | 2020 |
Scaling up vision-language pre-training for image captioning X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 59 | 2022 |