Cross-domain contrastive learning for unsupervised domain adaptation R Wang, Z Wu, Z Weng, J Chen, GJ Qi, YG Jiang TMM 2022, 2022 | 172 | 2022 |
To see is to believe: Prompting GPT-4V for better visual instruction tuning J Wang, L Meng, Z Weng, B He, Z Wu, YG Jiang Technical Report, 2023 | 68 | 2023 |
VideoLT: Large-scale long-tailed video recognition X Zhang, Z Wu, Z Weng, H Fu, J Chen, YG Jiang, LS Davis ICCV 2021, 2021 | 46 | 2021 |
Open-VCLIP: Transforming CLIP to an open-vocabulary video model via interpolated weight optimization Z Weng, X Yang, A Li, Z Wu, YG Jiang ICML 2023, 2023 | 41 | 2023 |
Semi-supervised vision transformers Z Weng, X Yang, A Li, Z Wu, YG Jiang ECCV 2022, 2022 | 40 | 2022 |
Imbalanced gradients: a subtle cause of overestimated adversarial robustness X Ma, L Jiang, H Huang, Z Weng, J Bailey, YG Jiang Machine Learning 2024, 2024 | 25* | 2024 |
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition Z Weng, Z Wu, H Li, J Chen, YG Jiang TOMM 2023, 2023 | 14* | 2023 |
Building an open-vocabulary video CLIP model with better architectures, optimization and data Z Wu, Z Weng, W Peng, X Yang, A Li, LS Davis, YG Jiang TPAMI 2024, 2024 | 12 | 2024 |
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction Z Xing, Q Dai, Z Weng, Z Wu, YG Jiang arXiv preprint arXiv:2406.06465, 2024 | 8 | 2024 |
GenRec: Unifying Video Generation and Recognition with Diffusion Models Z Weng, X Yang, Z Xing, Z Wu, YG Jiang NeurIPS 2024, 2024 | 3 | 2024 |
A multimodal framework for video ads understanding Z Weng, L Meng, R Wang, Z Wu, YG Jiang ACM MM 2021, 2021 | 3 | 2021 |
Exploring the Consistency of Segment-level and Video-level Predictions for Improved Temporal Concept Localization in Videos Z Weng, R Wang, YG Jiang ICCV Workshop 2019, 2019 | 1 | 2019 |