Meta-Transformer: A Unified Framework for Multimodal Learning Y Zhang, K Gong, K Zhang, H Li, Y Qiao, W Ouyang, X Yue arXiv preprint arXiv:2307.10802, 2023 | 62 | 2023 |
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification Y Zhang, S Zhao, Y Kang, J Shen European Conference on Computer Vision (ECCV), 462-479, 2022 | 28 | 2022 |
Dual-semantic consistency learning for visible-infrared person re-identification Y Zhang, Y Kang, S Zhao, J Shen IEEE Transactions on Information Forensics and Security 18, 1554-1565, 2022 | 16 | 2022 |
UniRepLKNet: A Universal Perception Large-Kernel Convnet for Audio, Video, Point cloud, Time-series and Image Recognition X Ding, Y Zhang, Y Ge, S Zhao, L Song, X Yue, Y Shan IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 11 | 2024 |
OneLLM: One Framework to Align All Modalities with Language J Han, K Gong, Y Zhang, J Wang, K Zhang, D Lin, Y Qiao, P Gao, X Yue IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 10 | 2024 |
Online Vectorized HD Map Construction using Geometry Z Zhang, Y Zhang, X Ding, F Jin, X Yue arXiv preprint arXiv:2312.03341, 2023 | 3 | 2023 |
Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors L Ding, S Dong, Z Huang, Z Wang, Y Zhang, K Gong, D Xu, T Xue IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 2 | 2024 |
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities Y Zhang, X Ding, K Gong, Y Ge, Y Shan, X Yue IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 1 | 2024 |