Xizhou Zhu

Cited by

	All	Since 2019
Citations	15573	15370
h-index	34	34
i10-index	44	44

6000

3000

1500

4500

2018201920202021202220232024148 304 720 1597 2792 4242 5594

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Xizhou Zhu

Tsinghua University

Verified email at tsinghua.edu.cn


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Deformable detr: Deformable transformers for end-to-end object detection X Zhu, W Su, L Lu, B Li, X Wang, J Dai arXiv preprint arXiv:2010.04159, 2020	5362	2020
Deformable convnets v2: More deformable, better results X Zhu, H Hu, S Lin, J Dai Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	2326	2019
Vl-bert: Pre-training of generic visual-linguistic representations W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai arXiv preprint arXiv:1908.08530, 2019	1843	2019
Deep feature flow for video recognition X Zhu, Y Xiong, J Dai, L Yuan, Y Wei Proceedings of the IEEE conference on computer vision and pattern …, 2017	837	2017
Flow-guided feature aggregation for video object detection X Zhu, Y Wang, J Dai, L Yuan, Y Wei Proceedings of the IEEE international conference on computer vision, 408-417, 2017	803	2017
Internimage: Exploring large-scale vision foundation models with deformable convolutions W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	619	2023
An empirical study of spatial attention mechanisms in deep networks X Zhu, D Cheng, Z Zhang, S Lin, J Dai Proceedings of the IEEE/CVF international conference on computer vision …, 2019	550	2019
Planning-oriented autonomous driving Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	431	2023
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems 36, 2024	342	2024
Towards high performance video object detection X Zhu, J Dai, L Yuan, Y Wei Proceedings of the IEEE conference on computer vision and pattern …, 2018	324	2018
Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision C Yang, Y Chen, H Tian, C Tao, X Zhu, Z Zhang, G Huang, H Li, Y Qiao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	213	2023
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024	135	2024
Uni-perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks X Zhu, J Zhu, H Li, X Wu, H Li, X Wang, J Dai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	123	2022
Spatially adaptive inference with stochastic feature sampling and interpolation Z Xie, Z Zhang, X Zhu, G Huang, S Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	113	2020
Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe H Li, C Sima, J Dai, W Wang, L Lu, H Wang, J Zeng, Z Li, J Yang, H Deng, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023	112	2023
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, M Zhong, Q Zhang, X Zhu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	88	2024
An uncertainty-aware approach for exploratory microblog retrieval M Liu, S Liu, X Zhu, Q Liao, F Wei, S Pan IEEE transactions on visualization and computer graphics 22 (1), 250-259, 2015	81	2015
Siamese image modeling for self-supervised vision representation learning C Tao, X Zhu, W Su, G Huang, B Li, J Zhou, Y Qiao, X Wang, J Dai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	80	2023
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ... arXiv preprint arXiv:2305.05662, 2023	74	2023
Deformable kernels: Adapting effective receptive fields for object deformation H Gao, X Zhu, S Lin, J Dai arXiv preprint arXiv:1910.02940, 2019	73	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by