Vmamba: Visual state space model Y Liu, Y Tian, Y Zhao, H Yu, L Xie, Y Wang, Q Ye, Y Liu arXiv preprint arXiv:2401.10166, 2024 | 94 | 2024 |
Graformer: Graph-oriented transformer for 3d pose estimation W Zhao, W Wang, Y Tian Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 68 | 2022 |
Graformer: Graph convolution transformer for 3d pose estimation W Zhao, Y Tian, Q Ye, J Jiao, W Wang arXiv preprint arXiv:2109.08364, 2021 | 26 | 2021 |
Discretization-aware architecture search Y Tian, C Liu, L Xie, Q Ye Pattern Recognition 120, 108186, 2021 | 25 | 2021 |
Adaptive linear span network for object skeleton detection C Liu, Y Tian, Z Chen, J Jiao, Q Ye IEEE transactions on image processing 30, 5096-5108, 2021 | 25 | 2021 |
Hivit: A simpler and more efficient design of hierarchical vision transformer X Zhang, Y Tian, L Xie, W Huang, Q Dai, Q Ye, Q Tian The Eleventh International Conference on Learning Representations, 2022 | 23 | 2022 |
Hivit: Hierarchical vision transformer meets masked image modeling X Zhang, Y Tian, W Huang, Q Ye, Q Dai, L Xie, Q Tian arXiv preprint arXiv:2205.14949, 2022 | 18 | 2022 |
Integrally pre-trained transformer pyramid networks Y Tian, L Xie, Z Wang, L Wei, X Zhang, J Jiao, Y Wang, Q Tian, Q Ye Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 14 | 2023 |
Semantic-aware generation for self-supervised visual representation learning Y Tian, L Xie, X Zhang, J Fang, H Xu, W Huang, J Jiao, Q Tian, Q Ye arXiv preprint arXiv:2111.13163, 2021 | 12 | 2021 |
Beyond masking: Demystifying token-based pre-training for vision transformers Y Tian, L Xie, J Fang, M Shi, J Peng, X Zhang, J Jiao, Q Tian, Q Ye arXiv preprint arXiv:2203.14313, 2022 | 10 | 2022 |
Genetic feature fusion for object skeleton detection Y Qiao, Y Tian, Y Liu, J Jiao Security and Communication Networks 2021, 1-9, 2021 | 5 | 2021 |
Spatial transform decoupling for oriented object detection H Yu, Y Tian, Q Ye, Y Liu Proceedings of the AAAI Conference on Artificial Intelligence 38 (7), 6782-6790, 2024 | 4 | 2024 |
ChatterBox: Multi-round Multimodal Referring and Grounding Y Tian, T Ma, L Xie, J Qiu, X Tang, Y Zhang, J Jiao, Q Tian, Q Ye arXiv preprint arXiv:2401.13307, 2024 | 1 | 2024 |
Exploring Complicated Search Spaces with Interleaving-Free Sampling Y Tian, L Xie, J Fang, J Jiao, Q Ye, Q Tian arXiv preprint arXiv:2112.02488, 2021 | | 2021 |
Image Compression Is an Effective Objective for Visual Representation Learning Y Tian, L Xie | | |
Integrally Pre-Trained Transformer Pyramid Networks Supplementary Materials Y Tian, L Xie, Z Wang, L Wei, X Zhang, J Jiao, Y Wang, Q Tian, Q Ye | | |