Title | Authors | Venue | Cited by | Year |
Multi-evidence filtering and fusion for multi-label classification, object detection and semantic segmentation based on weakly supervised learning | W Ge, S Yang, Y Yu | Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 248 | 2018 |
Dynamic graph attention for referring expression comprehension | S Yang, G Li, Y Yu | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 226 | 2019 |
Cross-modal relationship inference for grounding referring expressions | S Yang, G Li, Y Yu | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 124 | 2019 |
Preservational learning improves self-supervised medical image models by reconstructing diverse contexts | HY Zhou, C Lu, S Yang, X Han, Y Yu | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 103 | 2021 |
Graph-structured referring expression reasoning in the wild | S Yang, G Li, Y Yu | Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 94 | 2020 |
Bottom-up shift and reasoning for referring image segmentation | S Yang, M Xia, G Li, HY Zhou, Y Yu | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 86 | 2021 |
Non-local context encoder: Robust biomedical image segmentation against adversarial attacks | X He, S Yang, G Li, H Li, H Chang, Y Yu | Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8417-8424, 2019 | 80 | 2019 |
Convnets vs. transformers: Whose visual representations are more transferable? | HY Zhou, C Lu, S Yang, Y Yu | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 65 | 2021 |
DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance | L Zhang, Q Qiu, H Lin, Q Zhang, C Shi, W Yang, Y Shi, S Yang, L Xu, J Yu | SIGGRAPH, 2023 | 62 | 2023 |
A survey on graph neural networks and graph transformers in computer vision: a task-oriented perspective | C Chen, Y Wu, Q Dai, HY Zhou, M Xu, S Yang, X Han, Y Yu | TPAMI 2024, 2024 | 54 | 2024 |
Relationship-embedded representation learning for grounding referring expressions | S Yang, G Li, Y Yu | IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (8), 2765-2779, 2020 | 53 | 2020 |
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models | G Zheng, B Yang, J Tang, HY Zhou, S Yang | NeurIPS 2023, 2023 | 52 | 2023 |
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator | H Huang, Y Feng, C Shi, L Xu, J Yu, S Yang | NeurIPS 2023, 2023 | 46 | 2023 |
A unified visual information preservation framework for self-supervised pre-training in medical image analysis | HY Zhou, C Lu, C Chen, S Yang, Y Yu | IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (7), 8020-8035, 2023 | 39 | 2023 |
Contrastive grouping with transformer for referring image segmentation | J Tang, G Zheng, C Shi, S Yang | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 35 | 2023 |
Propagating over phrase relations for one-stage visual grounding | S Yang, G Li, Y Yu | Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 32 | 2020 |
EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment | C Shi, S Yang | IEEE International Conference on Computer Vision (ICCV), 2023 | 31 | 2023 |
Structured attention network for referring image segmentation | L Lin, P Yan, X Xu, S Yang, K Zeng, G Li | IEEE Transactions on Multimedia 24, 1922-1932, 2021 | 30 | 2021 |
TransXNet: learning both global and local dynamics with a dual dynamic token mixer for visual recognition | M Lou, HY Zhou, S Yang, Y Yu | arXiv preprint arXiv:2310.19380, 2023 | 21 | 2023 |
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models | C Shi, S Yang | IEEE International Conference on Computer Vision (ICCV), 2023 | 20 | 2023 |