Подписаться
Kevin Lin
Kevin Lin
Подтвержден адрес электронной почты в домене microsoft.com - Главная страница
Название
Процитировано
Процитировано
Год
Deep learning of binary hash codes for fast image retrieval
K Lin, HF Yang, JH Hsiao, CS Chen
IEEE Conference on Computer Vision and Pattern Recognition Workshops, 27-35, 2015
7632015
End-to-end human pose and mesh reconstruction with transformers
K Lin, L Wang, Z Liu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 1954-1963, 2021
6902021
GIT: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
Transactions on Machine Learning Research (TMLR), 2022
4652022
Adversarial ranking for language generation
K Lin, D Li, X He, Z Zhang, MT Sun
Advances in Neural Information Processing Systems (NeurIPS), 3158-3168, 2017
4292017
Learning compact binary descriptors with unsupervised deep neural networks
K Lin, J Lu, CS Chen, J Zhou
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1183-1192, 2016
4232016
Supervised learning of semantics-preserving hash via deep convolutional neural networks
HF Yang, K Lin, CS Chen
IEEE Transactions on Pattern Analysis and Machine Intelligence 40 (2), 437-451, 2018
4012018
The dawn of lmms: Preliminary explorations with gpt-4v (ision)
Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang
arXiv preprint arXiv:2309.17421, 2023
3902023
Mesh graphormer
K Lin, L Wang, Z Liu
IEEE/CVF International Conference on Computer Vision (ICCV), 12939-12948, 2021
3292021
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
2702023
Mm-vet: Evaluating large multimodal models for integrated capabilities
W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang
ICML 2024, 2024
2612024
Mitigating hallucination in large multi-modal models via robust instruction tuning
F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang
ICLR 2024, 2024
251*2024
SwinBERT: End-to-end transformers with sparse attention for video captioning
K Lin, L Li, CC Lin, F Ahmed, Z Gan, Z Liu, Y Lu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 17949 …, 2022
2422022
VIOLET: End-to-end video-language transformers with masked visual-token modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv preprint arXiv:2111.12681, 2021
2002021
Abandoned object detection via temporal consistency modeling and back-tracing verification for visual surveillance
K Lin, SC Chen, CS Chen, DTD Lin, YP Hung
IEEE Transactions on Information Forensic and Security 10 (7), 1359-1370, 2015
1172015
Vivo: Visual vocabulary pre-training for novel object captioning
X Hu, X Yin, K Lin, L Zhang, J Gao, L Wang, Z Liu
Proceedings of the AAAI Conference on Artificial Intelligence, 1575-1583, 2021
114*2021
Reco: Region-controlled text-to-image generation
Z Yang, J Wang, Z Gan, L Li, K Lin, C Wu, N Duan, Z Liu, C Liu, M Zeng, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14246 …, 2023
972023
Cross-domain complementary learning using pose for multi-person part segmentation
K Lin, L Wang, K Luo, Y Chen, Z Liu, MT Sun
IEEE Transactions on Circuits and Systems for Video Technology 31 (3), 1066 …, 2020
972020
Rapid clothing retrieval via deep learning of binary codes and hierarchical search
K Lin, HF Yang, KH Liu, JH Hsiao, CS Chen
ACM International Conference on Multimedia Retrieval (ICMR), 499–502, 2015
912015
Lavender: Unifying video-language understanding as masked language modeling
L Li, Z Gan, K Lin, CC Lin, Z Liu, C Liu, L Wang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 23119 …, 2023
792023
Unsupervised deep learning of compact binary descriptors
K Lin, J Lu, CS Chen, J Zhou, MT Sun
IEEE Transactions on Pattern Analysis and Machine Intelligence 41 (6), 1501-1514, 2019
782019
В данный момент система не может выполнить эту операцию. Повторите попытку позднее.
Статьи 1–20