Follow
Roy Ganz
Roy Ganz
Ph.D student, Technion
Verified email at campus.technion.ac.il - Homepage
Title
Cited by
Cited by
Year
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions
N Rotstein, D Bensaid, S Brody, R Ganz, R Kimmel
arXiv preprint arXiv:2305.17718, 2023
542023
Enhancing diffusion-based image synthesis with robust classifier guidance
B Kawar, R Ganz, M Elad
Transactions on Machine Learning Research, 2022
412022
Threat model-agnostic adversarial defense using diffusion models
T Blau, R Ganz, B Kawar, A Bronstein, M Elad
arXiv preprint arXiv:2207.08089, 2022
342022
Multimodal semi-supervised learning for text recognition
A Aberdam, R Ganz, S Mazor, R Litman
arXiv preprint arXiv:2205.03873, 2022
262022
Clipter: Looking at the bigger picture in scene text recognition
A Aberdam, D Bensaīd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
202023
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
R Ganz, B Kawar, M Elad
ICML 2023, 2022
19*2022
Question aware vision transformer for multimodal reasoning
R Ganz, Y Kittenplon, A Aberdam, E Ben Avraham, O Nuriel, S Mazor, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
172024
Towards models that can see and read
R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman
Proceedings of the IEEE/CVF international conference on computer vision …, 2023
152023
Clipag: Towards generator-free text-to-image generation
R Ganz, M Elad
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
112024
GRAM: Global reasoning for multi-page VQA
T Blau, S Fogel, R Ronen, A Golts, R Ganz, E Ben Avraham, A Aberdam, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
82024
BIGRoC: Boosting Image Generation via a Robust Classifier
R Ganz, M Elad
Transactions on Machine Learning Research, 2021
82021
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
N Wasserman, N Rotstein, R Ganz, R Kimmel
arXiv preprint arXiv:2404.18212, 2024
72024
Classifier robustness enhancement via test-time transformation
T Blau, R Ganz, C Baskin, M Elad, A Bronstein
arXiv preprint arXiv:2303.15409, 2023
72023
Improved Image Generation via Sparse Modeling
R Ganz, M Elad
ICLR Workshop on Deep Generative Models for Highly Structured Data, 2021
2*2021
Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination
S Golan, R Ganz, M Elad
arXiv preprint arXiv:2405.16260, 2024
12024
DocVLM: Make Your VLM an Efficient Reader
MS Nacson, A Aberdam, R Ganz, EB Avraham, A Golts, Y Kittenplon, ...
arXiv preprint arXiv:2412.08746, 2024
2024
DocVLM: Make Your VLM an Efficient Reader
M Shpigel Nacson, A Aberdam, R Ganz, E Ben Avraham, A Golts, ...
arXiv e-prints, arXiv: 2412.08746, 2024
2024
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
J Fhima, EB Avraham, O Nuriel, Y Kittenplon, R Ganz, A Aberdam, ...
arXiv preprint arXiv:2411.04642, 2024
2024
Text-to-Image Generation Via Energy-Based CLIP
R Ganz, M Elad
arXiv preprint arXiv:2408.17046, 2024
2024
Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness
M Ehrenberg, R Ganz, N Rosenfeld
arXiv preprint arXiv:2406.11458, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20