Copy reference, caption or embed code

Figure 3 - ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models

Figure 3: Human evaluation score distribution of four methods.
Human evaluation score distribution of four methods.
Go to figure page
Reference
Caption
Embed code