Visual Question Answering Vqa On Pmc Vqa
Metrics
Accuracy
Results
Performance results of various models on this benchmark
Model Name | Accuracy | Paper Title | Repository |
---|---|---|---|
BLIP-2 | 24.3 | BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | |
Open-Flamingo | 26.4 | Flamingo: a Visual Language Model for Few-Shot Learning | |
PMC-CLIP | 24.7 | PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents | |
MedVInT | 42.3 | PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering |
0 of 4 row(s) selected.