Visual Question Answering on VCR (Q-AR) dev
Metrics
Accuracy
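In the VCR Q→AR setting, a prediction is normally counted as correct only when both the selected answer and the selected rationale match the ground truth. The following is a minimal sketch of that joint accuracy, not an official evaluation script: the function name `q_ar_accuracy`, its argument names, and the choice to report a percentage are assumptions made for illustration.

```python
from typing import Sequence


def q_ar_accuracy(
    answer_pred: Sequence[int],
    answer_gold: Sequence[int],
    rationale_pred: Sequence[int],
    rationale_gold: Sequence[int],
) -> float:
    """Joint Q->AR accuracy in percent (hypothetical helper).

    An example is correct only if BOTH the predicted answer index and the
    predicted rationale index equal their gold indices.
    """
    n = len(answer_gold)
    if n == 0:
        raise ValueError("at least one example is required")
    if not (len(answer_pred) == len(rationale_pred) == len(rationale_gold) == n):
        raise ValueError("all inputs must have the same length")

    # Count examples where the answer and the rationale are both right.
    correct = sum(
        1
        for a_p, a_g, r_p, r_g in zip(
            answer_pred, answer_gold, rationale_pred, rationale_gold
        )
        if a_p == a_g and r_p == r_g
    )
    return 100.0 * correct / n
```

Because both choices must be right, the joint score can never exceed the separate answer (Q→A) or rationale (QA→R) accuracies measured on the same examples.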
Results
Accuracy of the models reported on this benchmark.
Model Name | Accuracy (%) | Paper Title | Repository |
---|---|---|---|
VL-BERT_BASE | 55.2 | VL-BERT: Pre-training of Generic Visual-Linguistic Representations | |
VL-BERT_LARGE | 58.9 | VL-BERT: Pre-training of Generic Visual-Linguistic Representations | |
VisualBERT | 52.2 | VisualBERT: A Simple and Performant Baseline for Vision and Language | |