Visual Question Answering On Vcr Q Ar Dev

Accuracy

Results

Performance results of various models on this benchmark

Model Name	Accuracy	Paper Title
VL-BERTBASE	55.2	VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VL-BERTLARGE	58.9	VL-BERT: Pre-training of Generic Visual-Linguistic Representations
VisualBERT	52.2	VisualBERT: A Simple and Performant Baseline for Vision and Language

0 of 3 row(s) selected.