Visual Question Answering On Visual7W
Metrics
Percentage correct
Results
Performance results of various models on this benchmark
Model Name | Percentage correct | Paper Title | Repository |
---|---|---|---|
CFR | 71.9 | Coarse-to-Fine Reasoning for Visual Question Answering | |
CTI (with Boxes) | 72.3 | Compact Trilinear Interaction for Visual Question Answering | |
CMN | 72.53 | Modeling Relationships in Referential Expressions with Compositional Modular Networks | |
MCB+Att. | 62.2 | Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding |
0 of 4 row(s) selected.