Video Question Answering on SUTD-TrafficQA
Metrics
1/4
Results
Performance results of various models on this benchmark
Model Name | 1/4 | Paper Title | Repository |
---|---|---|---|
Tem-adapter | 46.0 | Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer | |
TVQA | 35.16 | TVQA: Localized, Compositional Video Question Answering | |
CFMMC-Align | 50.2 | - | - |
HCRN | 36.49 | Hierarchical Conditional Relation Networks for Video Question Answering | |
Eclipse | 37.05 | SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | |
VIS+LST | 29.91 | Exploring Models and Data for Image Question Answering | |