Visual Reasoning on Bongard-OpenWorld
Evaluation Metrics
2-Class Accuracy
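As a rough illustration of the metric, the sketch below assumes that 2-Class Accuracy is the percentage of query images correctly assigned to one of two classes (positive or negative), so random guessing would score about 50. The function name `two_class_accuracy` and the toy labels are hypothetical and do not come from the benchmark's own evaluation code.

```python
def two_class_accuracy(predictions, labels):
    """Return the percentage of binary predictions that match the labels.

    Assumption: each query has exactly one of two labels, and the score is
    reported on a 0-100 scale, as in the table on this page.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one query is required")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Hypothetical toy data: four queries, three answered correctly.
labels = ["positive", "negative", "positive", "negative"]
predictions = ["positive", "negative", "negative", "negative"]
score = two_class_accuracy(predictions, labels)
print(f"{score:.1f}")  # 75.0
```

Under this reading, a table entry of 49.3 sits at about chance level, while 93.6 means roughly 94 of every 100 queries were classified correctly.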
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | 2-Class Accuracy |
---|---|
bongard-openworld-few-shot-reasoning-for-free | 91.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
cognitive-paradigms-for-evaluating-vlms-on | 92.8 |
cognitive-paradigms-for-evaluating-vlms-on | 93.6 |
bongard-openworld-few-shot-reasoning-for-free | 63.3 |
bongard-openworld-few-shot-reasoning-for-free | 55.5 |
bongard-openworld-few-shot-reasoning-for-free | 64.0 |
bongard-openworld-few-shot-reasoning-for-free | 49.3 |
bongard-openworld-few-shot-reasoning-for-free | 63.8 |