HyperAI超神经

Visual Reasoning On Bongard Openworld

Evaluation Metric

2-Class Accuracy
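
As a rough illustration of the metric, 2-class accuracy can be read as the percentage of query images whose binary label (positive or negative with respect to the concept) is predicted correctly. The sketch below assumes that reading; the function name `two_class_accuracy` and the 0/1 label encoding are hypothetical and are not taken from the benchmark's own evaluation code.

```python
from typing import Sequence


def two_class_accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the percentage of binary predictions that match the labels.

    Assumption: 1 marks a positive query image and 0 a negative one.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one labelled example is required")
    # Count the positions where the predicted class equals the true class.
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return 100.0 * correct / len(labels)


# Hypothetical toy data: three of the four predictions are correct.
score = two_class_accuracy([1, 0, 1, 1], [1, 0, 0, 1])
print(score)
```

Under this reading, a model that guesses at random on balanced queries would score around 50, which is a useful baseline when comparing the numbers in the table.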

Results

Performance of each model on this benchmark

Comparison Table
Model name                                        2-Class Accuracy
bongard-openworld-few-shot-reasoning-for-free     91.0
bongard-openworld-few-shot-reasoning-for-free     49.3
cognitive-paradigms-for-evaluating-vlms-on        92.8
cognitive-paradigms-for-evaluating-vlms-on        93.6
bongard-openworld-few-shot-reasoning-for-free     63.3
bongard-openworld-few-shot-reasoning-for-free     55.5
bongard-openworld-few-shot-reasoning-for-free     64.0
bongard-openworld-few-shot-reasoning-for-free     49.3
bongard-openworld-few-shot-reasoning-for-free     63.8