HyperAI超神经

Mathematical Reasoning on Lila (IID)

Evaluation Metrics

Accuracy

Evaluation Results

Performance of each model on this benchmark

Comparison Table
Model Name                                  Accuracy
lila-a-unified-benchmark-for-mathematical   0.252
lila-a-unified-benchmark-for-mathematical   0.394
lila-a-unified-benchmark-for-mathematical   0.48
lila-a-unified-benchmark-for-mathematical   0.384
lila-a-unified-benchmark-for-mathematical   0.204
lila-a-unified-benchmark-for-mathematical   0.604