Automated Theorem Proving On Minif2F Valid
评估指标
Pass@64
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Pass@64 |
---|---|
hypertree-proof-search-for-neural-theorem | 47.3 |
hypertree-proof-search-for-neural-theorem | 46.7 |
minif2f-a-cross-system-benchmark-for-formal | - |
minif2f-a-cross-system-benchmark-for-formal | - |
hypertree-proof-search-for-neural-theorem | 47.5 |
hypertree-proof-search-for-neural-theorem | 58.6 |
minif2f-a-cross-system-benchmark-for-formal | - |
draft-sketch-and-prove-guiding-formal-theorem | - |
lyra-orchestrating-dual-correction-in | - |
lego-prover-neural-theorem-proving-with | - |