HyperAI超神经

Code Generation On Codecontests

评估指标

Test Set pass@1
Test Set pass@5
Val Set pass@1

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称Test Set pass@1Test Set pass@5Val Set pass@1
mapcoder-multi-agent-code-generation-for28.535.228.5
planning-driven-programming-a-large-language34.7--
wizardcoder-empowering-code-large-language1.113.181.98
motcoder-elevating-large-language-models-with20.77-16.72
codesim-multi-agent-code-generation-and-129.1--
codechain-towards-modular-code-generation2.353.292.48
motcoder-elevating-large-language-models-with26.34-20.35