Bias Detection On Rt Inod Bias
Evaluation Metrics
Best-of
Evaluation Results
Performance of each model on this benchmark
Comparison Table
Model Name | Best-of |
---|---|
benchmarking-llama2-mistral-gemma-and-gpt-for | 0.36 |
benchmarking-llama2-mistral-gemma-and-gpt-for | 0.34 |
benchmarking-llama2-mistral-gemma-and-gpt-for | 0.41 |
benchmarking-llama2-mistral-gemma-and-gpt-for | 0.41 |
benchmarking-llama2-mistral-gemma-and-gpt-for | 0.5 |