HyperAI超神经

Hellobench

评估指标

average
chat-rescaled score
heuristic text generation-rescaled score
llm_model
model_url
open-ended qa-rescaled score
organization
parameters
release_date
summarization-rescaled score
text completion-rescaled score
updated_time

评测结果

各个模型在此基准测试上的表现结果

比较表格
模型名称averagechat-rescaled scoreheuristic text generation-rescaled scorellm_modelmodel_urlopen-ended qa-rescaled scoreorganizationparametersrelease_datesummarization-rescaled scoretext completion-rescaled scoreupdated_time
模型 148.5542.8847.87GPT-4o-2024-08-06https://platform.openai.com/docs/guides54.82OpenAIN/A2024/8/629.7167.492024/9/24