Question Answering On Bioasq
评估指标
Accuracy
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Accuracy |
---|---|
linkbert-pretraining-language-models-with | 91.4 |
galactica-a-large-language-model-for-science-1 | 94.3 |
domain-specific-language-model-pretraining | 87.56 |
linkbert-pretraining-language-models-with | 94.8 |
galactica-a-large-language-model-for-science-1 | 91.4 |
evaluation-of-large-language-model | 85.71 |
galactica-a-large-language-model-for-science-1 | 81.4 |