Natural Language Inference On Multinli Dev
评估指标
Matched
Mismatched
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Matched | Mismatched |
---|---|---|
prune-once-for-all-sparse-pre-trained | 78.8 | 80.4 |
prune-once-for-all-sparse-pre-trained | 81.4 | 82.51 |
prune-once-for-all-sparse-pre-trained | 82.71 | 83.67 |
prune-once-for-all-sparse-pre-trained | 80.68 | 81.47 |
prune-once-for-all-sparse-pre-trained | 80.66 | 81.14 |
prune-once-for-all-sparse-pre-trained | 83.74 | 84.2 |
prune-once-for-all-sparse-pre-trained | 81.45 | 82.43 |
190910351 | 84.5 | 84.5 |
prune-once-for-all-sparse-pre-trained | 83.47 | 84.08 |
prune-once-for-all-sparse-pre-trained | 81.35 | 82.03 |