Dialogue Generation On Fusedchat
评估指标
BLEU
Inform
Inform_mct
Joint SA
PPL
SSA
Sensibleness
Slot Accuracy
Specificity
Success
Success_mct
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | BLEU | Inform | Inform_mct | Joint SA | PPL | SSA | Sensibleness | Slot Accuracy | Specificity | Success | Success_mct |
---|---|---|---|---|---|---|---|---|---|---|---|
fusing-task-oriented-and-open-domain | 12.05 | 70.4 | 90.1 | 0.592 | 10.49 | 0.50 | 0.52 | 0.972 | 0.47 | 57.0 | 72.7 |
fusing-task-oriented-and-open-domain | 12.17 | 75.1 | 90.8 | 0.600 | 10.50 | 0.55 | 0.58 | 0.973 | 0.51 | 60.9 | 74.4 |