Language Modelling On Salmon
评估指标
Background (Domain) Consistency
Background (Random) Consistency
Background Alignment
Gender Consistency
Room Consistency
Sentiment Alignment
Sentiment Consistency
Speaker Consistency
评测结果
各个模型在此基准测试上的表现结果
比较表格
模型名称 | Background (Domain) Consistency | Background (Random) Consistency | Background Alignment | Gender Consistency | Room Consistency | Sentiment Alignment | Sentiment Consistency | Speaker Consistency |
---|---|---|---|---|---|---|---|---|
spirit-lm-interleaved-spoken-and-written | 55.0 | 64.0 | 59.5 | 85.0 | 54.5 | 52.0 | 73.5 | 81.0 |
last-language-model-aware-speech-tokenization | 55.5 | 60.5 | 54.5 | 70.5 | 61.0 | 51.5 | 64.0 | 63.0 |
textually-pretrained-speech-language-models | 55.5 | 60.5 | 56.5 | 69.5 | 59.0 | 53.0 | 61.5 | 69.0 |
textually-pretrained-speech-language-models | 54.0 | 61.5 | 56.5 | 68.0 | 59.0 | 51.5 | 59.0 | 69.5 |
text-free-prosody-aware-generative-spoken | 57.0 | 66.0 | 53.5 | 88.5 | 53.5 | 55.5 | 40.5 | 83.0 |
last-language-model-aware-speech-tokenization | 56.0 | 61.0 | 53.0 | 68.5 | 62.5 | 53.5 | 65.0 | 64.5 |
textually-pretrained-speech-language-models | 55.0 | 60.5 | 54.5 | 70.0 | 62.0 | 51.5 | 61.5 | 71.0 |
spirit-lm-interleaved-spoken-and-written | 53.5 | 55.5 | 51.5 | 67.0 | 54.5 | 48.0 | 54.5 | 69.5 |