文本摘要
文本摘要(Text Summarization)是自然语言处理的一项任务,旨在将长篇文档压缩成更简短精炼的版本,同时保留原文的核心信息与意义。其目标是生成能够准确反映原始内容的概要,以便用户快速获取关键信息。该任务包括抽取式方法和生成式方法,前者通过识别并提取重要句子或短语,后者则基于原文内容生成新的文本。文本摘要在新闻报道、科研文献、商业报告等领域具有重要应用价值。
GigaWord
BART-RXF
Pubmed
Arxiv HEP-TH citation graph
MTEB
X-Sum
Selfmem
CNN / Daily Mail (Anonymized)
DUC 2004 Task 1
Transformer+WDrop
SAMSum
Reddit TIFU
arXiv Summarization Dataset
PRIMER
DialogSum
InstructDS
Klexikon
Luhn's algorithm (25 sentences)
BookSum
Echoes-Extractive-Abstractive
GigaWord-10k
ERNIE-GENLARGE (large-scale text corpora)
WikiHow
BertSum
BigPatent
BigBird-Pegasus
GovReport
FactorSum
How2
MeetingBank
OrangeSum
mBARThez (OrangeSum abstract)
ACI-Bench
CriSPO 3-shot
AMI
arXiv
BigBird-Pegasus
BBC XSum
MatchSum
BillSum
Longformer Encoder Decoder
CL-SciSumm
CORD-19
EurekaAlert
Gazeta
Finetuned mBART
LCSTS
LSTM-seq2seq
MediaSum
SRformer-BART
MentSum
MeQSum
BiomedGPT
QMSum
BART-LS
S2ORC
GenCompareSum
Webis-Snippet-20 Corpus
Anchor-context + Query biased
XSum
SRformer-BART