音频分类
Audio Classification是一种机器学习任务,旨在对音频信号进行识别和分类,将其归入不同的类别。该任务的核心目标是使机器能够自动区分各种类型的音频,如音乐、语音和环境声音,从而在音频处理和分析中发挥关键作用。通过精准的音频分类,可以提升音频检索、监控和内容管理系统的效率与准确性,具有重要的应用价值。
AudioSet
MAViL (Audio-Visual, single)
ESC-50
BEATs
ICBHI Respiratory Sound Database
BTS
VGGSound
ONE-PEACE (Audio-Visual)
SHD
SNN with Dilated Convolution with Learnable Spacings
FSD50K
Balanced Audio Set
EquiAV
Speech Commands
EAT
DCASE
CrissCross (AudioSet)
SSC
Event-SSM
BirdCLEF 2021
EPIC-KITCHENS-100
Audiovisual Masked Autoencoder
(Audiovisual, Single)
Audio Set
CREMA-D
DiCOVA
RAVDESS
VocalSound
VocalSound Baseline
DEEP-VOICE: DeepFake Voice Recognition
EPIC-SOUNDS
MeerKAT: Meerkat Kalahari Audio Transcripts
animal2vec
Multimodal PISA
UCR Time Series Classification Archive
CDIL
audiofolder
Common Voice 16.1
LSVSC
MNIST