HyperAI

Audio

Audio technology covers the techniques for processing, analyzing, and synthesizing sound with computer systems. Its goals are high-quality sound signal processing and a better listening experience, and it supports applications such as speech recognition and audio enhancement. It is widely applied in smart devices, online education, and entertainment, where it improves the user experience and makes human-computer interaction more natural and intelligent.

Audio Classification

91 papers | 26 benchmarks

Beat Tracking

1 paper | 15 benchmarks

Downbeat Tracking

1 paper | 13 benchmarks

Few-Shot Audio Classification

2 papers | 10 benchmarks

Bandwidth Extension

2 papers | 6 benchmarks

Language Identification

5 papers | 6 benchmarks

Sound Event Detection

16 papers | 5 benchmarks

Speech Synthesis

19 papers | 5 benchmarks

Acoustic Scene Classification

5 papers | 5 benchmarks

Sound Event Localization and Detection

5 papers | 5 benchmarks

Audio Super-Resolution

8 papers | 4 benchmarks

Environmental Sound Classification

3 papers | 3 benchmarks

Voice Conversion

3 papers | 3 benchmarks

Audio Generation

22 papers | 3 benchmarks

Instrument Recognition

5 papers | 3 benchmarks

Voice Anti-spoofing

6 papers | 3 benchmarks

Audio Denoising

1 paper | 3 benchmarks

Target Sound Extraction

2 papers | 3 benchmarks

Music Source Separation

25 papers | 3 benchmarks

Text-to-Music Generation

15 papers | 2 benchmarks

Audio Captioning

18 papers | 2 benchmarks

Audio Source Separation

2 papers | 2 benchmarks

Zero-shot Audio Captioning

3 papers | 2 benchmarks

Direction of Arrival Estimation

1 paper | 1 benchmark

Audio Tagging

9 papers | 1 benchmark

Audio Quality Assessment

1 paper | 1 benchmark

Video-to-Sound Generation

7 papers | 1 benchmark

Retrieval-augmented Few-shot In-context Audio Captioning

5 papers | 1 benchmark

Active Speaker Localization

1 paper | 1 benchmark

Acoustic Novelty Detection

3 papers | 1 benchmark

Lung Sound Classification

3 papers | 1 benchmark

Fake Voice Detection

1 paper | 1 benchmark

Cadenza 1 - Task 1 - Headphone

1 paper | 1 benchmark

Cadenza 1 - Task 2 - In Car

1 paper | 1 benchmark

Directional Hearing

1 paper | 1 benchmark

Music Generation

1 paper | 1 benchmark

Real-time Directional Hearing

1 paper | 1 benchmark

Streaming Target Sound Extraction

1 paper | 1 benchmark