Descript
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Python 1.7k 171
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Python 1k 213
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
Python 356 28
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Python 339 74
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Python 192 30