Soundlocd: An efficient conditional discrete contrastive latent diffusion model for text-to-sound generation

Published in ICASSP, 2024

Download Slides