Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
portfolio
publications
Spatial-temporal-class attention network for acoustic scene classification
Published in ICME, 2022
Oral presentation
Soundlocd: An efficient conditional discrete contrastive latent diffusion model for text-to-sound generation
Published in ICASSP, 2024
Oral presentation
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming
Published in ICML, 2024
HybridVC: Efficient Voice Style Conversion With Text and Audio Prompts
Published in InterSpeech, 2024
SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Published in In submission, 2024
In submission
Beyond Video-to-SFX: Video to Audio Synthesis with Environmentally Aware Speech
Published in In submission, 2025
In submission
SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music Editing
Published in AAAI, 2025
Oral presentation
talks
Oral presentation
Published:
Guest presenter
Published:
teaching
COMP/ENGN 4528/6528 Computer Vision
Undergraduate and master course, Australian National University, College of Systems and Society , 2022
ENGN8501 Advanced Topics in Computer Vision
Master course, Australian National University, College of Systems and Society , 2022
