Yu Zhang

AaronZ345

https://aaronz345.github.io

AI & ML interests

Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).

authored a paper 2 months ago

MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations

Paper • 2510.10396 • Published Oct 12

authored a paper 4 months ago

ASAudio: A Survey of Advanced Spatial Audio Research

Paper • 2508.10924 • Published Aug 8 • 1

authored 4 papers 5 months ago

Conan: A Chunkwise Online Network for Zero-Shot Adaptive Voice Conversion

Paper • 2507.14534 • Published Jul 19

Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly

Paper • 2505.00426 • Published May 1

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

Paper • 2505.14910 • Published May 20 • 1

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation

Paper • 2507.06670 • Published Jul 9

authored 5 papers 8 months ago

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Paper • 2502.12572 • Published Feb 18 • 2

MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Paper • 2502.18924 • Published Feb 26 • 16

Versatile Framework for Song Generation with Prompt-based Control

Paper • 2504.19062 • Published Apr 27 • 6

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Paper • 2504.20630 • Published Apr 29 • 9

Robust Singing Voice Transcription Serves Synthesis

Paper • 2405.09940 • Published May 16, 2024 • 1

authored 3 papers 11 months ago

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Paper • 2409.13832 • Published Sep 20, 2024 • 1

TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Paper • 2409.15977 • Published Sep 24, 2024 • 2

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

Paper • 2312.10741 • Published Dec 17, 2023 • 1