FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Paper • 2407.04051 • Published Jul 4 • 35
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published 22 days ago • 44
PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing Paper • 2409.10831 • Published 3 days ago • 3