base model for mono-channel completion
remove background from any image
3D/4D Scenes from a Single Image w/ Controllable Video Diff
An end-to-end (e2e) Voice Language Model by Fish Audio.