will's picture

will PRO

wrice

·

AI & ML interests

Interested in the applications of generative models for Speech Synthesis and NLP.

Recent Activity

updated a model 8 days ago

wrice/swin2sr-laion-hd

New activity 9 days ago

wrice/unet1d-vctk-48khz

New activity 9 days ago

wrice/waveunet-vctk-48khz

Organizations

wrice's activity

upvoted a paper 3 months ago

Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data

Paper • 2408.10119 • Published Aug 19 • 16

upvoted a paper 8 months ago

Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 13

upvoted a paper 9 months ago

YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Paper • 2402.13616 • Published Feb 21 • 46

upvoted 4 papers about 1 year ago

Holistic Evaluation of Text-To-Image Models

Paper • 2311.04287 • Published Nov 7, 2023 • 11

Improving Sample Quality of Diffusion Models Using Self-Attention Guidance

Paper • 2210.00939 • Published Oct 3, 2022 • 6

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Paper • 2309.15818 • Published Sep 27, 2023 • 19

ProPainter: Improving Propagation and Transformer for Video Inpainting

Paper • 2309.03897 • Published Sep 7, 2023 • 26

upvoted 3 papers over 1 year ago

SpeechX: Neural Codec Language Model as a Versatile Speech Transformer

Paper • 2308.06873 • Published Aug 14, 2023 • 25

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Paper • 2308.01546 • Published Aug 3, 2023 • 17

CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training

Paper • 2305.10763 • Published May 18, 2023 • 3