Sayak Paul

sayakpaul

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Articles

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

about 1 month ago

Memory-efficient Diffusion Transformers with Quanto and Diffusers

🧨 Diffusers welcomes Stable Diffusion 3

🤗 PEFT welcomes new merging methods

Welcome aMUSEd: Efficient Text-to-Image Generation

SDXL in 4 steps with Latent Consistency LoRAs

Personal Copilot: Train Your Own Coding Assistant

Exploring simple optimizations for SDXL

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Efficient Controllable Generation for SDXL with T2I-Adapters

Happy 1st anniversary 🤗 Diffusers!

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Instruction-tuning Stable Diffusion with InstructPix2Pix

Training a language model with 🤗 Transformers using TensorFlow and TPUs

ControlNet in Diffusers 🧨

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

A Dive into Pretraining Strategies for Vision-Language Models

The State of Computer Vision at Hugging Face 🤗

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Image Similarity with Hugging Face Datasets and Transformers

Deploying 🤗 ViT on Vertex AI

Deploying 🤗 ViT on Kubernetes with TF Serving

Deploying TensorFlow Vision Models in Hugging Face with TF Serving

Organizations

Posts 13

Post

2056

It's been a while since we shipped native quantization support in diffusers 🧨

We currently support bitsandbytes as the official backend, but using others like torchao is already very simple.

This post is just a reminder of what's possible:

1. Loading a model with a quantization config
2. Saving a model with quantization config
3. Loading a pre-quantized model
4. Using enable_model_cpu_offload()
5. Training and loading LoRAs into quantized checkpoints

Docs:
https://huggingface.co/docs/diffusers/main/en/quantization/bitsandbytes
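
Items 1 to 4 of the list above can be sketched roughly as follows. This is a minimal sketch, not the post's own code: it assumes a Flux checkpoint and 4-bit NF4 settings, the helper names (`NF4_KWARGS`, `load_quantized_transformer`, `save_and_reload`, `build_offloaded_pipeline`) are illustrative, and the model id is left as a parameter because the post does not name one. Running the helpers needs a CUDA GPU with `bitsandbytes` installed, so the heavy imports are kept inside the functions. LoRA training (item 5) is not covered here.

```python
# Plain keyword arguments for diffusers.BitsAndBytesConfig (4-bit NF4).
# The specific settings are an assumption chosen for illustration.
NF4_KWARGS = {"load_in_4bit": True, "bnb_4bit_quant_type": "nf4"}


def load_quantized_transformer(model_id, subfolder="transformer"):
    """Item 1: quantize the transformer on the fly while loading it."""
    import torch
    from diffusers import BitsAndBytesConfig, FluxTransformer2DModel

    config = BitsAndBytesConfig(bnb_4bit_compute_dtype=torch.bfloat16, **NF4_KWARGS)
    return FluxTransformer2DModel.from_pretrained(
        model_id,
        subfolder=subfolder,
        quantization_config=config,
        torch_dtype=torch.bfloat16,
    )


def save_and_reload(transformer, out_dir):
    """Items 2 and 3: save the quantized weights, then load them back.

    The quantization config is stored with the checkpoint, so it does not
    have to be passed again when reloading.
    """
    import torch
    from diffusers import FluxTransformer2DModel

    transformer.save_pretrained(out_dir)
    return FluxTransformer2DModel.from_pretrained(out_dir, torch_dtype=torch.bfloat16)


def build_offloaded_pipeline(model_id, transformer):
    """Item 4: plug the quantized transformer into a pipeline with CPU offload."""
    import torch
    from diffusers import FluxPipeline

    pipe = FluxPipeline.from_pretrained(
        model_id, transformer=transformer, torch_dtype=torch.bfloat16
    )
    pipe.enable_model_cpu_offload()
    return pipe
```

To try it, pass the repository id of a Flux checkpoint you have access to as `model_id`, and hand the returned transformer to `build_offloaded_pipeline`.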

Post

2564

I did a little experimentation with resizing pre-trained LoRAs on Flux. I explored two themes:

* Decrease the rank of a LoRA
* Increase the rank of a LoRA

The first one helps reduce memory requirements when the LoRA has a high rank, while the second one is merely an experiment. Another use of this study is unifying the ranks of several LoRAs when you would like to torch.compile() them.

Check it out here:
sayakpaul/flux-lora-resizing
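
For intuition, here is a small NumPy sketch of one common way to resize a single LoRA pair. It is an assumption for illustration, not necessarily the method used in the linked repository: the rank is decreased with a truncated SVD of the update `B @ A`, and increased by zero-padding, which leaves the update unchanged. The function name `resize_lora` and the shape convention (`A` is rank x in_features, `B` is out_features x rank) are hypothetical, and any alpha/rank scaling factor is ignored.

```python
import numpy as np


def resize_lora(A, B, new_rank):
    """Return (A_new, B_new) of rank `new_rank` approximating B @ A.

    A has shape (rank, in_features), B has shape (out_features, rank).
    """
    rank = A.shape[0]
    if new_rank >= rank:
        # Increase the rank: zero rows/columns add nothing to the product,
        # so B_new @ A_new equals B @ A exactly.
        pad = new_rank - rank
        A_new = np.concatenate([A, np.zeros((pad, A.shape[1]), dtype=A.dtype)], axis=0)
        B_new = np.concatenate([B, np.zeros((B.shape[0], pad), dtype=B.dtype)], axis=1)
        return A_new, B_new

    # Decrease the rank: keep the top `new_rank` singular directions of the
    # full update, splitting each singular value evenly between both factors.
    U, S, Vt = np.linalg.svd(B @ A, full_matrices=False)
    sqrt_s = np.sqrt(S[:new_rank])
    B_new = U[:, :new_rank] * sqrt_s
    A_new = sqrt_s[:, None] * Vt[:new_rank]
    return A_new, B_new


rng = np.random.default_rng(0)
A = rng.standard_normal((8, 16))   # rank 8, in_features 16
B = rng.standard_normal((12, 8))   # out_features 12, rank 8

A_up, B_up = resize_lora(A, B, 16)     # same update, larger rank
A_down, B_down = resize_lora(A, B, 4)  # approximate update, smaller rank
```

Zero-padding is also what makes rank unification possible in this sketch: LoRAs padded to a common rank end up with identical tensor shapes.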

Collections 1

Papers 11

arxiv:2408.13467

arxiv:2406.06424

arxiv:2404.01197

arxiv:2402.17412

spaces 19

Demo Docker Gradio

Diffusers Docs QA Chatbot

Ask questions to the Diffusers documentation.

Convert Kerascv SD to Diffusers

Inpainting Tool

Generate Custom Pokemons with Stable Diffusion

Evaluate StableDiffusionPipeline with Different Schedulers

models 72

sayakpaul/mochi-lora

Text-to-Video • Updated about 17 hours ago • 1

sayakpaul/mochi-lora-lr_1e-5-w_none-bit_no8bit

Updated about 23 hours ago

sayakpaul/mochi-lora-lr_1e-5-w_logit_normal-bit_no8bit

Updated 1 day ago

sayakpaul/mochi-lora-lr_1e-4-w_none-bit_no8bit

Updated 1 day ago

sayakpaul/mochi-lora-lr_1e-4-w_logit_normal-bit_no8bit

Updated 1 day ago

sayakpaul/optimizer_adamw_steps_1000_lr-schedule_cosine_with_restarts_learning-rate_5e-4_rank_

Text-to-Video • Updated 4 days ago • 8 • 1

sayakpaul/optimizer_adamw_steps_1000_lr-schedule_cosine_with_restarts_learning-rate_3e-4_rank_

Text-to-Video • Updated 4 days ago • 7 • 1

sayakpaul/optimizer_adamw_steps_1000_lr-schedule_cosine_with_restarts_learning-rate_1e-4_rank_

Text-to-Video • Updated 5 days ago • 7 • 1

sayakpaul/optimizer_adamw_steps_1000_lr-schedule_cosine_with_restarts_learning-rate_5e-4

Text-to-Video • Updated 6 days ago • 8

sayakpaul/optimizer_adamw_steps_1000_lr-schedule_cosine_with_restarts_learning-rate_3e-4

Text-to-Video • Updated 6 days ago • 9

datasets 29

sayakpaul/pd12m-full

Viewer • Updated 3 days ago • 10.8M • 1.32k • 6

sayakpaul/pick-a-pic-v2-unique-prompts

Viewer • Updated 11 days ago • 59k • 111

sayakpaul/sample-datasets

Viewer • Updated 21 days ago • 6 • 23.5k • 1

sayakpaul/poses-controlnet-dataset

Viewer • Updated Aug 29 • 496 • 61 • 5

sayakpaul/torchao-diffusers

Updated Aug 28 • 89

sayakpaul/pickapic_v2_webdataset

Viewer • Updated Apr 4 • 8.7k • 458

sayakpaul/generated-gemini-responses

Viewer • Updated Apr 1 • 115 • 40

sayakpaul/no_robots_only_coding

Viewer • Updated Mar 20 • 350 • 50 • 1

sayakpaul/diffusers-qa-chatbot-artifacts

Viewer • Updated Mar 9 • 265k • 227 • 1

sayakpaul/mgie-results

Viewer • Updated Feb 16 • 8 • 49