Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms Paper • 2410.18967 • Published 28 days ago • 1
Diffusion DPO LoRA Collection How to train: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/diffusion_dpo • 4 items • Updated Jan 12 • 5
GGUF Image Model Quants Collection List of GGUF quants for text to image base models. • 9 items • Updated 23 days ago • 12
Chronos Models & Datasets Collection Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 8 items • Updated Jun 27 • 31
ReLiK: Retrieve, Read and LinK Collection A blazing fast and lightweight Information Extraction model for Entity Linking and Relation Extraction. • 20 items • Updated Aug 8 • 22
Applied Machine Learning Papers Collection Reading List (Mainly Focused of VLM's and Diffusion Models) • 46 items • Updated 9 days ago • 1
Uni-SMART: Universal Science Multimodal Analysis and Research Transformer Paper • 2403.10301 • Published Mar 15 • 52
DCoT Collection Models from the paper "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" • 6 items • Updated Jul 5 • 1
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 3 days ago • 70
Zero-Shot Voice Cloning Collection TTS models that support zero-shot voice cloning • 7 items • Updated 26 days ago • 7
ControlAR: Controllable Image Generation with Autoregressive Models Paper • 2410.02705 • Published Oct 3 • 8
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Paper • 2410.01699 • Published Oct 2 • 18
SILC: Improving Vision Language Pretraining with Self-Distillation Paper • 2310.13355 • Published Oct 20, 2023 • 7