Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

liked a dataset 1 day ago

Maple728/Time-300B

liked a Space 9 days ago

Salesforce/GIFT-Eval

liked a model about 1 month ago

jimmycarter/LibreFLUX

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model

Organizations

kashif's activity

upvoted a paper about 1 month ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16 • 1

upvoted a paper about 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted a paper 3 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 7

upvoted a collection 3 months ago

Power-LM

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17 • 15

upvoted 2 papers 3 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 115

upvoted a paper 4 months ago

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31 • 3

upvoted 4 articles 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12

• 92

Article

The Annotated Diffusion Model

Jun 7, 2022

• 105

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 123

upvoted 3 papers 5 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14 • 22

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10 • 12

upvoted 2 papers 8 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 62

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 20

upvoted 2 collections 8 months ago

Moirai-R models

10 items • Updated 20 days ago • 33

Chronos Models & Datasets

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 8 items • Updated Jun 27 • 31

upvoted a collection 10 months ago

datasets-SPIN

Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 11

upvoted a paper 11 months ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 14