Jiwoo Hong's picture

Jiwoo Hong

JW17

·

https://jiwooya1000.github.io/

AI & ML interests

NLP, LLM, and any related topics

Organizations

JW17's activity

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Sep 18 • 371

upvoted an article 2 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 265

upvoted an article 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

upvoted a paper 5 months ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10 • 12

upvoted a collection 5 months ago

MaPO

This collection includes the models and datasets as a part of the MaPO release. • 9 items • Updated Jun 12 • 5

upvoted a paper 6 months ago

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29 • 10

upvoted 2 articles 7 months ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24

• 63

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 227

upvoted 2 collections 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

Zephyr ORPO

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12 • 17

upvoted a paper 8 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 62

upvoted a collection 8 months ago

ORPO

This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12 • 11