Minwoo Park's picture

Minwoo Park

danielpark

·

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

sentence-transformers/distilbert-multilingual-nli-stsb-quora-ranking

liked a model 29 days ago

google-bert/bert-base-multilingual-cased

liked a model 29 days ago

sentence-transformers/paraphrase-multilingual-mpnet-base-v2

Organizations

danielpark's activity

upvoted a paper 3 months ago

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Paper • 2402.12226 • Published Feb 19 • 41

upvoted a paper 4 months ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11 • 31

upvoted an article 4 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 369

upvoted 2 papers 5 months ago

Contrastive Prefence Learning: Learning from Human Feedback without RL

Paper • 2310.13639 • Published Oct 20, 2023 • 24

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12 • 39

upvoted a collection 7 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 30

upvoted 3 articles 7 months ago

Article

Can We Train Chat Models with Raw Data?

By

•

Apr 25

• 17

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 33

Article

CodeGemma - an official Google release for code LLMs

Apr 9

• 99

upvoted a collection 8 months ago

Korean-Adapted Model Series

Korean-adapted Language Model Series • 13 items • Updated May 17 • 24

upvoted 4 papers 8 months ago

MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

Paper • 2403.19888 • Published Mar 29 • 10

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 65

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

Paper • 2310.16787 • Published Oct 25, 2023 • 5

sDPO: Don't Use Your Data All at Once

Paper • 2403.19270 • Published Mar 28 • 40

upvoted a collection 9 months ago

Sora Reference Papers

A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Oct 3 • 51

upvoted a collection about 1 year ago

Zephyr 7B

Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 9 items • Updated Apr 12 • 145

upvoted a paper about 1 year ago

Hardwiring ViT Patch Selectivity into CNNs using Patch Mixing

Paper • 2306.17848 • Published Jun 30, 2023 • 8

upvoted 3 papers over 1 year ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 28

Platypus: Quick, Cheap, and Powerful Refinement of LLMs

Paper • 2308.07317 • Published Aug 14, 2023 • 23

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 41