ldwang

ftgreat

AI & ML interests

None yet

Recent Activity

liked a dataset about 4 hours ago

bigcode/stackoverflow-clean

upvoted a collection 2 days ago

The Big Benchmarks Collection

upvoted a collection 2 days ago

Open LLM Leaderboard best models ❤️‍🔥

ldwang's activity

upvoted 2 collections 2 days ago

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated 4 days ago • 158

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 57 items • Updated about 1 hour ago • 442

upvoted a paper 3 days ago

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1 • 81

upvoted a paper 4 days ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 126

upvoted a collection 7 days ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated 6 days ago • 36

upvoted an article 7 days ago

Releasing the largest multilingual open pretraining dataset

Article • 8 days ago • 94

upvoted a collection 10 days ago

LLMs

302 items • Updated about 19 hours ago • 20

upvoted a collection 17 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95

upvoted an article 22 days ago

Code a simple RAG from scratch

Article • 23 days ago • 8

upvoted a paper 23 days ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published 28 days ago • 18

upvoted a collection 23 days ago

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22 • 41

upvoted a collection 26 days ago

ScaleQuest

We introduce ScaleQuest, a scalable and novel data synthesis method. Project Page: https://scalequest.github.io/ • 8 items • Updated 28 days ago • 4

upvoted a paper 28 days ago

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Paper • 2410.18505 • Published 29 days ago • 8

upvoted a collection 28 days ago

Infinity MM

5 items • Updated 13 days ago • 3

upvoted 2 collections about 1 month ago

Aquila

19 items • Updated 10 days ago • 3

CCI

Chinese Corpora Internet (中文互联网语料) • 11 items • Updated 22 days ago • 2

upvoted a paper about 1 month ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 91

upvoted 3 collections about 1 month ago

Infinity Instruct

16 items • Updated 28 days ago • 6

IndustryCorpus

19 items • Updated 28 days ago • 5

IndustryCorpus2

Multilingual, multi-industry pre-training dataset • 35 items • Updated 17 days ago • 4