Nikolay Kozlov's picture

Nikolay Kozlov

NikolayKozloff

·

AI & ML interests

None yet

Recent Activity

liked a model about 16 hours ago

Khetterman/DarkAtom-12B-v3

liked a model about 16 hours ago

Khetterman/Llama-3.2-Kapusta-3B-v8

liked a model about 16 hours ago

qingy2019/NaturalLM-GGUF

Organizations

None yet

NikolayKozloff's activity

upvoted a collection 3 days ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 2 items • Updated Oct 1 • 2

upvoted 2 collections 6 days ago

LLäMmlein Chat Preview 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 7 items • Updated about 22 hours ago • 6

LLäMmlein 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 5 items • Updated 2 days ago • 6

upvoted 3 collections 10 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 223

🍓 Ichigo v0.4

The experimental family designed to train LLMs to understand sound natively. • 2 items • Updated 11 days ago • 6

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 57 items • Updated 25 minutes ago • 442

upvoted a collection 13 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 9 items • Updated 4 days ago • 70

upvoted a collection 16 days ago

OS-Atlas

OS-Atlas series models • 7 items • Updated 3 days ago • 12

upvoted a collection 20 days ago

QTIP Quantized Models

See https://github.com/Cornell-RelaxML/qtip • 27 items • Updated 25 days ago • 5

upvoted 2 collections 21 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 10 items • Updated about 5 hours ago • 172

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 15 days ago • 95

upvoted a collection 28 days ago

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8 • 9

upvoted a collection 30 days ago

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20 • 23

upvoted 7 collections about 1 month ago

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 17 days ago • 89

v4

18 items • Updated Oct 20 • 24

Arch-Function

6 items • Updated 23 days ago • 8

ApolloMoE & Apollo2

English, Chinese, French, Hindi, Spanish, Arabic, Russian, Japanese, Korean, German, Italian, Portuguese and 38 Minor Languages • 7 items • Updated Oct 15 • 3

LoLCATS

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14 • 14

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22 • 41

Qwen2

Qwen2 language models, instruction-tuned models of 3 sizes: 0.5B, 1.5B, 7B. • 3 items • Updated Jun 13 • 1