Senichev Sergei

seniichev · ssenichev

AI & ML interests

RecSys, Data Analysis

Recent Activity

liked a model 9 days ago

msu-rcc-lair/RuadaptQwen2.5-32B-instruct

liked a model 27 days ago

google/gemma-2-9b

liked a model about 1 month ago

RefalMachine/ruadapt_qwen2.5_3B_ext_u48_instruct_v4

seniichev's activity

upvoted a paper about 2 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 91

upvoted a paper 2 months ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10 • 63

upvoted a collection 3 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted 2 collections 4 months ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 198

H2O Danube3

6 items • Updated Oct 17 • 53

upvoted a paper 6 months ago

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22 • 53

upvoted 2 collections 9 months ago

AQLM

AQLM quantized LLMs • 20 items • Updated May 3 • 44

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Sep 18 • 206

upvoted a paper 9 months ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16 • 79