Santiago Garcia

santyzenith

AI & ML interests

Large language models, natural language processing, computer vision, Spanish large language models.

Recent Activity

updated a collection about 6 hours ago

Whisper-ECU 911

santyzenith's activity

upvoted a collection about 2 months ago

LLM2Vec

Collection

16 items • Updated Oct 8 • 34

upvoted 2 articles about 2 months ago

Article

Train a Llama model from scratch

Jul 29

• 45

Article

Vision Language Models Explained

Apr 11

• 214

upvoted an article 2 months ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Mar 9, 2023

• 34

upvoted 2 papers 3 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 38

MoLE : Mixture of Language Experts for Multi-Lingual Automatic Speech Recognition

Paper • 2302.13750 • Published Feb 27, 2023 • 2

upvoted 3 articles 3 months ago

Article

Introduction to Graph Machine Learning

Jan 3, 2023

• 16

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23

• 215

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 123

upvoted a paper 4 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17 • 49

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 265

upvoted an article 5 months ago

Article

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

Oct 21, 2022

• 14

upvoted a paper 5 months ago

Tuna: Instruction Tuning using Feedback from Large Language Models

Paper • 2310.13385 • Published Oct 20, 2023 • 10

upvoted a collection 5 months ago

Knowledge distillation

Collection

88 items • Updated Feb 7 • 6

upvoted 2 articles 5 months ago

Article

Putting RL back in RLHF

Jun 12

• 62

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 120

upvoted 4 papers 5 months ago

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 10

Estimating Knowledge in Large Language Models Without Generating a Single Token

Paper • 2406.12673 • Published Jun 18 • 7

A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Paper • 2406.11289 • Published Jun 17 • 5

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24 • 67