Hieu Ngo's picture

Hieu Ngo

hiieu

·

AI & ML interests

Applied, Post-Training LLM

Organizations

hiieu's activity

upvoted a paper 4 days ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published 8 days ago • 20

upvoted a paper 10 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 14 days ago • 76

upvoted a paper 11 days ago

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published 18 days ago • 17

upvoted an article 19 days ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By

•

19 days ago

• 30

upvoted an article 25 days ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

26 days ago

• 54

upvoted a paper about 2 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9 • 45

upvoted a collection 2 months ago

Gemma 2 ChatQA RAG finetuned

1 item • Updated Sep 2 • 1

upvoted an article 3 months ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21

• 22

upvoted 2 papers 3 months ago

Synthesizing Text-to-SQL Data from Weak and Strong LLMs

Paper • 2408.03256 • Published Aug 6 • 10

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1 • 21

upvoted a collection 3 months ago

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 31 • 11

upvoted a paper 4 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19 • 37

upvoted a collection 4 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted a paper 4 months ago

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19 • 24

upvoted a collection 4 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 62

upvoted 2 articles 4 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16

• 32

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 258

upvoted a collection 4 months ago

H2O Danube3

6 items • Updated 22 days ago • 53

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

By

•

Jul 15

• 75

upvoted a collection 4 months ago

Gemma 2 Release

15 items • Updated Sep 9 • 193