Anthony Ivan S

anthonyivn

anthonyivn2

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

liked a model about 1 month ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

liked a model about 1 month ago

meta-llama/Llama-3.2-1B

Organizations

None yet

anthonyivn's activity

upvoted a paper 14 days ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 16 days ago • 60

upvoted an article about 2 months ago

Article

Document Similarity Search with ColPali

•

Sep 21

• 47

upvoted 2 papers 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

upvoted a paper 3 months ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27 • 13

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

•

Jul 15

• 78

upvoted a paper 4 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 128

upvoted a collection 5 months ago

InternLM2.5

Collection

14 items • Updated Sep 14 • 70

upvoted 3 papers 5 months ago

upvoted 2 articles 5 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13

• 369

Article

Putting RL back in RLHF

Jun 12

• 62

upvoted a paper 6 months ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30 • 21

upvoted an article 6 months ago

Article

Hugging Face on AMD Instinct MI300 GPU

May 21

• 10

upvoted 3 papers 7 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior

Paper • 2404.10198 • Published Apr 16 • 7

upvoted an article 7 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 278

upvoted a paper 8 months ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9 • 64