Satyam's picture

Satyam

satyamt

·

AI & ML interests

Biotechnology

Recent Activity

liked a model 10 days ago

Qwen/Qwen2.5-Coder-32B-Instruct

liked a model 16 days ago

tencent/Tencent-Hunyuan-Large

liked a model 20 days ago

fishaudio/fish-agent-v0.1-3b

Organizations

satyamt's activity

upvoted a collection about 1 month ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 139

upvoted a collection about 2 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 271

upvoted an article 3 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5

• 161

upvoted a paper 3 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29 • 52

upvoted an article 3 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 210

upvoted a paper 3 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 52

upvoted a collection 4 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Jul 31 • 137

upvoted an article 4 months ago

Article

Constitutional AI with Open LLMs

Feb 1

• 12

upvoted a collection 4 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 36

upvoted 2 articles 4 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 28

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24

• 177

upvoted 2 papers 5 months ago

GenQA: Generating Millions of Instructions from a Handful of Prompts

Paper • 2406.10323 • Published Jun 14 • 5

Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

Paper • 2406.00888 • Published Jun 2 • 30

upvoted a collection 6 months ago

sentence-transformers-from-synthetic-data

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated Jun 21 • 21

upvoted a paper 6 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23 • 37

upvoted an article 6 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 66

upvoted a paper 6 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 13

upvoted 2 articles 7 months ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

Jun 23

• 65

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 118

upvoted a paper 7 months ago

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14 • 43