Simon Brandeis

sbrandeis

SBrandeis

AI & ML interests

None yet

Articles

Subscribe to Enterprise Hub with your AWS Account

Deprecation of Git Authentication using password

Hugging Face Platform on the AWS Marketplace: Pay with your AWS Account

Introducing our new pricing

sbrandeis's activity

upvoted an article 5 months ago

Article

BrAIn: next generation neurons?

Jun 5

• 15

upvoted an article 6 months ago

Article

Benchmarking Text Generation Inference

May 29

• 27

upvoted 2 collections 7 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Sep 25 • 683

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 88

upvoted a paper 8 months ago

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14 • 124

upvoted a paper 9 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

upvoted 2 papers 10 months ago

Locally Typical Sampling

Paper • 2202.00666 • Published Feb 1, 2022 • 2

Masked Audio Generation using a Single Non-Autoregressive Transformer

Paper • 2401.04577 • Published Jan 9 • 42

upvoted a collection 10 months ago

MAGNeT

Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated Apr 4 • 40

upvoted 4 papers 11 months ago

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

Paper • 2307.13304 • Published Jul 25, 2023 • 2

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Paper • 2312.09767 • Published Dec 15, 2023 • 25

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Paper • 2312.02145 • Published Dec 4, 2023 • 5

upvoted 2 collections 11 months ago

Notus 7B v1

Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Jul 30 • 18

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6 • 230

upvoted 4 papers 12 months ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 138

Positional Description Matters for Transformers Arithmetic

Paper • 2311.14737 • Published Nov 22, 2023 • 2

Thinking Fast and Slow in Large Language Models

Paper • 2212.05206 • Published Dec 10, 2022 • 1

A Watermark for Large Language Models

Paper • 2301.10226 • Published Jan 24, 2023 • 8

upvoted a paper about 1 year ago

Memory Augmented Language Models through Mixture of Word Experts

Paper • 2311.10768 • Published Nov 15, 2023 • 16