- MambaByte: Token-free Selective State Space Model
  Paper • 2401.13660 • Published • 50
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
  Paper • 2312.00752 • Published • 138
- MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
  Paper • 2401.04081 • Published • 70
- hustvl/Vim-tiny
  Updated • 19
Michael Schock
mjschock
Collections: 1
Spaces: 1
Models: 37
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads • Updated • 477
mjschock/TinyLlama-1.1B-Chat-v1.0 • Text Generation • Updated • 420
mjschock/sft_mjschock-chat_threads • Updated • 22
mjschock/sft_openassistant-guanaco • Updated
mjschock/TinyLlama-1.1B-Chat-v1.0-qlora-ultrachat • Updated
mjschock/TinyLlama-1.1B-2.5T-chat-and-function-calling-Q4_K_M-GGUF • Text Generation • Updated • 24
mjschock/TinyLlama-1.1B-Chat-v1.0-Q8_0-GGUF • Updated • 8 • 1
mjschock/SmolLM-135M-Q4_K_M-GGUF • Updated • 8 • 1
mjschock/open_llama_3b_v2-Q8_0-GGUF • Updated • 17 • 1
mjschock/TinySolar-248m-4k-py-Q4_K_M-GGUF • Updated • 6 • 1