-
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 51 -
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 138 -
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper • 2401.04081 • Published • 71 -
hustvl/Vim-tiny
Updated • 19
Michael Schock
mjschock
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 5 hours ago
gaia-benchmark/GAIA
liked
a dataset
1 day ago
bigcode/self-oss-instruct-sc2-exec-filter-50k
liked
a dataset
3 days ago
microsoft/orca-agentinstruct-1M-v1
Organizations
None yet
Collections
1
spaces
1
models
39
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads-11-10-24-v2
Updated
•
22
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads-11-10-24
Updated
•
24
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads
Updated
•
158
mjschock/TinyLlama-1.1B-Chat-v1.0
Text Generation
•
Updated
•
466
mjschock/sft_mjschock-chat_threads
Updated
•
6
mjschock/sft_openassistant-guanaco
Updated
mjschock/TinyLlama-1.1B-Chat-v1.0-qlora-ultrachat
Updated
mjschock/TinyLlama-1.1B-2.5T-chat-and-function-calling-Q4_K_M-GGUF
Text Generation
•
Updated
•
5
mjschock/TinyLlama-1.1B-Chat-v1.0-Q8_0-GGUF
Updated
•
7
•
1
mjschock/SmolLM-135M-Q4_K_M-GGUF
Updated
•
6
•
1