Snowflake/snowflake-arctic-instruct

Text Generation

Mixture of Experts


Fix the vLLM "deepspeedfp not found" issue

#9

by ThWu - opened Apr 26

base: refs/heads/main ← from: refs/pr/9


quant_config.json +6 -0

quant_config.json ADDED

	@@ -0,0 +1,6 @@

+{
+    "bits": 8,
+    "rounding": "nearest",
+    "mantissa_bits": 3,
+    "group_size": 512
+}
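
To sanity-check the added file before merging, a small script along these lines could be used. This is only a sketch: `check_quant_config` and `REQUIRED_KEYS` are hypothetical names, not part of vLLM or DeepSpeed, and the exponent-width calculation assumes a floating-point layout with exactly one sign bit, which the PR itself does not state.

```python
import json

# The four fields added by this PR, copied verbatim from the diff.
QUANT_CONFIG_TEXT = """
{
    "bits": 8,
    "rounding": "nearest",
    "mantissa_bits": 3,
    "group_size": 512
}
"""

# Hypothetical list of keys we expect; taken from the diff, not from any library.
REQUIRED_KEYS = ("bits", "rounding", "mantissa_bits", "group_size")


def check_quant_config(text: str) -> dict:
    """Parse quant_config.json content and run basic sanity checks."""
    cfg = json.loads(text)

    missing = [key for key in REQUIRED_KEYS if key not in cfg]
    if missing:
        raise ValueError(f"missing keys: {missing}")

    # The numeric fields should be positive integers (bool is excluded explicitly
    # because it is a subclass of int in Python).
    for key in ("bits", "mantissa_bits", "group_size"):
        value = cfg[key]
        if isinstance(value, bool) or not isinstance(value, int) or value <= 0:
            raise ValueError(f"{key} must be a positive integer, got {value!r}")

    # Assumption: 1 sign bit, so the remaining bits are the exponent.
    exponent_bits = cfg["bits"] - 1 - cfg["mantissa_bits"]
    if exponent_bits <= 0:
        raise ValueError("mantissa_bits leaves no room for an exponent")

    return {**cfg, "exponent_bits": exponent_bits}


if __name__ == "__main__":
    print(check_quant_config(QUANT_CONFIG_TEXT))
```

Under the one-sign-bit assumption, `bits: 8` with `mantissa_bits: 3` leaves 4 exponent bits, and `group_size: 512` would be the number of weights sharing a scale factor, if the field follows the usual group-wise quantization convention.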