Model Description
SpikeGPT-OpenWebText-216M is an L18-D768 SpikeGPT model (18 layers, embedding dimension 768) trained on OpenWebText. See https://github.com/ridgerchu/SpikeGPT for details.
ctx_len = 1024, n_layer = 18, n_embd = 768
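
To make the hyperparameters above easier to reuse in a script, they can be gathered into a small config object. This is only a sketch: the class name SpikeGPTConfig and the tag property are hypothetical and are not taken from the SpikeGPT repository. Only the three values come from this card.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpikeGPTConfig:
    """Hypothetical container for the hyperparameters listed in this card."""

    ctx_len: int = 1024  # context length in tokens
    n_layer: int = 18    # number of layers
    n_embd: int = 768    # embedding dimension

    @property
    def tag(self) -> str:
        # Rebuild the "L<layers>-D<dim>" shorthand used in the description.
        return f"L{self.n_layer}-D{self.n_embd}"


config = SpikeGPTConfig()
print(config.tag)
```

The training and inference scripts in the repository may expect these values in a different form, so check there before relying on this layout.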