BabyLM Submissions
This collection contains the models relevant to the BabyLM 2024 submission. The 100M model is for the strict track and the 10M model is for the strict-small track.
Strict-small Track submission.
To download and use this model, the fla package (flash-linear-attention) must be installed:
pip install -U git+https://github.com/sustcsonglin/flash-linear-attention
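Once the package is installed, loading might look roughly like the sketch below. This is a minimal illustration, not the authors' documented usage: the `repo_id` argument is a placeholder for this model's Hub identifier (not given here), the helper names `fla_available` and `load_model` are hypothetical, and it assumes that the checkpoint is a causal language model and that importing `fla` makes its model classes available to `transformers`.

```python
import importlib.util


def fla_available() -> bool:
    """Return True if the `fla` package can be found in this environment."""
    return importlib.util.find_spec("fla") is not None


def load_model(repo_id: str):
    """Load a tokenizer and model from the Hugging Face Hub.

    `repo_id` is a placeholder: pass the identifier of the model you want.
    Assumes the checkpoint is a causal language model.
    """
    if not fla_available():
        raise ImportError(
            "The fla package is required; install it with the pip command above."
        )

    # Imported lazily so the availability check above can run first.
    # Assumption: importing fla registers its architectures with transformers.
    import fla  # noqa: F401
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    return tokenizer, model
```

Calling `load_model("<repo-id>")` with the actual identifier would then return the tokenizer and model. The explicit availability check is there only to give a clearer error message than a failed architecture lookup.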