Max Context length?
#5
by
lazyDataScientist
- opened
Just wondering what the max context length for this model is at the moment.
It doesn’t have a hard-coded max context length like a transformer. It works kind of like an LSTM: you can just keep adding input and it will keep going. It “remembers” the past context selectively, so it doesn’t lose too much performance.
see: https://arxiv.org/pdf/2312.00752.pdf
They talk about it in the section on synthetic tasks.
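To make the “works kind of like an LSTM” point concrete, here’s a toy recurrent state update (a sketch of the general SSM idea, not Mamba’s actual selective-scan implementation; the matrices `A` and `B` are made up for illustration). The hidden state has a fixed size, so memory cost stays constant no matter how long the input gets — which is why there’s no hard context limit:

```python
import numpy as np

state_dim = 16
rng = np.random.default_rng(0)

A = 0.9 * np.eye(state_dim)          # hypothetical decay dynamics
B = rng.standard_normal(state_dim)   # hypothetical input projection

h = np.zeros(state_dim)              # fixed-size hidden state

for t in range(100_000):             # arbitrarily long input stream
    x_t = rng.standard_normal()      # one scalar input per step
    h = A @ h + B * x_t              # state update: size never grows

print(h.shape)                       # still (16,) after 100k steps
```

The catch is that everything has to fit into that fixed-size state, so old context is compressed rather than stored exactly — that’s the selective “remembering” the paper evaluates on those synthetic tasks.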