RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 3 days ago • 37