Choucri Fahed

choucrifahed

AI & ML interests

None yet

Organizations

None yet

choucrifahed's activity

upvoted a collection 5 months ago

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated 3 days ago • 113

upvoted an article 5 months ago

Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

Mar 20 • 17

upvoted a collection 8 months ago

Saul-7B: A pioneering Large Language Model for Law

We introduce SaulLM-7B, an LLM tailored for the legal domain and trained on 30 billion tokens of legal data. Released under the MIT License. • 4 items • Updated Mar 7 • 18