Data Selection for Language Models via Importance Resampling Paper • 2302.03169 • Published Feb 6, 2023
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published Jun 17 • 30