WMDP Benchmark Collection The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning • 9 items • Updated Apr 23 • 6
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 242