Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
Paper: arXiv 2402.05406 (published)
Working on improving the reasoning of models from the Bonsai paper.
Note: Original model, pruned over 10 iterations to reach 50% sparsity.
Note: Bonsai model (pruned on C4), fine-tuned on Wikitext.
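
The note about reaching 50% sparsity in 10 iterations does not say how the sparsity is spread across iterations. As a rough illustration only, the sketch below compares two common ways such a schedule could be laid out: removing an equal share of the original units each time, or removing the same fraction of whatever remains. The function names and both schedules are assumptions made for this example, not details taken from the paper or from these checkpoints.

```python
# Hypothetical sketch: two ways to spread a 50% sparsity target over 10
# pruning iterations. Neither schedule is confirmed by the notes above.

def linear_schedule(target_sparsity: float, iterations: int) -> list[float]:
    """Cumulative sparsity after each iteration when an equal share of the
    original units is removed every time."""
    step = target_sparsity / iterations
    return [step * (i + 1) for i in range(iterations)]


def geometric_schedule(target_sparsity: float, iterations: int) -> list[float]:
    """Cumulative sparsity when each iteration removes the same fraction of
    whatever is still left."""
    keep_per_iter = (1.0 - target_sparsity) ** (1.0 / iterations)
    return [1.0 - keep_per_iter ** (i + 1) for i in range(iterations)]


linear = linear_schedule(0.5, 10)
geometric = geometric_schedule(0.5, 10)
for i, (lin, geo) in enumerate(zip(linear, geometric), start=1):
    print(f"iteration {i:2d}: linear {lin:.3f}  geometric {geo:.3f}")
```

Under the linear assumption each iteration removes 5% of the original units. Under the geometric assumption each iteration removes roughly 6.7% of the units still remaining. Both end at 50% after the tenth iteration.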