-
The LLM Surgeon
Paper • 2312.17244 • Published • 9 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 64 -
Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models
Paper • 2401.06102 • Published • 19 -
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Paper • 2407.08770 • Published • 19
Anubrata Das
anubrata
AI & ML interests
Fairness, Explainability and Interpretability in NLP Models, Computational Social Science