AI-LA: Aphasia in Artificial Intelligence Large Language Models

Most AI research focuses on adding capabilities to LLMs. In contrast, little work addresses how to remove those capabilities from pre-trained LLMs.

Finding an approach that scores well on specificity and generalization

A model-editing technique scores well on specificity if other facts, including closely related ones, do not change after the model is edited. A technique scores well on generalization if the changed fact holds up when the prompt is rephrased or when context is added or changed.
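
The two criteria above can be turned into simple scores. The sketch below is only an illustration under stated assumptions, not this project's evaluation code: a "model" is assumed to be any function that maps a prompt string to an answer string, and the names generalization_score, specificity_score, and the toy lookup tables are hypothetical stand-ins for real model calls.

```python
from typing import Callable, Dict, Sequence

# Assumption: a "model" is any callable mapping a prompt to its predicted answer.
Predict = Callable[[str], str]


def generalization_score(edited: Predict, paraphrases: Sequence[str], new_target: str) -> float:
    """Fraction of rephrased prompts for which the edited model returns the new target."""
    if not paraphrases:
        raise ValueError("paraphrases must not be empty")
    hits = sum(edited(p) == new_target for p in paraphrases)
    return hits / len(paraphrases)


def specificity_score(original: Predict, edited: Predict, neighbors: Sequence[str]) -> float:
    """Fraction of non-edited prompts whose answer is the same before and after the edit."""
    if not neighbors:
        raise ValueError("neighbors must not be empty")
    unchanged = sum(original(p) == edited(p) for p in neighbors)
    return unchanged / len(neighbors)


# Toy stand-ins for a real LLM (hypothetical data, for illustration only).
ORIGINAL: Dict[str, str] = {
    "The Eiffel Tower is located in": "Paris",
    "Where is the Eiffel Tower? It is in": "Paris",
    "Tourists visit the Eiffel Tower in": "Paris",
    "The Louvre is located in": "Paris",
    "The Colosseum is located in": "Rome",
}

# Simulated edit "Eiffel Tower -> Rome" that misses one paraphrase
# and leaks into one neighboring fact (the Louvre).
EDITED: Dict[str, str] = {
    **ORIGINAL,
    "The Eiffel Tower is located in": "Rome",
    "Where is the Eiffel Tower? It is in": "Rome",
    "The Louvre is located in": "Rome",
}


def original_model(prompt: str) -> str:
    return ORIGINAL[prompt]


def edited_model(prompt: str) -> str:
    return EDITED[prompt]


paraphrases = [
    "The Eiffel Tower is located in",
    "Where is the Eiffel Tower? It is in",
    "Tourists visit the Eiffel Tower in",
]
neighbors = [
    "The Louvre is located in",
    "The Colosseum is located in",
]

gen = generalization_score(edited_model, paraphrases, new_target="Rome")
spec = specificity_score(original_model, edited_model, neighbors)
print(f"generalization = {gen:.2f}")  # 0.67: two of three paraphrases follow the edit
print(f"specificity    = {spec:.2f}")  # 0.50: one of two neighboring facts was disturbed
```

In this toy case the edit generalizes only partly and is not very specific, which is the kind of trade-off the two scores are meant to expose.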

There are three types of approaches to updating parameters: fine-tuning, hyper-networks, and causal tracing.

In this project, I will test all three types!

MODEL GOAL

Reproduce

Fine-tuning
Hyper-networks
Causal tracing

Talk to me: https://www.linkedin.com/in/alessandra-faria-b0816053/