MIT, Northeastern & Technion Propose ROME for Efficient Locating and Editing of Factual Associations in GPT Models

In the new paper Locating and Editing Factual Associations in GPT, a research team from MIT CSAIL, Northeastern University and Technion IIT examines how information flows during knowledge recall in large autoregressive transformers and introduces Rank-One Model Editing (ROME), a simple, zero-shot principled model editor capable of locating and editing factual associations in such models.

In recent years, much of the research interest in large language models (LLMs) such as OpenAI’s autoregressive GPT has shifted from what these models can do to how they do it. While LLMs have demonstrated impressive prediction consistency with factual knowledge, their computations remain opaque. Knowing how and where such factual associations are stored and retrieved and improving understanding of the mechanisms underlying autoregressive knowledge representations are crucial for further model development and deployment.

In the new paper Locating and Editing Factual Associations in GPT, a research team from MIT CSAIL, Northeastern University and Technion IIT examines how information flows during knowledge recall in large autoregressive transformers and introduces Rank-One Model Editing (ROME), a simple, zero-shot principled model editor capable of locating and editing factual associations in such models.

Knowing how a well-performing language transformer architecture stores its factual associations can help machine learning researchers address errors involving incorrect, biased, or private information by directly editing the factual associations.

The team introduces a novel Causal Tracing method to identify the decisive computations that mediate factual recall. The method isolates the causal effects of individual states in the neural network while processing a factual statement. By tracing this information flow, it is possible to identify the modules that principally contribute to factual association retrieval.

The proposed ROME is designed for editing individual facts within a GPT model. ROME treats a single module as a key-value store in which the key encodes a subject, and the value encodes the corresponding knowledge of this subject. The model can thus recall factual associations by retrieving the value corresponding to the key, enabling the associations of individual facts to be edited and updated in both specific and generalized ways.

The team evaluated ROME on the Zero-Shot Relation Extraction (zsRE) task and on their own CounterFact dataset, which includes thousands of counterfactuals and text that allows quantitative testing of specificity and generalization when learning a counterfactual. In the evaluations, ROME showed competitive results on zsRE and maintained both specificity and generalization on the CounterFact dataset.

Overall, this work pinpoints the crucial role of mid-layer feedforward modules in storing factual associations, reveals the information flow of knowledge recall in autoregressive transformers, and demonstrates the capability of editing factual associations in such LLMs.

The code, dataset, visualizations, and an interactive demo notebook are available at https://rome.baulab.info/. The paper Locating and Editing Factual Associations in GPT is on arXiv.

Author: Hecate He | Editor: Michael Sarazen

We know you don’t want to miss any news or research breakthroughs. Subscribe to our popular newsletter Synced Global AI Weekly to get weekly AI updates.

8 comments on “MIT, Northeastern & Technion Propose ROME for Efficient Locating and Editing of Factual Associations in GPT Models”

zolisali

2023-03-19

This is such a great post

Loading...

Reply
Salima

2023-05-10

Good post. I was constantly checking this blog and I got good information.

Loading...

Reply
Salima

2023-05-14

Thank you for the good article.

Loading...

Reply
Salima

2023-10-03

His information totally impressive and best blog………………………..BY Salima FERHAT-FLL

Loading...

Reply
Anonymous

2023-12-16

1*1

Loading...

Reply
Anonymous

2023-12-16

1��%2527%2522

Loading...

Reply
MikeNike

2024-02-28

Hello, I would like to advise you to play on the best sports betting platform in Cameroon, namely MelBet. The advantages are reliability, good reputation and a huge selection of sports events to bet on. In addition, they operate legally, which is confirmed by an international license from Curacao. So, don’t hesitate and apk download, because it is the right choice. I hope you are lucky and use the newbie bonuses in time. Have a good bet and a great evening!

Loading...

Reply
Fifos Lilio

2024-11-20

I’ve always been a little skeptical about hearing devices, but after my experience here, I’m a total believer. The team was so professional and kind, making sure I understood every step of the process. After a quick and painless test, they showed me a range of options, and I was surprised by how sleek and modern they were. I chose one that’s practically invisible and easy to use, and the difference was immediate. I can finally hear my grandkids’ laughter, have clear phone calls, and enjoy the little sounds of everyday life I didn’t even know I was missing. If you’ve been putting this off, do yourself a favor and check out https://hearwellservices.com/. It’s life-changing.

Loading...

Reply

MIT, Northeastern & Technion Propose ROME for Efficient Locating and Editing of Factual Associations in GPT Models

Like this:

8 comments on “MIT, Northeastern & Technion Propose ROME for Efficient Locating and Editing of Factual Associations in GPT Models”

Leave a Reply Cancel reply

Related

Share this:

Like this:

8 comments on “MIT, Northeastern & Technion Propose ROME for Efficient Locating and Editing of Factual Associations in GPT Models”

Leave a Reply Cancel reply

Related