LoRA (Low-Rank Adaptation) is a parameter-efficient fine-tuning (PEFT) technique that freezes the pre-trained model weights and injects trainable rank decomposition matrices into each layer of the Transformer architecture.
It was introduced by Microsoft researchers (Hu et al., 2021) to reduce the memory requirements of fine-tuning large models.
It has since become the most popular method for community fine-tuning of open-weights models such as Llama and Stable Diffusion.
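Concretely, LoRA keeps a pre-trained weight matrix W0 (shape d×k) frozen and learns a low-rank update ΔW = BA, where B is d×r, A is r×k, and the rank r is much smaller than d and k, so the forward pass becomes h = W0·x + BA·x. The sketch below illustrates this in PyTorch under stated assumptions: the wrapper class name LoRALinear, the default rank and scaling values, and the init scheme are illustrative choices for this example, not an official implementation.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen nn.Linear with a trainable low-rank update:
    y = x @ W0.T + (alpha / r) * x @ A.T @ B.T
    """

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        # Freeze the pre-trained weights (and bias, if present).
        for p in self.base.parameters():
            p.requires_grad = False
        # Low-rank factors: A projects down to rank r, B projects back up.
        # B starts at zero so the initial update is zero and the wrapped
        # layer exactly reproduces the frozen pre-trained layer.
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r  # scale applied to the low-rank update

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen forward pass plus the trainable low-rank correction.
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling
```

With r = 8 on a 4096×4096 layer, only about 65K parameters train instead of roughly 16.8M, and after training the product BA can be merged into W0, so inference adds no extra latency.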