GGUF

What is GGUF?

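GGUF is a binary file format that stores an LLM's weights (often quantized) and its metadata in a single file, designed for running models on consumer hardware (CPUs and GPUs). Runtimes that load GGUF files, such as llama.cpp, can offload some of the model's layers to the GPU and keep the rest on the CPU, which works around VRAM limits.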
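Because GGUF is a file format, a file can be recognized from its first bytes. The sketch below reads the fixed-size header, assuming the layout used by GGUF version 2 and later: the 4-byte magic "GGUF", then a little-endian uint32 version, a uint64 tensor count, and a uint64 metadata key/value count. The names read_gguf_header and GGUFHeader are hypothetical helpers made up for this illustration and are not part of any library. The demo input is synthetic bytes, not a real model.

```python
import io
import struct
from typing import BinaryIO, NamedTuple


class GGUFHeader(NamedTuple):
    """Hypothetical container for the fixed-size GGUF header fields."""
    version: int
    tensor_count: int
    metadata_kv_count: int


def read_gguf_header(stream: BinaryIO) -> GGUFHeader:
    """Parse the start of a GGUF file (assumes the v2+ little-endian layout)."""
    magic = stream.read(4)
    if magic != b"GGUF":
        raise ValueError(f"not a GGUF file: magic bytes are {magic!r}")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count.
    fields = stream.read(20)
    if len(fields) != 20:
        raise ValueError("truncated GGUF header")
    version, tensor_count, metadata_kv_count = struct.unpack("<IQQ", fields)
    return GGUFHeader(version, tensor_count, metadata_kv_count)


if __name__ == "__main__":
    # Synthetic header for demonstration only; the counts are arbitrary.
    fake_file = io.BytesIO(b"GGUF" + struct.pack("<IQQ", 3, 291, 24))
    print(read_gguf_header(fake_file))
```

For a real model, open the file with open(path, "rb") and pass the file object in. The metadata entries and tensor descriptions that follow the header are not parsed here.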

Where did the term "GGUF" come from?

The term was popularized by the llama.cpp project, which introduced GGUF as the successor to its earlier GGML file format.

How is "GGUF" used today?

GGUF is the de facto standard format for local LLM inference on MacBooks and PCs.

Related Terms