Greedy Decoding

What is Greedy Decoding?

Greedy Decoding is a text generation strategy where the model selects the single token with the highest probability at each step. It is a deterministic method, meaning it will always produce the same output for a given input. While fast and computationally simple, this approach can lead to suboptimal results, as it makes locally optimal choices without considering the overall quality of the sequence, sometimes resulting in repetitive or nonsensical text.
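The loop described above can be sketched in a few lines. The following is a minimal illustration, not a production implementation: `logits_fn` is a hypothetical stand-in for a real language model that scores every vocabulary token given the tokens generated so far.

```python
import numpy as np

def greedy_decode(logits_fn, start_token, eos_token, max_len=20):
    """At each step, pick the single highest-probability next token."""
    tokens = [start_token]
    for _ in range(max_len):
        logits = logits_fn(tokens)           # scores over the vocabulary
        next_token = int(np.argmax(logits))  # locally optimal choice
        tokens.append(next_token)
        if next_token == eos_token:          # stop at end-of-sequence
            break
    return tokens

# Toy "model": deterministically prefers the token after the last one.
def toy_logits(tokens):
    vocab_size = 5
    scores = np.zeros(vocab_size)
    scores[(tokens[-1] + 1) % vocab_size] = 1.0
    return scores

print(greedy_decode(toy_logits, start_token=0, eos_token=4))  # [0, 1, 2, 3, 4]
```

Because `argmax` involves no randomness, running the function twice on the same input always yields the identical sequence, which is the determinism noted above.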

Where did the term "Greedy Decoding" come from?

Greedy decoding is the most straightforward and fundamental approach to decoding for sequence generation models, serving as a simple baseline for more complex methods.
The name borrows from "greedy algorithms" in computer science, which at each step make the locally optimal choice without reconsidering earlier decisions.

How is "Greedy Decoding" used today?

While it is a foundational concept, Greedy Decoding is often replaced by more advanced decoding strategies such as beam search, top-k sampling, and nucleus (top-p) sampling, which can produce more diverse and coherent text. It is still used in situations where fast, deterministic output is a priority, such as in certain translation or code generation tasks.
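The contrast with sampling-based strategies can be seen on a single decoding step. This toy sketch (the distribution is made up for illustration) compares the greedy choice with a top-k sample over the same next-token probabilities:

```python
import numpy as np

probs = np.array([0.5, 0.3, 0.15, 0.05])  # toy next-token distribution

# Greedy: always the argmax -- the same token on every run.
greedy_pick = int(np.argmax(probs))  # token 0

# Top-k sampling (k=3): keep the k most likely tokens,
# renormalize their probabilities, then draw one at random.
k = 3
rng = np.random.default_rng(seed=0)
top_k_idx = np.argsort(probs)[-k:]                      # indices of top-3 tokens
top_k_probs = probs[top_k_idx] / probs[top_k_idx].sum() # renormalize
sampled_pick = int(rng.choice(top_k_idx, p=top_k_probs))

print(greedy_pick, sampled_pick)
```

Greedy always returns token 0 here, while top-k sampling may return any of the three most likely tokens, which is where the extra diversity comes from.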

Related Terms