Top-p sampling selects from the smallest set of tokens whose cumulative probability exceeds 'p'. This dynamically adjusts the pool size based on the model's confidence.
Introduced to solve issues with rigid Top-k limits.
Preferred over Top-k for generating creative text.