Test-Time Compute refers to models that spend time generating hidden 'thinking tokens' before producing a final answer, effectively trading time for increased intelligence (similar to human System 2 thinking).
Popularized by OpenAI's o1 and DeepSeek R1.
A new scaling law for reasoning capabilities.