Inference is the phase where a trained model is put to work, processing live data to make predictions or generate content.
Contrasts with the 'training' phase.
The operational cost of AI is often measured in inference costs.