Softmax is an activation function that converts a vector of raw scores (logits) into a probability distribution that sums to 1.0.
Standard for multi-class classification.
The final layer of almost every classifier.