Imbalanced Data

What is Imbalanced Data?

In machine learning, imbalanced data refers to a situation where the number of observations per class is not equally distributed. This is a common problem in many real-world datasets, such as fraud detection and medical diagnosis.

Where did the term "Imbalanced Data" come from?

A common problem in machine learning.

How is "Imbalanced Data" used today?

Can lead to models that are biased towards the majority class.

Related Terms