AIDive
Back to glossary

What is Active Learning

GlossaryMachine Learning

A machine learning method in which the model itself selects the most useful examples for human labeling.

Definition

Active learning helps you save time and budget on data labeling. Instead of labeling thousands of random examples, the system finds the cases in which the model doubts the most. The expert marks them, and the model quickly improves the quality.

Example

For an image moderation system, a model can ask a person to mark controversial images where she is not sure whether this is acceptable content or a violation.

Why it matters

This is important for projects where markup is expensive: medicine, legal documents, industrial control, rare languages ​​or complex moderation.

How it works

The model is trained on a small sample, evaluates unlabeled data, selects the most vague or informative examples, receives expert responses, and repeats the cycle.

Where it is used

  • data markup
  • training models with an expert
  • improving quality on a small budget

Limitations

The method does not help if the initial data is bad or the expert labels inconsistently. It is important to ensure that the model does not become fixated on a narrow group of complex examples.