AIDive
Back to glossary

What is Data Annotation

GlossaryAI Infrastructure

Adding explanatory labels to data so the model can learn from examples.

Definition

Data Annotation is the act of adding explanatory labels to data so that the model can learn from examples. Simply put, this concept helps build reliable services around models: data, compute, access, deployment and monitoring. In practice, it helps to understand what capabilities the tool actually has, what data it will need, and what limitations are worth checking before implementation.

Example

Markers mark road signs on images so that the computer vision model can learn to find them.

Why it matters

The quality of the layout often determines the quality of the model more than the choice of fashionable architecture. This helps you choose AI tools not by big promises, but by how they work in a real problem.

How it works

Typically, the process starts with data sources and the environment, then sets up calculations, access, automation, monitoring, and security rules. In the case of the term “Data Markup”, it is important to look separately at the data, quality criteria and application conditions.

Where it is used

  • It is found in projects where data storage, computing, integration, deployment, security and stable operation of AI services are important.

Limitations

Limitations are related to computational cost, security, data quality, latency, service availability, and maintenance complexity.