Abrir menu de navegação
AIDive
PT
Entrar
Voltar ao glossário

Attention Mechanisms

Deep Learning

Neural network components that help the model identify important parts of the input data and take into account the relationships between them.

Definição

Attention mechanisms have become a key idea for modern language models and transformers. Instead of treating all parts of the text or image equally, the model evaluates which elements are more important for the current step. This helps you connect words in a long phrase, take context into account, and find meaningful relationships.

Exemplo

In the sentence “she put the book on the table because it was empty,” the attention mechanism helps the model associate the pronoun with the correct object.

Por que importa

The term is important for understanding why modern models work better with text, code, images and long context.

Como funciona

The model calculates attention weights between input elements. The higher the weight, the more strongly one element influences the processing of another.

Onde é usado

  • language models
  • machine translation
  • image and text analysis

Limitações

Attention requires computational resources, especially on long sequences. It is also not always a direct explanation of the model's thinking.

FAQ

Why is “Attention Mechanisms” useful to know?

The term is important for understanding why modern models work better with text, code, images and long context.