Abrir menu de navegação
AIDive
PT
Entrar
Voltar ao glossário

Automatic Speech Recognition (ASR)

Natural Language Processing

Technology that turns spoken speech into text using audio processing and language models.

Definição

Automatic speech recognition is used in captioning, voice assistants, dictation, call analytics, meeting minutes, and content accessibility. The system must process the sound, separate speech from noise, recognize words and assemble them into meaningful text.

Exemplo

After an online meeting, the service automatically creates a transcript of the conversation and a short summary.

Por que importa

The term is important for users looking for transcription, voice typing, call analysis, or subtitling tools.

Como funciona

The system analyzes the audio signal, extracts speech features, matches them with likely words, and uses context to select the most plausible phrase.

Onde é usado

  • transcription
  • subtitles
  • voice assistants and call centers

Limitações

Quality depends on language, accent, noise, microphone, overlapping voices and specialized terminology.

FAQ

Why is “Automatic Speech Recognition (ASR)” useful to know?

The term is important for users looking for transcription, voice typing, call analysis, or subtitling tools.