Navigationsmenü öffnen
AIDive
DE
Anmelden
Zurück zum Glossar

Automatic Speech Recognition (ASR)

Natural Language Processing

Technology that turns spoken speech into text using audio processing and language models.

Definition

Automatic speech recognition is used in captioning, voice assistants, dictation, call analytics, meeting minutes, and content accessibility. The system must process the sound, separate speech from noise, recognize words and assemble them into meaningful text.

Beispiel

After an online meeting, the service automatically creates a transcript of the conversation and a short summary.

Warum es wichtig ist

The term is important for users looking for transcription, voice typing, call analysis, or subtitling tools.

So funktioniert es

The system analyzes the audio signal, extracts speech features, matches them with likely words, and uses context to select the most plausible phrase.

Wo es genutzt wird

  • transcription
  • subtitles
  • voice assistants and call centers

Einschränkungen

Quality depends on language, accent, noise, microphone, overlapping voices and specialized terminology.

FAQ

Why is “Automatic Speech Recognition (ASR)” useful to know?

The term is important for users looking for transcription, voice typing, call analysis, or subtitling tools.