Navigationsmenü öffnen
AIDive
DE
Anmelden
Zurück zum Glossar

Actor-Critic Methods

Machine Learning

A class of reinforcement learning methods where one component selects actions and another evaluates their utility.

Definition

There are two roles in actor-critic methods. The actor proposes an action in the environment, and the critic evaluates the expected benefit and helps the actor learn faster. This approach is often used where the agent must make sequential decisions: in robotics, games, resource management or route optimization.

Beispiel

The robot learns to walk: the actor chooses the movement of the leg, and the critic evaluates whether it helped maintain balance and get closer to the goal.

Warum es wichtig ist

The term is important for understanding autonomous agents and systems that do not just classify data, but learn to act in a changing environment.

So funktioniert es

The agent receives the state of the environment, selects an action, receives a reward, and updates two parts of the model: the action strategy and the evaluation function.

Wo es genutzt wird

  • reinforcement learning
  • robotics
  • autonomous agents and games

Einschränkungen

Methods can be unstable, require many simulations, and are sensitive to rewards. An incorrectly specified reward leads to unexpected behavior by the agent.

FAQ

Why is “Actor-Critic Methods” useful to know?

The term is important for understanding autonomous agents and systems that do not just classify data, but learn to act in a changing environment.