AIDive
Back to glossary

What is Constitutional AI

GlossaryEthics & Safety

A method of training AI where a model's behavior is guided through a set of principles and rules.

Definition

Constitutional AI is a method of training AI where a model's behavior is guided through a set of principles and rules. Simply put, this concept helps assess risk, liability, safety, and compliance. In practice, it helps to understand what capabilities the tool actually has, what data it will need, and what limitations are worth checking before implementation.

Example

The model evaluates and corrects the response based on principles of safety and usefulness, not just manual markup.

Why it matters

The approach is important for secure assistants, but specific principles and implementations need to be checked against the developer's sources. This helps you choose AI tools not by big promises, but by how they work in a real problem.

How it works

First, stakeholders, data, and potential harm are identified, then checks, restrictions, audits, and responsibilities are introduced. In the case of the term “Constitutional AI”, it is important to look separately at the data, quality criteria and application conditions.

Where it is used

  • Important in products where AI impacts people, personal data, security, legal risks or decision making.

Limitations

Risks change as laws, products and data change, so these pages require regular editorial review.