Constitution

Appears in 1 paper

A written document specifying principles that an AI should follow.

As used in Paper 22 — Constitutional AI: Harmlessness from AI Feedback →

A written document specifying principles that an AI should follow. Anthropic's constitution included 16–18 principles like "Be honest," "Avoid harm," "Be helpful," written in natural language. The constitution is auditable and transparent.