Constitutional AI (CAI)

Appears in 1 paper

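An alignment methodology that trains a model to be harmless using AI-generated feedback guided by a written set of principles (a "constitution"), in place of human labels identifying harmful outputs.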

As used in Paper 22 — Constitutional AI: Harmlessness from AI Feedback →

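In this paper, CAI has two stages. SL-CAI is the supervised learning stage: the model critiques and revises its own responses according to constitutional principles, then is fine-tuned on the revised responses. RL-CAI is the reinforcement learning stage: a model compares pairs of responses against the constitution, and these AI-generated preferences train a preference model that supplies the reward signal.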
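The data-generation steps behind the two stages can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: `Model`, `critique_and_revise`, `ai_preference`, and `toy_model` are hypothetical names, the prompt wording is invented, and the judge returns a hard A/B choice here for simplicity. Fine-tuning and RL training themselves are out of scope.

```python
from typing import Callable, List, Tuple

# Hypothetical stand-in for a language model: prompt text in, completion out.
Model = Callable[[str], str]


def critique_and_revise(model: Model, prompt: str, principles: List[str]) -> str:
    """SL-CAI data step (sketch): draft, then critique and revise per principle."""
    response = model(prompt)
    for principle in principles:
        critique = model(
            f"Critique this response using the principle: {principle}\n"
            f"Prompt: {prompt}\nResponse: {response}"
        )
        response = model(
            "Revise the response to address the critique.\n"
            f"Critique: {critique}\nResponse: {response}"
        )
    # (prompt, response) pairs would become supervised fine-tuning data.
    return response


def ai_preference(
    judge: Model, prompt: str, a: str, b: str, principle: str
) -> Tuple[str, str]:
    """RL-CAI data step (sketch): return (chosen, rejected) as picked by the judge."""
    answer = judge(
        f"Principle: {principle}\nPrompt: {prompt}\n(A) {a}\n(B) {b}\n"
        "Which response better follows the principle? Answer A or B."
    )
    return (a, b) if answer.strip().upper().startswith("A") else (b, a)


def toy_model(text: str) -> str:
    """Canned replies so the sketch runs without a real model (assumption)."""
    if text.startswith("Critique"):
        return "The response could be more careful."
    if text.startswith("Revise"):
        return "A more careful response."
    if text.startswith("Principle"):
        return "B"
    return "A first draft."


revised = critique_and_revise(toy_model, "Hi", ["Be harmless."])
chosen, rejected = ai_preference(toy_model, "Hi", "x", "y", "Be harmless.")
```

In a real pipeline `toy_model` would be replaced by calls to an actual language model, and the (chosen, rejected) pairs would be used to train a preference model.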
