SL-CAI (Supervised Learning Constitutional AI)

Appears in 1 paper

The first, supervised stage of Constitutional AI; it precedes the reinforcement learning stage (RL-CAI).

As used in Paper 22 (Constitutional AI: Harmlessness from AI Feedback)

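A helpful-only model responds to prompts designed to elicit harmful outputs. The same model is then asked to critique each response against a principle from the constitution and to revise the response in light of that critique; the critique and revision step can be repeated. A pretrained model is fine-tuned on the final revisions, so the supervised training dataset consists of (prompt, revised response) pairs.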
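
The data-collection loop described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the paper's implementation: `generate`, `build_sl_cai_dataset`, `toy_model`, and the prompt templates are hypothetical names and wording introduced here, and principles are cycled in order for reproducibility where a real pipeline might sample them at random. The sketch stores (prompt, final revision) records, on the assumption that fine-tuning targets the revision conditioned on the original prompt.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for a language model call: prompt text in, completion out.
Generate = Callable[[str], str]


def build_sl_cai_dataset(
    generate: Generate,
    red_team_prompts: List[str],
    principles: List[str],
    n_revisions: int = 1,
) -> List[Dict[str, str]]:
    """Collect (prompt, final revision) records via critique-and-revise rounds."""
    dataset: List[Dict[str, str]] = []
    for prompt in red_team_prompts:
        # Initial response to a prompt designed to elicit harmful output.
        response = generate(prompt)
        for i in range(n_revisions):
            # Assumption: cycle through principles in order (deterministic).
            principle = principles[i % len(principles)]
            critique = generate(
                f"{prompt}\n{response}\nCritique request: {principle}"
            )
            response = generate(
                f"{prompt}\n{response}\nCritique: {critique}\n"
                "Revision request: rewrite the response to address the critique."
            )
        # Only the final revision is kept as the fine-tuning target.
        dataset.append({"prompt": prompt, "completion": response})
    return dataset


def toy_model(text: str) -> str:
    """Toy model used only to make the sketch runnable end to end."""
    if "Revision request:" in text:
        return "REVISED"
    if "Critique request:" in text:
        return "CRITIQUE"
    return "DRAFT"


if __name__ == "__main__":
    records = build_sl_cai_dataset(
        toy_model,
        red_team_prompts=["p1", "p2"],
        principles=["Identify ways the response is harmful."],
        n_revisions=2,
    )
    print(records)
```

The resulting records would then be passed to an ordinary supervised fine-tuning routine; that step is omitted because it depends on the training framework in use.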