Instruction-Following

Appears in 1 paper

The ability of a language model to accurately follow user instructions and respond helpfully.

As used in Paper 15 — Training Language Models to Follow Instructions with Human Feedback →

The ability of a language model to accurately follow user instructions and respond helpfully. Emerges from RLHF training on preferences that reward clarity, relevance, and instruction adherence. InstructGPT demonstrates strong instruction-following while GPT-3 often ignores or misinterprets instructions.

Paper 15 — Training Language Models to Follow Instructions with Human Feedback →

Appears in papers