Sampling Temperature
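A hyperparameter in language model decoding that controls randomness. The model's logits are divided by the temperature before the softmax, so lower values sharpen the next-token distribution and higher values flatten it. High temperature (e.g., 1.0, which leaves the model's distribution unscaled) produces more diverse, creative generations. Low temperature (e.g., 0.1) produces more deterministic, conservative outputs, approaching greedy decoding as the temperature nears 0. Best-of-N typically uses higher temperature to ensure diversity across samples, since N near-identical samples leave little to choose between.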
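The effect of temperature on the next-token distribution can be illustrated with a minimal sketch. The function names (`softmax_with_temperature`, `sample_token`) and the toy logits are assumptions made for illustration, not part of any particular library; real decoders apply the same scaling to logits over the full vocabulary.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Return softmax(logits / temperature) as a list of probabilities."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature, rng=random):
    """Draw one token index from the temperature-scaled distribution."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical logits for a three-token vocabulary.
logits = [2.0, 1.0, 0.1]
low = softmax_with_temperature(logits, 0.1)   # sharply peaked on token 0
high = softmax_with_temperature(logits, 1.0)  # the unscaled distribution
```

With these toy logits, nearly all of the probability mass sits on the first token at temperature 0.1, while at 1.0 the other tokens keep a meaningful share. That spread is what lets repeated calls to `sample_token` return different tokens across Best-of-N samples.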