[CLS] token
A special token prepended to every BERT input. Its final hidden state serves as an aggregate representation of the whole sequence and is fed to the classifier head for sentence-level tasks (sentiment, NSP, entailment). The vector becomes a useful summary only through training on such a task.
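To make the mechanics concrete, here is a minimal sketch in plain Python of how a classifier head reads only the position-0 vector. The hidden states, weights, and the helper names `cls_vector` and `classify` are made up for illustration and do not come from any library. The sketch also skips the dense + tanh pooling layer that BERT implementations usually apply to the [CLS] state before the classifier.

```python
import math

def cls_vector(hidden_states):
    """Return the final hidden state at position 0, where [CLS] sits."""
    return hidden_states[0]

def classify(hidden_states, weights, bias):
    """Toy linear head followed by softmax, applied to the [CLS] vector only."""
    h = cls_vector(hidden_states)
    logits = [sum(w * x for w, x in zip(row, h)) + b
              for row, b in zip(weights, bias)]
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical final-layer output for "[CLS] great film [SEP]":
# 4 token positions, hidden size 3 (real BERT-base uses 768).
hidden_states = [
    [0.5, -1.0, 2.0],   # [CLS]
    [0.1, 0.3, -0.2],   # great
    [0.0, 0.7, 0.4],    # film
    [-0.3, 0.2, 0.1],   # [SEP]
]
weights = [[1.0, 0.0, 0.5], [-1.0, 0.5, 0.0]]  # 2 classes x hidden size 3
bias = [0.0, 0.0]

probs = classify(hidden_states, weights, bias)
print(probs)
```

The other token positions never reach the head directly. Whatever they contribute has to flow into the [CLS] state through self-attention in the encoder layers.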