[MASK] token
The special token used to replace selected tokens during MLM pre-training.
The special token used to replace selected tokens during MLM pre-training. The model must predict what the original token was. [MASK] never appears during fine-tuning or inference.