Token
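A unit of text, roughly a word or subword. GPT-3 uses a vocabulary of about 50,000 tokens (50,257, to be exact). Longer text produces more tokens, and more tokens mean higher computational cost. The context window (about 2,000 tokens; 2,048 for GPT-3) limits how much text the model can handle at once, counting the prompt and the generated output together.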
A unit of text, roughly a word or subword. GPT-2 uses a vocabulary of 50,257 tokens (about 50K); tokens are the building blocks of both training data and model inputs.
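
To make the idea of subword tokens and a context window concrete, here is a minimal sketch in Python. It is illustrative only: TOY_VOCAB, tokenize, and fits_context are hypothetical names, the vocabulary is a tiny hand-made stand-in for a real learned vocabulary of about 50K entries, and the greedy longest-match split is a simplification. GPT-2 and GPT-3 actually use byte-pair encoding, which builds tokens through learned merge rules.

```python
# Hypothetical toy vocabulary. Real tokenizers learn roughly 50K entries from data.
TOY_VOCAB = {"token", "iz", "ation", "un", "believ", "able", "the", "cat", " "}


def tokenize(text, vocab=TOY_VOCAB):
    """Split text into tokens by greedy longest match against vocab.

    Characters not covered by any vocabulary entry become
    single-character tokens, so the function never fails on unknown input.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate substring first, then shrink it.
        for j in range(len(text), i, -1):
            if text[i:j] in vocab:
                tokens.append(text[i:j])
                i = j
                break
        else:
            # No vocabulary entry starts here: emit one character.
            tokens.append(text[i])
            i += 1
    return tokens


def fits_context(tokens, context_window=2048):
    """Return True if the token count fits in the context window.

    The default of 2048 matches GPT-3's window; other models differ.
    """
    return len(tokens) <= context_window


print(tokenize("tokenization"))   # ['token', 'iz', 'ation']
print(tokenize("unbelievable"))   # ['un', 'believ', 'able']
print(fits_context(tokenize("the cat")))  # True
```

Note that one word can become several tokens, which is why token counts, not word counts, determine both cost and whether a text fits in the context window.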