Context vector
Also called the **thought vector**. In a standard seq2seq model, it is a single fixed vector that summarizes the entire source sequence, typically the encoder's final hidden state.
In an attention model, a vector computed at each decoding step t as the attention-weighted sum of all encoder hidden states: cₜ = Σᵢ αₜᵢ hᵢ. Unlike the fixed seq2seq context vector, it differs at every decoding step: it is a dynamic, query-dependent read of the source.
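
The weighted sum cₜ = Σᵢ αₜᵢ hᵢ can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: it assumes dot-product scoring between a decoder query and each encoder state (the entry does not say how the weights αₜᵢ are produced), and the names `softmax`, `context_vector`, and the toy values are hypothetical.

```python
import math


def softmax(scores):
    """Turn raw alignment scores into weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def context_vector(query, encoder_states):
    """Return (c_t, alpha_t) for a single decoding step.

    Assumption: alignment scores are dot products of the decoder query
    with each encoder hidden state h_i; other scoring functions exist.
    """
    scores = [
        sum(q * h for q, h in zip(query, state)) for state in encoder_states
    ]
    alphas = softmax(scores)
    dim = len(encoder_states[0])
    # c_t = sum_i alpha_ti * h_i, computed one dimension at a time
    c_t = [
        sum(a * state[d] for a, state in zip(alphas, encoder_states))
        for d in range(dim)
    ]
    return c_t, alphas


# Toy encoder hidden states h_1..h_3 (hypothetical values).
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Two different decoder queries read the same source differently.
c_a, alpha_a = context_vector([2.0, 0.0], encoder_states)
c_b, alpha_b = context_vector([0.0, 2.0], encoder_states)
```

The encoder states are identical in both calls; only the query changes, and so do the weights and the resulting context vector. That is the sense in which the attention context vector is dynamic, while the plain seq2seq one is computed once and reused at every step.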