Hard alignment
In older statistical machine translation, each target word was explicitly assigned to exactly one source word.
In older statistical machine translation, each target word was explicitly assigned to exactly one source word. Brittle, requires linguistic knowledge, and breaks for long-range dependencies. Replaced by soft alignment in neural attention models.