Paper 06

Further reading — Seq2Seq (2014)

Further reading — Paper 06

Blogs, videos, code, and Indian-language resources. Start at the top and work down.

The original papers

  • Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to Sequence Learning with Neural Networks. NeurIPS. https://arxiv.org/abs/1409.3215 The paper we just read. Remarkably readable compared to modern papers — pay special attention to their explanation of the reverse-input trick in the introduction.

  • Cho, K. et al. (2014). Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation. EMNLP. https://arxiv.org/abs/1406.1078 Published a few months before Sutskever’s paper, introduced the encoder-decoder idea using GRUs instead of LSTMs. Complementary reading.

Blog posts

Videos

Code and tutorials

Indian-language projects you can try

  • AI4Bharat — IndicTrans2. https://github.com/AI4Bharat/IndicTrans2 Translation models across 22 scheduled Indian languages, open-source and pretrained. Successor of this paper’s architecture.

  • AI4Bharat — IndicBART. https://github.com/AI4Bharat/indic-bart An encoder-decoder Transformer pre-trained on Indian languages — modern descendant of seq2seq.

  • Bhashini (National Language Translation Mission). https://bhashini.gov.in/ India’s government-backed translation platform. Read about the Digital India Bhashini Division (DIBD) and the underlying tech.

  • Samanantar corpus (AI4Bharat). https://ai4bharat.iitm.ac.in/samanantar 49 million parallel sentence pairs across 11 Indian languages. Excellent training dataset if you want to train your own seq2seq model from scratch.

  • iNLTK (Indic NLP Toolkit). https://inltk.readthedocs.io/ Simple Python API including translation utilities for Hindi, Tamil, Bengali, and more.

Academic resources in India

  • IIT Madras — AI4Bharat. https://ai4bharat.org/ Home of the state-of-the-art open-source Indian-language NLP work.

  • IIT Bombay — CFILT (Center for Indian Language Technology). http://www.cfilt.iitb.ac.in/ Long history of Indian-language MT, including seq2seq-era work and the Hindi WordNet.

  • IIIT Hyderabad — LTRC (Language Technologies Research Centre). http://ltrc.iiit.ac.in/ Parsing and translation research for Indian languages.

Reading order to understand modern NLP

If you’re continuing through this series:

  1. ✅ Paper 06 (Seq2Seq) — you just finished this.
  2. Paper 07 (Bahdanau Attention) — patch seq2seq’s bottleneck.
  3. Paper 08 (Transformer) — throw out LSTMs, keep only attention.
  4. Paper 10 (GPT-1) and Paper 11 (BERT) — the two faces of modern pretraining.

Back to Paper 06 home · Glossary · Quiz.