Activation Function

Appears in 1 paper

A non-linear function applied to a neuron's weighted sum before passing the result to the next layer.

As used in Paper 03 — Learning Representations by Back-propagating Errors →

A non-linear function applied to a neuron's weighted sum before passing the result to the next layer. Without activation functions, a multi-layer network collapses to a single linear function (no matter how many layers). Common choices: sigmoid (used in 1986 paper), tanh, ReLU (standard today).

Paper 03 — Learning Representations by Back-propagating Errors →

Appears in papers