Recurrent Neural Networks
What
Neural nets with a loop: the hidden state at each step feeds back as an input to the next step. Designed for sequential data (text, time series).
h_t = activation(W_h × h_{t-1} + W_x × x_t + b)
Each hidden state h_t is a function of the current input AND the previous hidden state → memory of past inputs.
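The recurrence above can be sketched as a plain NumPy loop (dimensions and the tanh activation are illustrative choices, not fixed by the formula):

```python
import numpy as np

rng = np.random.default_rng(0)
input_dim, hidden_dim, seq_len = 3, 4, 5

# The same weights are shared across every time step
W_h = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
W_x = rng.normal(scale=0.1, size=(hidden_dim, input_dim))
b = np.zeros(hidden_dim)

def rnn_forward(xs, h0):
    """h_t = tanh(W_h @ h_{t-1} + W_x @ x_t + b); returns all hidden states."""
    h, hs = h0, []
    for x_t in xs:                       # strictly sequential: h_t needs h_{t-1}
        h = np.tanh(W_h @ h + W_x @ x_t + b)
        hs.append(h)
    return hs

xs = rng.normal(size=(seq_len, input_dim))
hs = rnn_forward(xs, np.zeros(hidden_dim))
print(len(hs), hs[-1].shape)  # 5 (4,)
```

Note the loop body depends on the previous iteration's result, which is exactly why this computation cannot be parallelized across time steps.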
Variants
LSTM (Long Short-Term Memory)
Adds gates (forget, input, output) to control what to remember and forget. Mitigates the vanishing gradient problem for long sequences.
GRU (Gated Recurrent Unit)
Simplified LSTM with fewer gates. Similar performance, fewer parameters.
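The "fewer parameters" claim follows directly from the gate counts: an LSTM cell has four weight blocks (three gates plus the cell candidate), a GRU has three (two gates plus the candidate). A quick sketch, assuming one bias vector per block (libraries such as PyTorch use two, so exact counts differ):

```python
def gated_rnn_params(input_dim, hidden_dim, n_blocks):
    # Each gate/candidate block: input matrix + recurrent matrix + bias
    per_block = hidden_dim * input_dim + hidden_dim * hidden_dim + hidden_dim
    return n_blocks * per_block

x, h = 128, 256
lstm = gated_rnn_params(x, h, 4)  # forget, input, output gates + cell candidate
gru = gated_rnn_params(x, h, 3)   # reset, update gates + candidate
print(lstm, gru, gru / lstm)      # 394240 295680 0.75
```

So a GRU layer is always exactly 3/4 the size of the matching LSTM layer under this counting.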
Mostly historical now
Transformers have replaced RNNs for most tasks because:
- RNNs process sequentially → can’t parallelize → slow
- Even LSTMs struggle with very long sequences
- Transformers process all positions in parallel with Attention Mechanism
Still useful for
- Small sequential problems where transformers are overkill
- On-device inference with strict memory constraints
- Understanding the history of NLP/sequence modeling
Links
- Transformers — the successor
- Attention Mechanism
- Vanishing and Exploding Gradients
- NLP Roadmap