The mathematics behind RW1972

The most influential associative learning model, RW1972 (Rescorla & Wagner, 1972), learns from global error and posits no changes in stimulus associability.

1 - Generating expectations

Let v_k, j denote the associative strength from stimulus k to stimulus j. On any given trial, the expectation of stimulus j, e_j, is given by:

$$ \tag{Eq.1} e_j = \sum_{k}^{K}x_k v_{k,j} $$

x_k denotes the presence (1) or absence (0) of stimulus k, and the set K represents all stimuli in the design.

2 - Learning associations

Changes to the association from stimulus i to j, v_i, j, are given by:

Δv_i, j = α_iβ_j(λ_j − e_j)

where α_i is the associability of stimulus i, β_j is a learning rate parameter determined by the properties of j¹, and λ_j is a the maximum association strength supported by j (the asymptote).

3 - Generating responses

There is no specification of response-generating mechanisms in RW1972. However, the simplest response function that can be adopted is the identity function on stimulus expectations. If so, the responses reflecting the nature of j, r_j, are given by:

r_j = e_j

References

Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. (pp. 64–69). Appleton-Century-Crofts.

The implementation of RW1972 allows the specification of independent β values for present and absent stimuli (beta_on and beta_off, respectively).↩︎

- The mathematics behind RW1972