Idea: Can network learn without training, but instead input/output examples?

Memory is a matrix of neurons

$$ r_t <- \sum_i w_t(i)M_t(i) $$

Screen Shot 2024-03-03 at 2.42.30 PM.png

write consists of erase, add

Screen Shot 2024-03-03 at 2.43.12 PM.png

Screen Shot 2024-03-03 at 2.43.24 PM.png

Salience map

Recurrent attention model