Hold off to do for reviewers?

Here are ideas I like and want to add. Hopefully the reviewers will ask some of the questions I expect.

Other agents

Max entopy exploration
ELBO/count-based exploration
Extrinsic reward exploration

Curiosity cat?

Train a VAE->MLP classifier with
- random batches
- softmax E batches
- argmax E batches
Finetune a pretained-SOA model using determined curiosity?
- Can curiosity further improve the state of the art?

ImageInfoBandits

Leaning w/ VAE

Foraging tasks

A grid-maze
A scent landscape
A patch world