Here are ideas I like and want to add. Hopefully the reviewers will ask some of the questions I expect.
- Max entropy exploration
- ELBO/count-based exploration
- Extrinsic reward exploration
- Train a VAE->MLP classifier with:
  - random batches
  - softmax E batches
  - argmax E batches
- Fine-tune a pretrained state-of-the-art model using determined curiosity?
- Can curiosity further improve the state of the art?
- Learning w/ VAE
- A grid-maze
- A scent landscape
- A patch world
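
To make the three batch schemes for the VAE->MLP classifier idea concrete (random, softmax E, argmax E), here is a minimal sketch. It assumes that E is a per-example score (for instance a curiosity or information value), which the list above does not define. The function name `select_batch` and the `temperature` parameter are hypothetical and exist only for illustration.

```python
import numpy as np

def select_batch(E, batch_size, mode="random", temperature=1.0, rng=None):
    """Return indices of a training batch chosen from per-example scores E.

    Assumption: E holds one score per example (e.g. a curiosity value).
    """
    rng = np.random.default_rng() if rng is None else rng
    E = np.asarray(E, dtype=float)
    n = E.shape[0]

    if mode == "random":
        # Baseline: ignore E and sample uniformly without replacement.
        return rng.choice(n, size=batch_size, replace=False)

    if mode == "softmax":
        # Sample in proportion to exp(E / temperature).
        # Subtracting the max keeps the exponentials numerically stable.
        z = (E - E.max()) / temperature
        p = np.exp(z)
        p /= p.sum()
        return rng.choice(n, size=batch_size, replace=False, p=p)

    if mode == "argmax":
        # Greedy: take the batch_size highest-scoring examples.
        return np.argsort(E)[::-1][:batch_size]

    raise ValueError(f"unknown mode: {mode!r}")


if __name__ == "__main__":
    scores = [0.1, 2.0, 0.5, 3.0, 1.0]  # made-up scores for illustration
    for mode in ("random", "softmax", "argmax"):
        print(mode, select_batch(scores, 2, mode=mode))
```

In this sketch the softmax scheme sits between the other two: a large `temperature` approaches uniform random batches, and a small one approaches the greedy argmax batches.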