Neural bandits paired with an optimizer that generates the set of candidate context vectors to evaluate/pull at every turn (other optimizers can be swapped in; the code is largely agnostic in that regard). Used for the detection of potentially inappropriate polypharmacies; a sketch of the resulting loop follows the links below.
Our paper: "Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy"
Original NeuralUCB / NeuralTS implementations that are modified here: https://github.com/uclaml/NeuralTS
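A minimal sketch of the loop, assuming a PyTorch backbone and a diagonal approximation of the NeuralTS posterior; `propose_candidates`, `reward_oracle`, and all hyperparameters below are illustrative placeholders, not this repo's actual API:

```python
import torch
import torch.nn as nn

dim, n_candidates, n_rounds, lam, nu = 20, 50, 30, 1.0, 0.1

net = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))
sgd = torch.optim.SGD(net.parameters(), lr=1e-2)
p = sum(w.numel() for w in net.parameters())
U = lam * torch.ones(p)  # diagonal approximation of the design matrix

def propose_candidates():
    # Stand-in for the search optimizer: anything that emits candidate
    # context vectors (here, binary drug-combination indicators) fits.
    return torch.rand(n_candidates, dim).round()

def reward_oracle(x):
    # Stand-in for the environment, e.g. an estimated risk signal.
    return (x[:5].mean() + 0.05 * torch.randn(())).item()

def grad_features(x):
    # Gradient of the network output w.r.t. its parameters, flattened:
    # these play the role of features in the neural bandit posterior.
    f = net(x).squeeze()
    g = torch.autograd.grad(f, net.parameters())
    return f, torch.cat([gi.flatten() for gi in g])

history = []
for t in range(n_rounds):
    cands = propose_candidates()
    scores = []
    for x in cands:
        f, g = grad_features(x)
        sigma2 = lam * (g * g / U).sum() / p  # diagonal posterior variance
        # Thompson sampling: draw a score from N(f, nu^2 * sigma2);
        # a NeuralUCB variant would use f + gamma * sqrt(sigma2) instead.
        scores.append(f.item() + nu * sigma2.sqrt().item() * torch.randn(()).item())
    a = int(torch.tensor(scores).argmax())
    history.append((cands[a], reward_oracle(cands[a])))
    _, g = grad_features(cands[a])
    U += g * g  # diagonal update with the pulled arm's gradient features
    for x, y in history[-32:]:  # a few SGD steps on recent history
        sgd.zero_grad()
        ((net(x).squeeze() - y) ** 2).backward()
        sgd.step()
```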
Includes:
- NeuralTS and NeuralUCB
- NeuralTS with Dropout
- NeuralTS with Lenient Regret (both variants are sketched after this list)
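The Dropout and Lenient Regret variants, sketched under the same assumptions (illustrative names, not the repo's API): keeping dropout active at inference yields approximate posterior samples (Riquelme et al., 2018), and lenient regret only counts pulls that fall more than a tolerance `eps` below the optimum (Merlis and Mannor, 2020), which fits a data-mining setting where every sufficiently dangerous combination is a satisfactory find:

```python
import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Dropout(p=0.2), nn.Linear(64, 1))

def dropout_ts_scores(net, candidates):
    # Dropout as approximate posterior sampling: leaving the network in
    # train() mode keeps the Dropout layer stochastic, so each forward
    # pass is one Thompson sample of the reward estimates.
    net.train()
    with torch.no_grad():
        return net(candidates).squeeze(-1)  # one sampled score per arm

def lenient_regret(mu_star, mu_pulled, eps=0.05):
    # Hinge-type lenient regret: a pull whose mean reward is within eps
    # of the optimal arm incurs no regret at all.
    gap = mu_star - mu_pulled
    return gap if gap > eps else 0.0

candidates = torch.rand(50, 20).round()  # toy binary combination vectors
arm = int(dropout_ts_scores(net, candidates).argmax())
```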
Relevant papers for ideas implemented in this repo:
- Zhang, Weitong, et al. "Neural Thompson sampling." arXiv preprint arXiv:2010.00827 (2020).
- Zhou, Dongruo, Lihong Li, and Quanquan Gu. "Neural contextual bandits with UCB-based exploration." International Conference on Machine Learning. PMLR, 2020.
- Riquelme, Carlos, George Tucker, and Jasper Snoek. "Deep Bayesian bandits showdown: An empirical comparison of Bayesian deep networks for Thompson sampling." arXiv preprint arXiv:1802.09127 (2018).
- Merlis, Nadav, and Shie Mannor. "Lenient regret for multi-armed bandits." arXiv preprint arXiv:2008.03959 (2020).