Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Meeting w/ Abhishek #9

Open
hjsuh94 opened this issue Mar 29, 2023 · 0 comments
Open

Meeting w/ Abhishek #9

hjsuh94 opened this issue Mar 29, 2023 · 0 comments

Comments

@hjsuh94
Copy link
Owner

hjsuh94 commented Mar 29, 2023

  1. The fundamental claim is that score matching / diffusion models can draw you to within-data distribution regimes.
  • what is the extent to which I can get o.o.d. before diffusion models start to be less effective?
  • especially on high dimensions, is this true?
  • people have thought about doing this for anomaly detection, maybe it's interesting to make connections:
  • https://openreview.net/forum?id=5tKhUU5WBi8
  • https://arxiv.org/pdf/2211.07740.pdf
  • This claims is generalizable beyond the setting of model-based offline RL, and would apply to any kind of model-based optimization? (where we have uncertainty over cost functions)
  1. Why use score-based generative models as opposed to ensembles?

TODOs:

  • Produce a simple example of a diffusion-based score field trying to pull samples within distribution.
  • Produce a simple example where diffusion-based score fields to do better than ensembles.
  • Compare with CQL: why do model-based?

Benchmarks:

  • If we have some convincing cases where this beats ensembles, might not be too necessary.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant