Skip to content
View shengpu-tang's full-sized avatar

Highlights

  • Pro

Organizations

@MLD3

Block or report shengpu-tang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. MLD3/CounterfactualAnnot-SemiOPE MLD3/CounterfactualAnnot-SemiOPE Public

    [NeurIPS 2023] Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation. https://arxiv.org/abs/2310.17146

    Jupyter Notebook 1 1

  2. MLD3/OfflineRL_FactoredActions MLD3/OfflineRL_FactoredActions Public

    [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare.

    Jupyter Notebook 9

  3. MLD3/OfflineRL_ModelSelection MLD3/OfflineRL_ModelSelection Public

    [MLHC 2021] Model Selection for Offline RL: Practical Considerations for Healthcare Settings. https://arxiv.org/abs/2107.11003

    Jupyter Notebook 8 5

  4. MLD3/RL-Set-Valued-Policy MLD3/RL-Set-Valued-Policy Public

    [ICML 2020] Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies. https://arxiv.org/abs/2007.12678, https://icml.cc/virtual/2020/poster/5797

    Jupyter Notebook 16 3

  5. MLD3/FIDDLE MLD3/FIDDLE Public

    FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algorithms. https://doi.org/10.1093/jamia/ocaa139

    Jupyter Notebook 89 19

  6. microsoft/rl-offline-simulation microsoft/rl-offline-simulation Public

    Data-driven offline simulation for online reinforcement learning: benchmark and baselines

    Python 27 6