Caesar107

🎯

Focusing

Menglin Zou Caesar107

2 followers · 0 following

University of Auckland

Achievements

Highlights

Pro

Pinned

PolynomialTime/TRRL Public

Trust Region Reward Learning

Python 3 2
IQ-learn Public

An improved version of IQ-Learn that adds KL divergence and reward as baselines and is adapted to Gym Atari and MuJoCo environments.

Python
TRRL Public

Forked from PolynomialTime/TRRL

Trust Region Reward Learning

Python
HyPE Public

An adapted version of the HyPE algorithm for Gym, MuJoCo, and Atari environments.

Python