RyanLiu112

Follow

🎯

Focusing

Runze Liu RyanLiu112

🎯

Focusing

Follow

I am Runze Liu, a second-year master's student at Tsinghua University.

20 followers · 15 following

Tsinghua University
Qingdao
05:18 (UTC +08:00)
https://ryanliu112.github.io
https://scholar.google.com/citations?user=LiIfGakAAAAJ

Achievements

Achievements

Highlights

Pro

RyanLiu112/README.md

👋 Hi, I’m Runze Liu, a second-year master student at Tsinghua Unversity.
👀 I’m interested in Large Language Models (LLMs), Reinforcement Learning (RL) and Reinforcement Learning from Human Feedback (RLHF).

Pinned Loading

compute-optimal-tts compute-optimal-tts Public

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 195 17
Awesome-Process-Reward-Models Awesome-Process-Reward-Models Public

A comprehensive collection of process reward models.

5
MRN MRN Public

[NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning".

Python 20 5
ChangWinde/RAT ChangWinde/RAT Public

[AAAI 2025 Oral] Official code for "RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors"

Python 10