Commit dd06163

Merge pull request #221 from huangshiyu13/main
update
2 parents fdcbbe2 + 04bebb7 commit dd06163

6 files changed, +32 -10 lines changed

README.md

+1 -1
@@ -59,7 +59,7 @@ Currently, the features supported by OpenRL include:
 - Reinforcement learning training support for natural language tasks (such as dialogue)

 - Support [Arena](https://openrl-docs.readthedocs.io/en/latest/arena/index.html) , which allows convenient evaluation of
-various agents in a competitive environment.
+various agents (even submissions for [JiDi](https://openrl-docs.readthedocs.io/en/latest/arena/index.html#performing-local-evaluation-of-agents-submitted-to-the-jidi-platform-using-openrl)) in a competitive environment.

 - Importing models and datasets from [Hugging Face](https://huggingface.co/)

README_zh.md

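+1 -1
@@ -51,7 +51,7 @@ OpenRL is developed on PyTorch, aiming to provide the reinforcement learning research community with a
 - Support for offline reinforcement learning training with expert data
 - Support for self-play training
 - Reinforcement learning training support for natural language tasks (such as dialogue)
-- Support for [Arena](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html), which allows convenient evaluation of various agents in multi-agent competitive environments
+- Support for [Arena](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html), which allows convenient evaluation of various agents (even agents submitted to the [JiDi platform](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html#openrl)) in multi-agent competitive environments
 - Support for importing models and datasets from [Hugging Face](https://huggingface.co/)
 - A [detailed tutorial](https://openrl-docs.readthedocs.io/zh/latest/custom_env/index.html) on integrating user-defined environments into OpenRL.
 - Support for models such as LSTM, GRU, and Transformer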

examples/arena/README.md

+13 -2
@@ -1,6 +1,17 @@

-## Usage
+### Installation
+
+```bash
+pip install "openrl[selfplay]"
+```
+
+### Usage

 ```shell
 python run_arena.py
-```
+```
+
+
+### Evaluate Google Research Football submissions for JiDi locally
+
+If you want to evaluate your Google Research Football submissions for JiDi locally, please try to use tizero as illustrated [here](foothttps://github.com/OpenRL-Lab/TiZero#evaluate-jidi-submissions-locally).

examples/gfootball/README.md

+1 -1
@@ -8,7 +8,7 @@ This is the guidance for [Google Research Football](https://github.com/google-re

 ### Evaluate JiDi submissions locally

-If you want to evaluate your JiDi submissions locally, please try to use tizero as illustrated [here](https://github.com/OpenRL-Lab/TiZero#evaluate-jidi-submissions-locally).
+If you want to evaluate your JiDi submissions locally, please try to use tizero as illustrated [here](foothttps://github.com/OpenRL-Lab/TiZero#evaluate-jidi-submissions-locally).


 ### Convert dump file to video

examples/selfplay/README.md

+7 -2
@@ -13,10 +13,15 @@ python train_selfplay.py
 ```


-## Play with a trained agent
+## Play with a Trained Agent

 Users can play with a trained agent via:

 ```shell
 python human_vs_agent.py
-```
+```
+
+
+## Evaluate Trained Agents
+
+If you want to evaluate your trained agents, please try to use OpenRL Arena as illustrated [here](https://openrl-docs.readthedocs.io/en/latest/arena/index.html)

examples/snake/README.md

+9 -3
@@ -1,19 +1,19 @@

 This is the example for the snake game.

-## Installation
+### Installation

 ```bash
 pip install "openrl[selfplay]"
 ```

-## Usage
+### Usage

 ```bash
 python train_selfplay.py
 ```

-## Evaluate JiDi submissions locally
+### Evaluate JiDi submissions locally

 ```bash
 python jidi_eval.py
@@ -26,3 +26,9 @@ Submition site: http://www.jidiai.cn/env_detail?envid=1.
 Snake senarios: [here](https://github.com/jidiai/ai_lib/blob/7a6986f0cb543994277103dbf605e9575d59edd6/env/config.json#L94)
 Original Snake environment: [here](https://github.com/jidiai/ai_lib/blob/master/env/snakes.py)

+
+
+
+### Evaluate Google Research Football submissions for JiDi locally
+
+If you want to evaluate your Google Research Football submissions for JiDi locally, please try to use tizero as illustrated [here](foothttps://github.com/OpenRL-Lab/TiZero#evaluate-jidi-submissions-locally).
