Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

the value for the available_actions #71

Open
EddyJason opened this issue Mar 20, 2023 · 3 comments
Open

the value for the available_actions #71

EddyJason opened this issue Mar 20, 2023 · 3 comments

Comments

@EddyJason
Copy link

why the value of the available_actions is the array of (12800, 5) and the value of each row is [1, 1, 1, 1, 1]. 1 simply indicate whether or not it simply indicates whether or not it is moving, but does not consider the direction of movement.

@zoeyuchao
Copy link
Member

Okay. You have created so many issues and I can not figure out which one I should answer, could you summarize your questions clearly? Chinese is okay.

@EddyJason
Copy link
Author

EddyJason commented Mar 30, 2023

我想知道动作空间中0与1的区别,0是否代表不动,1是否代表移动
还有.sh文件的运行,经过terminal运行,但是程序报错无法找到模型
还有一个新的问题:
def observation(self, agent, world):
# goal positions
# goal_pos = [np.zeros(world.dim_p), np.zeros(world.dim_p)]
# if agent.goal_a is not None:
# goal_pos[0] = agent.goal_a.state.p_pos - agent.state.p_pos
# if agent.goal_b is not None:
# goal_pos[1] = agent.goal_b.state.p_pos - agent.state.p_pos
# goal color
goal_color = [np.zeros(world.dim_color), np.zeros(world.dim_color)]
# if agent.goal_a is not None:
# goal_color[0] = agent.goal_a.color
if agent.goal_b is not None:
goal_color[1] = agent.goal_b.color

    # get positions of all entities in this agent's reference frame
    entity_pos = []
    for entity in world.landmarks:  # world.entities:
        entity_pos.append(entity.state.p_pos - agent.state.p_pos)
    # entity colors
    entity_color = []
    for entity in world.landmarks:  # world.entities:
        entity_color.append(entity.color)
    # communication of all other agents
    comm = []
    for other in world.agents:
        if other is agent:
            continue
        comm.append(other.state.c)
    return np.concatenate([agent.state.p_vel] + entity_pos + [goal_color[1]] + comm)

在观测空间中,为什么返回的是p_vel而不是p_pos

@EddyJason
Copy link
Author

EddyJason commented Apr 10, 2023

一个新的问题是:
为什么产生64个不同的世界,只产生一个世界,直接计算轨迹不是更快吗?
为什么动作空间是5位数组?其中四位应该是东南西北,但是第五位是序号吗?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants