You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When looking at the code of the step method of the SwimmerEnv I noticed that the velocity along the x axis used in the reward is computed from the positions:
However I would have expected the velocity to be taken directly from self.sim.data.qvel[0] but when looking at the values of self.sim.data.qvel[0] they are not equal to reward_fwd. Is there a good reason to compute the velocity from the positions and not use self.sim.data.qvel[0]?
PR #2762 is about to be merged, introducing V4 MuJoCo environments using new bindings and a dramatically newer version of the engine. If this issue still persists with the V4 ones, please create a new issue for it.
When looking at the code of the
step
method of theSwimmerEnv
I noticed that the velocity along the x axis used in the reward is computed from the positions:However I would have expected the velocity to be taken directly from
self.sim.data.qvel[0]
but when looking at the values ofself.sim.data.qvel[0]
they are not equal toreward_fwd
. Is there a good reason to compute the velocity from the positions and not useself.sim.data.qvel[0]
?There is a similar question on the MuJoCo forum.
The text was updated successfully, but these errors were encountered: