Humanoidbench RL#

We provide a basic RL training example for Humanoidbench tasks.

RL framework: stable-baselines3

RL algorithm: PPO

Simulator: MuJoCo and IsaacGym and IsaacLab

Installation#

pip install stable-baselines3
pip install wandb
pip install tensorboard

Wandb login, enter your wandb account token.

wandb login

NOTE: You can add --use_wandb --wandb_entity <your_wandb_entity_name> to use wandb to log the training process.

MuJoCo:

python roboverse_learn/humanoidbench_rl/train_sb3.py --sim mujoco --num_envs 1 --robot=h1 --task humanoidbench:Stand

IsaacGym:

python roboverse_learn/humanoidbench_rl/train_sb3.py --sim isaacgym --num_envs 2 --robot=h1 --task humanoidbench:Stand

IsaacLab:

python roboverse_learn/humanoidbench_rl/train_sb3.py --sim isaaclab --num_envs 2 --robot=h1 --task humanoidbench:Stand