GRPO-robot

Environment:

conda create -n grpo-robot -c conda-forge python=3.7 pybox2d
conda activate grpo-robot
pip install "numpy<1.19.5" "gym<=0.25.2" torch matplotlib pygame seaborn tqdm
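
After installing, it can help to confirm that the versions actually picked up match the numpy and gym bounds in the pip command above. The following is a minimal sketch, not part of the repository: the names `PINS`, `parse_version`, `satisfies`, and `check_environment` are hypothetical helpers introduced here, and the version parsing is deliberately simple (numeric dotted versions only).

```python
import importlib

# Bounds copied from the pip command above: package -> (operator, bound).
# This table and the helper names below are illustrative assumptions.
PINS = {"numpy": ("<", (1, 19, 5)), "gym": ("<=", (0, 25, 2))}


def parse_version(text):
    """Turn '1.19.4' into (1, 19, 4).

    Keeps the leading digits of each dot-separated part and stops at the
    first part that has none (e.g. a 'dev0' suffix).
    """
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def satisfies(version_text, op, bound):
    """Return True if the version meets the '<' or '<=' bound."""
    version = parse_version(version_text)
    return version < bound if op == "<" else version <= bound


def check_environment():
    """Import each pinned package and report whether its version fits."""
    report = {}
    for name, (op, bound) in PINS.items():
        try:
            module = importlib.import_module(name)
        except Exception:  # missing or broken installation
            report[name] = "import failed"
            continue
        version_text = getattr(module, "__version__", None)
        if version_text is None:
            report[name] = "version unknown"
        elif satisfies(version_text, op, bound):
            report[name] = "ok"
        else:
            report[name] = "version outside bound"
    return report


if __name__ == "__main__":
    print(check_environment())
```

Run it inside the activated grpo-robot environment; any entry other than "ok" suggests the pip step should be repeated.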

Run:

# Train
python ./ppo_train_cartpole.py

# Visualize the trained policy model
python ./强化学习过程可视化-cartpole.py

Files:

doc
weights
.gitignore
README.md
grpo_test_cartpole.py
grpo_test_pendulum.py
grpo_train_cartpole.py
grpo_train_pendulum.py
ppo_test_cartpole.py
ppo_test_pendulum.py
ppo_train_cartpole.py
ppo_train_pendulum.py
requirements-envconfig-example.sh
requirements.txt
rl_utils.py
强化学习过程可视化-cartpole.py
强化学习过程可视化-pendulum.py
状态值函数可视化 x.py
解释性工具SHAP x.py
