# Inverse Reinforcement Learning for Autonomous Navigation via Differentiable Semantic Mapping and Planning
We assume that the following commands are run inside a Python virtual environment.
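If you do not already have a virtual environment, one way to create and activate one is sketched below (assuming a Unix-like shell; the `venv` directory name is just an example):

```sh
python3 -m venv venv        # create a virtual environment in ./venv (name is arbitrary)
source venv/bin/activate    # activate it (bash/zsh; use venv\Scripts\activate on Windows)
python -m pip --version     # confirm pip now resolves inside the environment
```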
- Install gym-minigrid from this fork:

  ```sh
  git clone git@github.com:tianyudwang/Minigrid.git
  cd Minigrid
  pip install -e .
  cd ..
  ```

- Download pybind11:

  ```sh
  git clone git@github.com:pybind/pybind11.git
  ```

- Build the A* C++ code:

  ```sh
  cd astar_cpp
  mkdir build
  cd build
  cmake ..
  make
  ```

- Return to the project root and add the compiled library to your Python path:

  ```sh
  cd ../..
  export PYTHONPATH=./astar_cpp/lib:${PYTHONPATH}
  ```

- Generate the dataset:

  ```sh
  python3 scripts/expert_policy.py --grid_size 16
  ```

  The generated trajectories are saved under the demonstration/ folder.
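For intuition, the shortest-path search that the astar_cpp module implements in C++ can be sketched in plain Python. This is an illustrative stand-in, not the project's actual API; the grid encoding and function name here are assumptions:

```python
import heapq

def astar(grid, start, goal):
    """A* over a 4-connected occupancy grid (0 = free, 1 = obstacle).

    Returns the list of cells from start to goal, or None if unreachable.
    """
    def h(c):  # Manhattan distance: admissible heuristic on a 4-connected grid
        return abs(c[0] - goal[0]) + abs(c[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_set = [(h(start), 0, start)]   # (f = g + h, g, cell)
    g = {start: 0}
    parent = {}
    while open_set:
        _, cost, cur = heapq.heappop(open_set)
        if cur == goal:
            path = [cur]                # reconstruct by walking parents back
            while cur in parent:
                cur = parent[cur]
                path.append(cur)
            return path[::-1]
        if cost > g[cur]:
            continue                    # stale queue entry; a cheaper path was found
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nxt = (cur[0] + dr, cur[1] + dc)
            if 0 <= nxt[0] < rows and 0 <= nxt[1] < cols and grid[nxt[0]][nxt[1]] == 0:
                if cost + 1 < g.get(nxt, float("inf")):
                    g[nxt] = cost + 1
                    parent[nxt] = cur
                    heapq.heappush(open_set, (cost + 1 + h(nxt), cost + 1, nxt))
    return None

grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]
path = astar(grid, (0, 0), (2, 0))
print(path)  # routes around the blocked middle row: 7 cells from start to goal
```

The C++ build exists because this search runs in the inner loop of planning, where a compiled implementation is far faster than pure Python.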
- Run training:

  ```sh
  python3 scripts/train.py --grid_size 16
  ```

  The TensorBoard logs with training metrics are in the logs/ folder and can be viewed with

  ```sh
  tensorboard --logdir logs
  ```

- Run testing with the pretrained models in the trained_models/ folder:

  ```sh
  python3 scripts/test.py --grid_size 16
  ```

  This reports the success rate of rolling out the trained policy from each state.
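For intuition, the per-state success rate is just the fraction of rollouts from each start state that reach the goal. A minimal sketch with made-up outcomes (the actual test script's internals may differ):

```python
from collections import defaultdict

# Hypothetical rollout results: (start_state, reached_goal) pairs.
rollouts = [
    ((0, 0), True), ((0, 0), True), ((0, 0), False),
    ((3, 5), True), ((3, 5), True),
    ((7, 2), False), ((7, 2), False),
]

# Group outcomes by start state, then report the success fraction for each.
per_state = defaultdict(list)
for state, reached_goal in rollouts:
    per_state[state].append(reached_goal)

for state, results in sorted(per_state.items()):
    rate = sum(results) / len(results)
    print(f"start {state}: {rate:.0%} over {len(results)} rollouts")
```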
Here are some visualizations of test episodes in 64×64 maps:

If you find this work useful, please consider citing:

```bibtex
@article{Wang2021sirl,
  author  = {Wang, Tianyu and Dhiman, Vikas and Atanasov, Nikolay},
  title   = {Inverse Reinforcement Learning for Autonomous Navigation via Differentiable Semantic Mapping and Planning},
  journal = {arXiv preprint arXiv:2101.00186},
  year    = {2021}
}
```

