[IROS 2024] Public code and model for IR2: Implicit Rendezvous for Robotic Exploration Teams under Sparse Intermittent Connectivity.
Public Website: https://ir2-explore.github.io/
We present IR2, a deep reinforcement learning approach to information sharing for multi-robot exporation under communication constraints. Leveraging attention-based neural networks and hierarchical graph formulation, robots can effectively balance the longer-term trade-offs between disconnecting for solo exploration and reconnecting for information sharing.
We show that our model trained in this python simulation can generalize well to complex large-scale environments, such as those in Gazebo simulation and real world experiments. We hope that our work will be a useful learning-based benchmark for future research.
This demonstration showcases 4 robots exploring in an unknown Complex map under line-of-sight signal strength communication constraints. The top gif illustrates the global map and robot positions assuming no communication constraints. Conversely, the bottom 4 gifs illustrates the individual robots' map and position beliefs subjected to communication constraints.
If this GIF is taking too long to load, you may view the demonstration here.
This repository was tested using the following dependencies. Newer version of these packages may work as well.
python == 3.8pytorch == 1.10.0ray == 1.10.0scikit-image == 0.19.3scikit-learn == 1.2.1scipy == 1.10.0matplotlib == 3.6.3tensorboard == 2.8.0
- Set training parameters in
parameters.py. - Run python
driver.py.
- Set inference parameters in
test_parameters.py. - Run
test_driver.py.
parameter.pyTraining parameters.driver.pyDriver of training program, maintain & update the global network.runner.pyWrapper of the local network.multi_robot_worker.pyInteract with environment and collect episode experience.model.pyDefine attention-based network.env.pyAutonomous exploration environment.graph_generator.pyGenerate and update the collision-free graph.graph.pyGraph definition and utilities.node.pyInitialize and update nodes in the coliision-free graph.sensor.pySimulate the sensor model of Lidar.robot.pyActs as a replay buffer.ss_realistic_model.pyRealistic signal strength communication model./modelTrained model./DungeonMapsMaps of training environments.
If you intend to use our work in your research, please cite the following publication:
@inproceedings{tan2024ir,
title={Ir 2: Implicit rendezvous for robotic exploration teams under sparse intermittent connectivity},
author={Tan, Derek Ming Siang and Ma, Yixiao and Liang, Jingsong and Chng, Yi Cheng and Cao, Yuhong and Sartoretti, Guillaume},
booktitle={2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
pages={13245--13252},
year={2024},
organization={IEEE}
}Derek Ming Siang Tan
Yixiao Ma
Jingsong Liang
Yi Cheng Chng
Yuhong Cao
Guillaume Sartoretti
