• Home
  • Current congress
  • Public Website
  • My papers
  • root
  • browse
  • IAC-23
  • B6
  • 2
  • paper
  • A learning-based robotic refueling control system for on-orbit service

    Paper number

    IAC-23,B6,2,12,x77497

    Author

    Prof. Yong Chun Xie, Beijing Institute of Control Engineering, China Academy of Space Technology (CAST), China

    Coauthor

    Dr. Yong Wang, Beijing Institute of Control Engineering, China Academy of Space Technology (CAST), China

    Coauthor

    Mr. Linfeng Li, Beijing Institute of Control Engineering, China Academy of Space Technology (CAST), China

    Coauthor

    Mrs. Ao Chen, Beijing Institute of Control Engineering, China Academy of Space Technology (CAST), China

    Coauthor

    Dr. Na Yao, Qian Xuesen Laboratory of Space Technology, China Academy of Space Technology (CAST), China

    Year

    2023

    Abstract
    On-orbit service (OOS) is a type of operation that the human or robot server carries out on the client spacecraft in the space environment. Typical OOS includes propellant replenishment, component replacement, maintenance, etc. Among them, robotic refueling is a long-studied yet promising technology to increment a spacecraft's service duration. The main difficulty in space robotic refueling is the high requirement of manipulation versatility in an unstructured on-orbit environment. Learning-based methods, especially deep learning with highly expressive deep neural networks, can be leveraged to upgrade the autonomy of robotic refueling. In this paper, we design and implement an autonomous robotic refueling control system based on deep reinforcement leaming. This paper depicts the framework including intelligent perception, autonomous planning and visual servo control, and the physical experiment system. Validation in both simulation and real environments shows that the system is able to execute numerous basic manipulation tasks like lid opening, injector docking, and refueling. Robustness and adaptability to an active interaction with dynamic time-varying environments are also demonstrated. The paper has three main technical contributions: (1) Overcoming sparse reward by learning sequential generative sub-goals. The method shows generalization over a variety of elementary tasks for space robotic refueling including lid opening, injector docking, refueling, etc; (2) Design of a laboratory learning/training platform for space operation for efficient learning data sampling, which has high similarity with the on-orbit environment. The learned policy has a good property of Sim-to-Real(Sim2Real) transfer, verifying the possibility that further application in the practical on-orbit environment; (3) A sufficient validation of robustness to robot base/goal perturbation and luminance intensity change, meeting the practical needs in engineering.
    Abstract document

    IAC-23,B6,2,12,x77497.brief.pdf

    Manuscript document

    IAC-23,B6,2,12,x77497.pdf (🔒 authorized access only).

    To get the manuscript, please contact IAF Secretariat.