Kun Lei

RL learner for Robotics

Blogs

2025.10
A three-stage framework that progresses from imitation learning to iterative offline RL and finally to last-mile online RL, achieving near-perfect success rates with high efficiency and robustness.