Kun Lei

RL learner for Robotics
← Return to menu

Blogs

2025.11
A Practical Way to Classify RL: by Data Source and by Update Schedule.
2025.10
A three-stage framework from imitation to iterative offline and last-mile online RL that achieves near-perfect success with high efficiency and robustness.