Robot Applications
Manipulation
Grasping
- SAC for 6-DOF grasp learning
- Image-based policy
- Curriculum learning
In-hand Manipulation
- Dexterous hand control
- DAPG (Dagger)
Locomotion
Walking
- PPO for quadruped control
- Reward shaping
- Domain randomization
Swimming/Flying
- Sim-to-real challenges
- Residual policies
Best Practices
- Curriculum learning: Start simple, increase difficulty
- Observation normalization: Improve training stability
- Action scaling: Match environment limits
- Domain randomization: Bridge sim-to-real gap
← Back to Index