Learning-Based Robotics: A Complete Guide
Published:

What is a Robot? The Sense-Plan-Act Loop
A robot is more than a mechanical arm or a wheeled platform. Formally, a robot is an embodied agent that (1) perceives its environment through sensors, (2) processes that information to decide what to do, and (3) executes actions that physically change the world.
This sense-plan-act cycle repeats continuously at frequencies ranging from 1 Hz for high-level task planning to 1 kHz for low-level torque control. The three components map directly to the main engineering disciplines:
- Sensing: cameras, LiDAR, IMUs, force/torque sensors, proprioceptive joint encoders.
- Planning: trajectory planning, task planning, SLAM-based localisation and mapping.
- Acting: motor control, PID loops, impedance controllers, whole-body controllers.
Classical robotics treated these as separate, modular blocks with clean interfaces. The learning revolution blurs these boundaries: a neural network that takes raw pixels and outputs joint torques combines all three phases in a single forward pass.
The Learning Revolution in Robotics
Before deep learning, robot behaviours were programmed explicitly. A manipulation policy might consist of thousands of hand-crafted rules covering every anticipated scenario. This approach is brittle: the world is too complex and variable for exhaustive enumeration.
The turning point came with the application of deep neural networks to perception (2012 ImageNet), reinforcement learning to games (2015 DQN, 2016 AlphaGo), and eventually to physical robots. Key milestones include:
- 2016: OpenAI and Google DeepMind demonstrate RL for robotic manipulation in simulation.
- 2019: OpenAIโs Dactyl solves a Rubikโs Cube with a dexterous five-fingered hand trained entirely in simulation.
- 2022: RT-1 trains a Transformer on 130,000 real robot episodes and generalises to novel tasks.
- 2023: Diffusion Policy and RT-2 demonstrate that expressive generative models can represent rich, multi-modal action distributions.
Key Challenges
Despite rapid progress, several fundamental obstacles remain:
Safety and reliability. Robots operate in the physical world where failures have real consequences โ a dropped object, a collision with a human, or an uncontrolled fall. Standard RL maximises expected reward with no formal safety guarantees. Constrained MDPs, barrier functions, and safe exploration algorithms address this but remain research frontiers.
Sample efficiency. Real-robot experiments are slow (1ร real-time), expensive (wear and hardware failures), and hard to parallelise. A simulated game environment provides millions of steps per second; a physical robot provides thousands per day. Sim-to-real transfer, offline RL, and data-efficient learning are active mitigation strategies.
Generalisation. A robot trained to grasp objects on a white table may fail on a cluttered desk. Bridging distribution shift between training and deployment environments requires diverse training data, domain randomisation, and robust representations.
Sim-to-real gap. Simulators are imperfect models of the real world. Contact dynamics, friction, and deformable objects are notoriously hard to simulate accurately, causing policies trained in simulation to fail when deployed on real hardware.
Book Structure
This book is organised into five thematic sections:
- Foundations: kinematics, sensors, and control โ the classical building blocks every roboticist must know.
- Perception: 3D vision, object detection, pose estimation โ how robots understand their environment.
- Planning: SLAM, path planning, task and motion planning โ how robots decide what to do.
- Learning: imitation learning, deep RL for manipulation and locomotion, sim-to-real, hierarchical RL โ data-driven approaches to behaviour.
- Frontier: diffusion policy, foundation models, language-conditioned robots, safety, and open problems โ where the field is heading.
References
- Siciliano, B., Sciavicco, L., Villani, L., & Oriolo, G. (2009). Robotics: Modelling, Planning and Control. Springer.
- Kober, J., Bagnell, J. A., & Peters, J. (2013). Reinforcement learning in robotics: A survey. The International Journal of Robotics Research, 32(11), 1238โ1274.
- Billard, A., & Kragic, D. (2019). Trends and challenges in robot manipulation. Science, 364(6446).
