Robots used to explore by following simple rules or short-term rewards, which often made them waste time and backtrack a lot.
Robots usually learn by copying many demonstrations, which is expensive and makes them brittle when things change.