This paper presents a method that lets a humanoid robot find and pick up many different objects in new places using plain-language requests like 'grab the orange mug.'
EgoActor is a vision-language model that turns everyday instructions like 'Go to the door and say hi' into step-by-step, egocentric actions a humanoid robot can actually do.