2025-02-21 ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model Zhongyi Zhou et.al. 2502.14420 null 2025-02-21 VLAS: Vision-Language-Action Model With Speech ...
Authors: Qihang Zhang, Shuangfei Zhai, Miguel Angel Bautista, Kevin Miao, Alexander Toshev, Joshua Susskind, Jiatao Gu ...