視覚運動方策において固有受容状態は必要か？

なぜ重要か: 企業や社会への影響が見込まれ、一般メディアにも波及する可能性があります。

ソースを読む（export.arxiv.org）

arXiv:2509.18644v2 発表種別: replace-cross

要旨: 模倣学習に基づく視覚運動ポリシーは、ロボット操作において広く用いられており、通常、精密な制御のために視覚的観測と固有受容状態の両方が併用される。しかしながら、本研究において、この一般的な手法はポリシーを固有受容状態入力に過度に依存させること、ひいては訓練軌跡への過学習を引き起こし、空間的な汎化性能を低下させることを明らかにした。これに対し、本研究では、固有受容状態入力を除去し、視覚的観測のみを条件として行動を予測する状態自由ポリシーを提案する。状態自由ポリシーは、相対的なエンドエフェクタ動作空間で構築され、ここでは双方の広角手首カメラによって提供される、タスクに関連する全ての視覚的観測を確保する必要がある。実験結果は、状態自由ポリシーが状態ベースポリシーよりも大幅に優れた空間的汎化性能を実現することを示している。ピックアンドプレース、シャツ折り畳み、複雑な全身操作といった現実世界のタスクにおいて、複数のロボット形態にわたって、高さ方向の汎化における成功率は0%から85%、水平方向の汎化における成功率は6%から64%に向上した。さらに、データ効率とクロスエンボディメント適応においても利点があり、現実世界への展開における実用性を高めている。詳細はhttps://statefreepolicy.github.io を参照のこと。

原文（英語）を表示

Title (EN): Do You Need Proprioceptive States in Visuomotor Policies?

arXiv:2509.18644v2 Announce Type: replace-cross
Abstract: Imitation-learning-based visuomotor policies have been widely used in robot manipulation, where both visual observations and proprioceptive states are typically adopted together for precise control. However, in this study, we find that this common practice makes the policy overly reliant on the proprioceptive state input, which causes overfitting to the training trajectories and results in poor spatial generalization. On the contrary, we propose the State-free Policy, removing the proprioceptive state input and predicting actions only conditioned on visual observations. The State-free Policy is built in the relative end-effector action space, and should ensure the full task-relevant visual observations, here provided by dual wide-angle wrist cameras. Empirical results demonstrate that the State-free policy achieves significantly stronger spatial generalization than the state-based policy: in real-world tasks such as pick-and-place, challenging shirt-folding, and complex whole-body manipulation, spanning multiple robot embodiments, the average success rate improves from 0% to 85% in height generalization and from 6% to 64% in horizontal generalization. Furthermore, they also show advantages in data efficiency and cross-embodiment adaptation, enhancing their practicality for real-world deployment. Discover more by visiting: https://statefreepolicy.github.io.

Published: 2025-09-24 19:00 UTC

視覚運動方策において固有受容状態は必要か？

コメントする コメントをキャンセル

コメントするコメントをキャンセル