标签:retracing-rollout

WHALE来了,南大周志华团队做出更强泛化的世界模型

南京大学和南栖仙策的研究者们提出了WHALE(World models with beHavior-conditioning and retrAcing-rollout LEarning)框架,旨在学习可泛化的具身决策世界...