In-situ Value-aligned Human-Robot Interactions with Physical Constraints

Hongtao Li1, 2, Ziyuan Jiao2, Xiaofeng Liu1,$^\dagger$, Hangxin Liu2,$^\dagger$, Zilong Zheng2,$^\dagger$
1College of Artificial Intelligence and Automation, Hohai University
2State Key Laboratory of General Artificial Intelligence, BIGAI, Beijing, China

$^\dagger$Indicates the Corresponding Authors

We introduce In-Context Learning from Human Feedback (ICLHF) algorithm that learns human preferences in situ and combines them with physical constraints to accomplish the task.

Abstract

Equipped with Large Language Models (LLMs), human-centered robots are now capable of performing a wide range of tasks that were previously deemed challenging or unattainable. However, merely completing tasks is insufficient for cognitive robots, who should learn and apply human preferences to future scenarios. In this work, we propose a framework that combines human preferences with physical constraints, requiring robots to complete tasks while considering both. Firstly, we developed a benchmark of everyday household activities, which are often evaluated based on specific preferences. We then introduced In-Context Learning from Human Feedback (ICLHF), where human feedback comes from direct instructions and adjustments made intentionally or unintentionally in daily life. Extensive sets of experiments, testing the ICLHF to generate task plans and balance physical constraints with preferences, have demonstrated the efficiency of our approach.

Approach


A Brief Overview of ICLHF Workflow

ICLHF algorithm includes five steps:
  1. Make initial plan with user's instruction and summarized preference
  2. Three process for the preliminary planning of LLMs
  3. Use POG, an algorithm for efficient sequential manipulation planning on the scene graph, to refine the details of the planning
  4. Execute the plan and get physical feedback
  5. Reflect human preference and update the plan


A Simulated Demonstration Case

Results


Simulation Experiments

Messy Physical Preference Physical + Preference


Real Robot Experiments

BibTeX

BibTex Code Here