DoorBot: Closed-Loop Task Planning and Manipulation for Door Opening in the Wild with Haptic Feedback
This is the official repository for the paper: DoorBot: Closed-Loop Task Planning and Manipulation for Door Opening in the Wild with Haptic Feedback.
We propose DoorBot, a haptic-aware, closed-loop hierarchical control framework that enables robots to explore and open unseen doors in the wild. We test our system on 20 unseen doors across different buildings, featuring diverse appearances and mechanical types. Our framework achieves a 90% success rate, demonstrating its ability to generalize and robustly handle varied door-opening tasks.
The hardware setup and issues log for the RealMan Mobile Bimanual Humanoid Robot can be found in the RealMan Hardware Doc.
You can check the RealMan Software Doc to learn how to control different parts of the robot using the Python 3 API.
Visual appearance and configurations of our bimanual mobile robot.
DTSAM / RANSAC / GUM need to run on the server.
You can check server.py and cfg_server.yaml to set up the server.
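For orientation, here is a rough sketch of how a client might query the perception server once it is running. The endpoint route, config keys, and response fields below are assumptions for illustration, not the actual interface defined in server.py and cfg_server.yaml.

```python
# Hypothetical client-side query to the perception server.
# Route name, config keys, and response fields are assumptions.
import requests
import yaml

with open("cfg_server.yaml", "r") as f:
    cfg = yaml.safe_load(f)                        # assumed keys: host, port

url = f"http://{cfg['host']}:{cfg['port']}/dtsam"  # hypothetical route
with open("handle_rgb.jpg", "rb") as img:
    resp = requests.post(url, files={"image": img})
print(resp.json())                                 # e.g. handle mask / bounding box
```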
Then get the code onto the server:
conda create -n doorbot python=3.8
conda activate doorbot
cd /your/root/path/
git clone https://github.com/TX-Leo/DoorBot.git
You can check dtsam_package, ransac_package, and gum_package to set up the server-side dependencies.
Then, on the machine that controls the robot, set up the environment and get the code in the same way:
conda create -n doorbot python=3.8
conda activate doorbot
cd /your/root/path/
git clone https://github.com/TX-Leo/DoorBot.git
cd open_door
For testing:
python main.py -n 0 -t lever
It will execute the actions step by step.
For open-loop control:
python open_loop.py -n 0 -t lever
For closed-loop control:
python close_loop_SM_GUM.py -n 0 -t lever
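All three scripts take the same flags. As a minimal sketch (the flag meanings are inferred from the usage examples above, not taken from the actual scripts), they can be parsed as follows:

```python
# Hypothetical CLI shared by main.py / open_loop.py / close_loop_SM_GUM.py;
# the flag meanings are inferred from the usage examples above.
import argparse

parser = argparse.ArgumentParser(description="DoorBot door-opening run")
parser.add_argument("-n", type=int, default=0,
                    help="door / trial index (assumed meaning)")
parser.add_argument("-t", type=str, default="lever",
                    choices=["lever", "doorknob", "crossbar", "cabinet"],
                    help="handle type")
args = parser.parse_args()
print(f"Running door {args.n} with handle type: {args.t}")
```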
We design six motion primitives based on the key steps of opening doors and implement them through low-level controllers. This reduces the dimensionality of the action space and avoids reliance on extensive human expert data.
System Architecture of DoorBot. Our High-Level Planner.
The high-level planner (state machine) and the low-level controllers (6 primitives) are implemented in the primitive.py file.
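To illustrate how the pieces fit together, here is a minimal sketch of a haptic-aware state machine sequencing the primitives. The primitive names, signatures, and transitions are illustrative assumptions; the actual logic lives in primitive.py.

```python
# Illustrative closed-loop state machine over motion primitives.
# Primitive names, signatures, and transitions are assumptions; see
# primitive.py for the real implementation.
def run_door_opening(robot, handle_type):
    state = "grasp"
    while state != "done":
        if state == "grasp":
            ok = robot.grasp(handle_type)        # grasp pose refined by GUM
            state = "unlock" if ok else "grasp"  # retry on failure
        elif state == "unlock":
            # haptic feedback: stop once the elbow-joint current passes a threshold
            ok = robot.unlock(handle_type)
            state = "open" if ok else "grasp"
        elif state == "open":
            # the change in elbow-joint current indicates push- vs. pull-type doors
            ok = robot.open_door()
            state = "done" if ok else "unlock"
    return True
```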
You can check prompt.py for the textual prompts fed to the VLM, covering both the high-level and low-level parts.
GUM refines the model-based grasp pose prior for the grasp primitive, and simultaneously predicts the motion trajectory for unlocking. It takes RGB and mask images of the handle as input and outputs the adjusted grasp offset (dx, dy) and the unlock axis direction (R). The model is trained on a combination of internet data and real-world data. This allows it to generalize effectively to unseen scenarios.
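As described above, GUM maps an RGB image and a handle mask to a grasp offset (dx, dy) and an unlock axis direction (R). The toy network below only sketches that input/output interface; the architecture and preprocessing are assumptions, not the released model.

```python
# Toy sketch of GUM's interface: RGB + handle mask in, (dx, dy, R) out.
# The architecture and preprocessing are assumptions, not the released model.
import torch
import torch.nn as nn

class TinyGUM(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(                 # 4-channel input: RGB + mask
            nn.Conv2d(4, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Linear(32, 3)                   # predicts (dx, dy, R)

    def forward(self, rgb, mask):
        x = torch.cat([rgb, mask], dim=1)
        return self.head(self.backbone(x))

model = TinyGUM()
rgb = torch.rand(1, 3, 224, 224)                       # dummy RGB crop
mask = torch.rand(1, 1, 224, 224)                      # dummy handle mask
dx, dy, R = model(rgb, mask)[0]
```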
We create a dataset of 1,303 images featuring various door handles, collected from the Internet and real-world photos. The dataset includes four common handle types: lever handles, doorknobs, crossbars, and cabinet handles. Based on object masks generated by Detic and SAM, we manually label the appropriate grasp point and rotation parameters on the images.
- GUM Dataset (Original): 1,303 images
  - Internet: 766
  - Real-World: 537
- xxx.HEIC: RGB image
- xxx.jpg: RGB image
- xxx_mask.png: handle mask image
- xxx.json: bounding box info of handle (w,h,box,Cx,Cy,orientation); Grasp Offset (dx,dy); Unlock Axis (R)
- xxx.txt: box[0],box[1],box[2],box[3],dx,dy,R
- xxx_annotated.png: handle annotated image
The original dataset (before augmentation) can be found here (todo ...). You can run handle_data_augmentation.py for data augmentation.
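For reference, a quick sketch of reading one annotated sample following the file layout listed above. The file stem is a placeholder and the JSON structure is assumed rather than verified against the dataset.

```python
# Illustrative loader for one annotated sample, following the file layout above.
# The file stem is a placeholder; the JSON structure is assumed, not verified.
import json
from PIL import Image

sample = "0001"                                   # hypothetical file stem
rgb = Image.open(f"{sample}.jpg")                 # RGB image
mask = Image.open(f"{sample}_mask.png")           # handle mask

with open(f"{sample}.json") as f:
    ann = json.load(f)                            # box info, grasp offset, unlock axis

# the flattened .txt variant: box[0], box[1], box[2], box[3], dx, dy, R
with open(f"{sample}.txt") as f:
    x0, y0, x1, y1, dx, dy, R = map(float, f.read().strip().split(","))
```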
cd gum_package
python train.py
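As a rough sketch of the kind of objective train.py presumably optimizes (regressing the labeled dx, dy, and R from the image and mask), here is one assumption-laden training step; the model, data pipeline, and hyperparameters are not taken from the actual code.

```python
# Illustrative supervised training step for a (dx, dy, R) regressor.
# The model, data pipeline, and loss choice are assumptions, not train.py itself.
import torch
import torch.nn as nn

def train_step(model, optimizer, rgb, mask, target):
    """target: tensor of shape (B, 3) holding the labeled (dx, dy, R)."""
    optimizer.zero_grad()
    pred = model(rgb, mask)                        # (B, 3) prediction
    loss = nn.functional.mse_loss(pred, target)    # simple regression loss
    loss.backward()
    optimizer.step()
    return loss.item()
```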
You can find the trained checkpoints in checkpoints (todo ...).
Haptic feedback in three motion primitives. For unlock-lever and unlock-knob, a current threshold on the elbow joint tells the robot when to stop. For open, the increase or decrease of the current feedback on the elbow joint indicates whether the door is push-type or pull-type.
The haptic feedback is implemented in primitive.py and arm.py.
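Schematically, the logic looks like the sketch below. The arm interface (get_elbow_current, rotate_handle_step, pull_step) and the threshold value are hypothetical placeholders, and the sign-to-door-type mapping is an assumption; the actual implementation is in primitive.py and arm.py.

```python
# Schematic haptic-feedback logic. The arm interface, threshold, and
# sign-to-door-type mapping are assumptions; see primitive.py and arm.py.
UNLOCK_CURRENT_THRESHOLD = 2.0   # assumed units (A), tuned per robot

def unlock_with_haptics(arm):
    """Rotate the handle until the elbow-joint current exceeds a threshold."""
    while abs(arm.get_elbow_current()) < UNLOCK_CURRENT_THRESHOLD:
        arm.rotate_handle_step()
    arm.stop()

def detect_door_type(arm):
    """Infer push- vs. pull-type from the change in elbow-joint current."""
    before = arm.get_elbow_current()
    arm.pull_step()
    after = arm.get_elbow_current()
    return "pull" if after > before else "push"   # mapping assumed
```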
Our method consistently outperforms other combinations, showing an average success rate improvement from 50% to 90% across all manipulation tasks. None of the doors or handles appear in the training set, demonstrating our model's generalizability across different situations.
Examples of how GUM fixes bad grasp poses during our field tests.
With multi-modal feedback, our system can open a cabinet with an unknown unlocking direction via an explore-and-adapt strategy.