JTree Labs

We are an open robotics research organization working on full-stack robotic systems research and learning.

See our latest release

Multitask Diffusion Transformer Policy

Open-Source Release

This method combines the highly scalable Diffusion Transformer (DiT) architecture with the expressive Diffusion Policy representation for robot manipulation. A similar method was demonstrated on the Boston Dynamics Atlas Humanoid robot, where we saw no open-source release and significant appetite from the research community to understand this method.

This inspired us to research it, estimate the architecture from what literature is available, and build an implementation from scratch. This model achieved a high level of performance on a variety of manipulation tasks with only 10–20 hours of training on an H200.

We partnered with the HuggingFace team to release the model as an open-source policy integrated into the LeRobot project. You can find more details about the model, training, and how to use it in the links below.

Multitask Diffusion Transformer Policy demonstration