Log inSign up
Jiafei Duan
1,812 posts
Image
user avatar
Jiafei Duan
@DJiafei
Assistant Professor at @NUScomputing| Robotics & AI PhD @uwcse| Host of @RoboPapers| Ex-@allen_ai, @NVIDIA my opinion is my alone.
Seattle, WA
jiafei1224.github.io
Joined January 2021
1,095
Following
6,253
Followers

New to X?

Sign up now to get your own personalized timeline!

Create account

By signing up, you agree to the Terms of Service and Privacy Policy, including Cookie Use.

TermsΒ·PrivacyΒ·CookiesΒ·AccessibilityΒ·Ads InfoΒ·Β© 2026 X Corp.
Don't miss what's happening
People on X are the first to know.
Log inSign up
  • Pinned
    user avatar
    Jiafei Duan
    @DJiafei
    May 5
    Most capable generalist robotics models today are closed or at best, open weights. But robotics won’t reach its ChatGPT moment without real openness. That GPT moment was built on years of open tools and datasets such as Python, PyTorch, ImageNet and more, that let researchers
    Image
    00:00
    Image
    01:41
    user avatar
    Ai2
    @allen_ai
    May 5
    Robotics models often struggle outside controlled environments. Ours is built to work in real ones. Today we're launching MolmoAct 2, which can assist with a host of chores & lab tasks, plus the MolmoAct 2-Bimanual YAM datasetβ€”the largest open robotics dataset of its kind. 🧡
    106K
  • user avatar
    Jiafei Duan
    @DJiafei
    Feb 9, 2025
    I am always activately looking for talented, passionate and hardworking students to work/collaborate/mentor with. Anyone interested, feel free to sign up here: forms.gle/oJPLR2pLTt8kLC…
    69K
  • user avatar
    Jiafei Duan
    @DJiafei
    Aug 12, 2025
    Reasoning is central to purposeful action. Today we introduce MolmoAct β€” a fully open Action Reasoning Model (ARM) for robotics. Grounded in large-scale pre-training with action reasoning data, every predicted action is interpretable and user-steerable via visual trace. We are
    Image
    00:00
    100K
  • user avatar
    Jiafei Duan
    @DJiafei
    Jan 30, 2025
    Can we build a generalist robotic policy that doesn’t justΒ memorizeΒ training data and regurgitate it during test time, but insteadΒ remembersΒ past actions as memory and conditions its decisions on them?πŸ€–πŸ’‘ IntroducingΒ SAM2Actβ€”aΒ multi-view robotic transformer-based policyΒ that
    Image
    00:00
    111K
  • user avatar
    Jiafei Duan
    @DJiafei
    Jan 20, 2025
    Every time I watch this video, I can't help but wonder: why don't we have robot butlers in our homes yet? The hardware seemed capable 14 years ago with teleoperation. Is it just the "robot brain" we're missing, or is there more to the puzzle?
    Image
    00:00
    47K
  • user avatar
    Jiafei Duan
    @DJiafei
    Jun 28, 2024
    πŸš€ Excited to share our latest work: MANIPULATE-ANYTHING! 🦾 This scalable method pushes the boundaries of real-world robotic manipulation through zero-shot task execution and automated BC data generation. Here's a quick overview:πŸ‘‡ robot-ma.github.io
    Image
    00:00
    55K
  • user avatar
    Jiafei Duan
    @DJiafei
    Sep 6, 2025
    We have now open-source the checkpoints from all training stages along with the full training and fine-tuning code. Check it out here:
    user avatar
    Jiafei Duan
    @DJiafei
    Aug 12, 2025
    Reasoning is central to purposeful action. Today we introduce MolmoAct β€” a fully open Action Reasoning Model (ARM) for robotics. Grounded in large-scale pre-training with action reasoning data, every predicted action is interpretable and user-steerable via visual trace. We are
    Image
    00:00
    Image
    GitHub - allenai/molmoact: Official Repository for MolmoAct
    From github.com
    22K
  • user avatar
    Jiafei Duan
    @DJiafei
    Feb 14, 2024
    For large-scale robotic deploymentπŸ€– in the real-world 🌏, robots must adapt to changes in environment and objects. Ever questioned the generalizability of your robot's manipulation policy? Put it to the test with The Colosseum πŸ›οΈ. Check out our project: robot-colosseum.github.io
    Image
    00:00
    37K
  • user avatar
    Jiafei Duan
    @DJiafei
    Jun 18, 2024
    Humans use pointing to communicate plans intuitively. Compared to language, pointing gives more precise guidance to robot behaviors. Can we teach a robot how to point like humans? Introducing RoboPoint πŸ€–πŸ‘‰, an open-source VLM instruction-tuned to point. Check out our new work:
    Image
    00:00
    64K
  • user avatar
    Jiafei Duan
    @DJiafei
    Dec 31, 2024
    πŸš€πŸ€– Top 10 Robot Learning Papers of 2024 is out!πŸ”₯ With 2 rounds of nominations & voting, plus 330+ individual votes, these standout papers shine across diverse categories: 1️⃣ Ο€0: Vision-Language-Action Flow Model for General Robot Control 2️⃣ Closed-Loop Open-Vocabulary Mobile
    30K
  • user avatar
    Jiafei Duan
    @DJiafei
    Jan 11, 2025
    πŸ“’We are currently hiring for multiple roles at the @Ai2Prior @allen_ai πŸš€ to build next-generation multimodal large language models πŸ€–, foundation models for robotics 🦾, and embodied AI 🧠. If open science research in these areas excites you πŸ”¬βœ¨, please apply here: Research
    23K
  • user avatar
    Jiafei Duan
    @DJiafei
    Feb 10, 2025
    🚨 Why do robots fail under out-of-distribution perturbations? Can we diagnose these failures in advanceβ€”andΒ 'prescribe' the right data to fix them? 🚨 Our new paper,Β RoboMDΒ introduces a systematic framework for diagnosing and improving robot manipulation policies. πŸ€–πŸ’‘
    Image
    00:00
    13K
  • user avatar
    Jiafei Duan
    @DJiafei
    Mar 14, 2025
    Exciting news! This summer, I’ll be joining @allen_ai as a Research Scientist Intern, working on scaling robotics foundation models. Looking forward to that!
    Image
    GIF
    13K
  • user avatar
    Jiafei Duan
    @DJiafei
    Sep 4, 2025
    Robot paper of the day: RoboBallet: Planning for multirobot reaching with graph neural networks and reinforcement learning UCL + Google DeepMind + Intrinsic built an AI planner that choreographs teams of arms to work in tight spaces without collisionsβ€”planning in seconds, not
    Image
    00:00
    15K
This post is unavailable.