Yilin Wu 吴怡琳

yilinwu.jpg

PhD student @ RI, SCS, CMU

yilinwu [at] andrew [dot] cmu [dot] edu

Email is the best way to contact me!

I am a third-year Ph.D. student at Intent Lab in CMU Robotics Institute, advised by Prof. Andrea Bajcsy. I received my Master from Stanford CS, supervised by Prof. Dorsa Sadigh. In my undergrad, I was also fortunate to work with Prof. Lerrel Pinto and Prof. Pieter Abbeel.

For industry experiences, I have had great internships at Nvidia Seattle Robotics Lab and the Amazon Frontier AI & Robotics Lab.

My research interest lies in building better foundation models for robotics (fast, multi-modal world models) and leveraging those foundation models for inference-time adaptation (policy steering, etc.). I am also broadly interested in enhancing robots’ capabilities for more complex, long-horizon contact-rich manipulation with imitation learning, reinforcement learning, and multi-modal sensing.

Earlier at CMU, I was fortunate to work with Prof. David Held on generalizable methods for long-horizon contact-rich manipulation. During my Master at Stanford, I focused on assistive feeding and bimanual manipulation. I also worked with Prof. Yi Wu from Tsinghua University at Shanghai Qi Zhi Institute on reinforcement learning and self-imitation, and in my undergrad worked on reinforcement learning for deformable object manipulation.

More broadly, my research centers around overcoming the fundamental embodiment gap that limits the application of foundation models to robotics. By grounding high-level semantic reasoning in the continuous dynamics of the physical world, I want to build robots that can learn, reason, and act capably in human-centered environments, enhancing their generalization and robustness through runtime alignment and continual learning.

News

Jun 2026 SEAL received the Best Paper Award at the CVPR MMRAGI Workshop (Multi-modal Reasoning for Agentic Intelligence)!
Jun 2026 Gave a spotlight talk about my recent two works UPS (Uncertainty-aware Policy Steering) and ViTaL (Inference-time Policy Steering via Vision and Touch).
May 2026 Started my summer internship at the Amazon Frontier AI & Robotics Team!
Apr 2026 One paper Uncertainty-Aware Policy Steering (UPS) has been accepted at RSS 2026.
Jan 2026 Excited to share my work from my internship at Nvidia (Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification) is accepted to ICRA 2026!
Nov 2025 Excited to give a talk about my research on Language-Guided Runtime Steering with Robot Foundation Models at Duke DexLab!
Oct 2025 Excited to give a talk about my research on Language-Guided Runtime Steering with Robot Foundation Models at UT Austin Robin Lab!
Jul 2025 Excited to give a talk about my FOREWARN paper at Robotics Team in Meta FAIR !
Jun 2025 One paper From Foresight to Forethought: VLM-in-the-loop Policy Steering via Latent Alignment got accepted to RSS 2025.
May 2025 Excited to start my summer internship in Nvidia Seattle Robotics Lab!
Jun 2024 One paper Learning Generalizable Tool-use Skills through Trajectory Generation got accepted at IROS 2024.
May 2024 Open X-Embodiment wins the Best Paper Award at ICRA 2024.
May 2024 Two papers DROID and HACMan++ are accepted by RSS 2024.
Sep 2023 Our work on bimanual manipulation inspired by human coordination is accepted to CoRL 2023 as Oral presentation.
Aug 2023 Starting my Ph.D study at CMU Robitic Institute.

Publications

  1. arxiv
    wu_2026_vital.gif
    Inference-time Policy Steering via Vision and Touch
    Yilin Wu, Zilin Si, Zeynep Temel, Oliver Kroemer, and Andrea Bajcsy
    In arxiv, 2026
  2. arxiv
    wu_2026_weaver.gif
    𝚆𝙴𝙰𝚅𝙴𝚁, Better, Faster, Longer: An Effective World Model for Robotic Manipulation
    Arnav Kumar Jain*Yilin Wu*, Jesse Farebrother, Gokul Swamy, and Andrea Bajcsy
    In arxiv, 2026
  3. icml
    reward.png
    Position: Good Embodied Reward Models Need Bad Behavior Data
    Ran TianYilin Wu, and Andrea Bajcsy
    In International Conference on Machine Learning (ICML), 2026
    Spotlight top 5%
  4. rss
    ups.gif
    When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering
    Jessie Yuan*Yilin Wu*, and Andrea Bajcsy
    In Robotics: Science and Systems (RSS), 2026
  5. ICRA
    seal.gif
    Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification
    Yilin Wu , Anqi Li, Tucker Hermans, Fabio Ramos, Andrea Bajcsy*, and Claudia P’erez-D’Arpino*
    In 2026 IEEE International Conference on Robotics and Automation (ICRA), 2026
    Outstanding Paper Award at CVPR 2026 2nd Multimodal Reasoning For Agentic Intelligence Workshop
  6. RSS
    forewarn.gif
    From Foresight to Forethought: VLM-In-the-Loop Policy Steering via Latent Alignment
    Yilin WuRan TianGokul Swamy, and Andrea Bajcsy
    In Robotics: Science and Systems (RSS), 2025
    Outstanding Paper Award at ICLR 2025 World Model Workshop
  7. arxiv
    rapl_2024.gif
    Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment
    In submission to the International Journal of Robotics Research’s (IJRR) special issue on Foundation Models and Neural-Symbolic AI for Robotics, 2024
  8. IROS
    toolgen.gif
    Learning Generalizable Tool-use Skills through Trajectory Generation
    In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
  9. RSS
    hacman.gif
    HACMan++: Spatially-Grounded Motion Primitives for Manipulation
    In Robotics: Science and Systems (RSS), 2024
  10. RSS
    droid.gif
    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
    DROID Dataset Team
    In Robotics: Science and Systems (RSS), 2024
  11. ICRA
    openx.gif
    Open X-Embodiment: Robotic Learning Datasets and RT-X Models
    Open X-Embodiment Team
    In IEEE International Conference on Robotics and Automation (ICRA), 2024
    Best Paper Award
  12. CoRL
    buds.gif
    Stabilize to Act: Learning to Coordinate for Bimanual Manipulation
    In Proceedings of the 7th Conference on Robotic Learning (CoRL), 2023
    Oral Presentation [6.6%]
  13. ICRA
    bite_transfer.gif
    In-Mouth Robotic Bite Transfer with Visual and Haptic Sensing
    In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023
  14. CoRL
    carbs_after.gif
    Learning Bimanual Scooping Policies for Food Acquisition
    In Proceedings of the 6th Conference on Robot Learning (CoRL), 2022
  15. ICLR
    sir_fetch.gif
    Solving Compositional Reinforcement Learning Problems via Task Reduction
    Yunfei LiYilin WuHuazhe XuXiaolong Wang, and Yi Wu
    In International Conference on Learning Representations (ICLR), 2021
  16. RSS
    mvp.gif
    Learning to Manipulate Deformable Objects without Demonstrations
    In Robotics: Science and Systems, (RSS), 2020