Jason Ma

Hi there! I'm a final-year PhD student at the UPenn GRASP Laboratory, where I am fortunate to be advised by Dinesh Jayaraman and Osbert Bastani. My research interests span robot learning and reinforcement learning, with an emphasis on foundation models for robotics. During my PhD, I have also spent time at Google DeepMind, NVIDIA AI, and Meta AI.

News:

• Upcoming talk at the Foundation Models for Interactive Robot workshop at RSS 2025!
• Selected as RSS Pioneers 2025!
• Goal-Contrastive Rewards and ZeroMimic accepted to ICRA 2025!
• Generative Value Learning (Spotlight) and Articulate-Anything accepted to ICLR 2025!

Selected recent talks:

I investigate core challenges of robot learning (such as reward/environment design, progress estimation, and self-improvement) that cannot be addressed by scaling robot data alone. To address these open problems, I have pioneered new algorithms for training and leveraging foundation models from internet data to teach robots new tasks. Specifically, my research has:

• introduced a new paradigm of scaling the test-time compute of large language models (LLMs) to automate sim-to-real robot learning (ICLR 2024, NVIDIA Top 10 Research Projects of 2023; RSS 2024; CoRL 2024 Oral)
• developed scalable and principled reinforcement learning algorithms that leverage human videos to learn visual rewards for self-improving robotic manipulation (ICLR 2023 Spotlight; ICRA 2024 Best Paper Finalist in Robot Vision, CoRL 2023 LEAP Workshop Best Paper; ICRA 2025)
• demonstrated how to best combine multi-modal learning and RL to train and utilize vision-language models (VLMs) for universal value functions (ICML 2023, ICLR 2025 Spotlight)

Many of these methods have been incorporated into production-level simulation and deep learning software, such as NVIDIA Isaac Sim and TorchRL, PyTorch's official reinforcement learning library.

Selected honors:

Selected media coverage:

Google Scholar • GitHub • Twitter

yechengma at gmail dot com

Some robots I have trained in my research:

Selected publications as first or last author; full list on Google Scholar.

(* indicates equal contribution, † indicates equal advising)

Vision-Language Models are In-Context Value Learners

Jason Ma*, Joey Hejna, Ayzaan Wahid, Chuyuan Fu, Dhruv Shah, Jacky Liang, Zhuo Xu, Sean Kirmani, Peng Xu, Danny Driess, Ted Xiao, Jonathan Tompson, Osbert Bastani, Dinesh Jayaraman, Wenhao Yu, Tingnan Zhang, Dorsa Sadigh, Fei Xia
International Conference on Learning Representations (ICLR), 2025
Webpage • Arxiv

Articulate-Anything: Automatic Modeling of Articulated Objects via Vision-Language Models

Long Le, Jason Xie, William Liang, Hung-Ju Wang, Yue Yang, Jason Ma, Kyle Vedder, Arjun Krishna, Dinesh Jayaraman, Eric Eaton
International Conference on Learning Representations (ICLR), 2025
Webpage • Arxiv • Code

Eurekaverse: Environment Curriculum Generation via Large Language Models

William Liang, Sam Wang, Hungju Wang, Osbert Bastani, Dinesh Jayaraman†, Jason Ma†
Conference on Robot Learning (CoRL) (Oral), 2024
Webpage • Arxiv • Code

On-Robot Reinforcement Learning with Goal-Contrastive Rewards

Ondrej Biza, Thomas Weng, Lingfeng Sun, Karl Schmeckpeper, Tarik Kelestemur, Jason Ma†, Robert Platt†, Jan-Willem van de Meent†, Lawson L. S. Wong†
International Conference on Robotics and Automation (ICRA), 2025
Arxiv

DrEureka: Language Model Guided Sim-To-Real Transfer

Jason Ma*, William Liang*, Hungju Wang, Sam Wang, Yuke Zhu, Linxi "Jim" Fan, Osbert Bastani, Dinesh Jayaraman
Robotics: Science and Systems (RSS), 2024
Webpage • Arxiv • Code

Eureka: Human-Level Reward Design via Coding Large Language Models

Jason Ma, William Liang, Guanzhi Wang, De-An Huang, Osbert Bastani, Dinesh Jayaraman, Yuke Zhu, Linxi "Jim" Fan†, Anima Anandkumar†
International Conference on Learning Representations (ICLR), 2024
NVIDIA Top 10 Research Projects of 2023
Webpage • Arxiv • Code

Universal Visual Decomposer: Long-Horizon Manipulation Made Easy

Charles Zhang*, Yunshuang Li*, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Jason Ma†, Luca Weihs†
International Conference on Robotics and Automation (ICRA) (Best Paper Finalist), 2024
Best Paper Award, CoRL 2023 LEAP Workshop
Webpage • Arxiv • Code

LIV: Language-Image Representations and Rewards for Robotic Control

Jason Ma, Vikash Kumar, Amy Zhang, Osbert Bastani, Dinesh Jayaraman
International Conference on Machine Learning (ICML), 2023
Webpage • Arxiv • Code

VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar†, Amy Zhang†
International Conference on Learning Representations (ICLR) (Spotlight), 2023
Best Paper Finalist, NeurIPS 2022 Deep RL Workshop
Webpage • Arxiv • Code

How Far I'll Go: Offline Goal-Conditioned RL via f-Advantage Regression

Jason Ma, Jason Yan, Dinesh Jayaraman, Osbert Bastani
Neural Information Processing Systems (NeurIPS) (Nominated for Outstanding Paper), 2022
Webpage • Arxiv • Code

TOM: Learning Policy-Aware Models for MBRL via Transition Occupancy Matching

Jason Ma*, Kausik Sivakumar*, Jason Yan, Osbert Bastani, Dinesh Jayaraman
Learning for Dynamics and Control (L4DC), 2023
Webpage • Arxiv • Code

SMODICE: Versatile Offline Imitation from Observations and Examples

Jason Ma, Andrew Shen, Dinesh Jayaraman, Osbert Bastani
International Conference on Machine Learning (ICML), 2022
Webpage • Arxiv • Code

CAP: Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Jason Ma*, Andrew Shen*, Osbert Bastani, Dinesh Jayaraman
AAAI Conference on Artificial Intelligence (AAAI), 2022
Arxiv • Code

Likelihood-Based Diverse Sampling for Trajectory Forecasting

Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani
International Conference on Computer Vision (ICCV), 2021
Arxiv • Code

Conservative Offline Distributional Reinforcement Learning

Jason Ma, Dinesh Jayaraman, Osbert Bastani
Neural Information Processing Systems (NeurIPS), 2021
Arxiv • Code

2025

Robotics Institute, CMU, April 2025

EECS Special Seminar, MIT, April 2025

Frontiers in Computer Science, Caltech, Feb 2025

Frontiers in Electrical Engineering, Caltech, Feb 2025

2024

MIT Embodied Intelligence Seminar

Brown Robotics Seminar

USC

The AI Institute

Stanford ILIAD Lab

Amazon Robotics

Stanford Vision and Learning Lab

University of Michigan

2023

MIT IAI Lab

UIUC Robot Learning Seminar

Northwestern Ability Lab

Johns Hopkins University Neuro AI

HKUST Info. Hub Seminar

MILA Robot Learning Seminar

Tsinghua University Yang Gao Lab

UT Austin MIDI Group

Intel AI Seminar

2022

University of Edinburgh RL Seminar

MILA RL Seminar

UPenn GRASP SFI Seminar

Guest Lecture at UPenn CIS 519: Applied Machine Learning

• 2024: Co-Organizer, RSS Workshop on Task Specification for General-Purpose Intelligent Robots
• 2023: Co-Organizer, NeurIPS Workshop on Goal-Conditioned Reinforcement Learning
• 2023: Co-Organizer, GRASP Student, Faculty, and Industry (SFI) Seminar
• 2021+: Reviewer, NeurIPS, ICML, ICLR, AAAI, ICRA, IROS, RA-L, CoRL

I am looking to mentor highly motivated students to work on research projects all year long. I especially encourage students from underrepresented groups to get involved! If you are interested, please send me an email with your CV and your research interests.

Current

William Liang

Johnny Wang

Sam Wang

Past

Kausik Sivakumar

Charles Zhang

Yunshuang Li

Vaidehi Som

Andrew Shen

Jason Yan