Enabling physical AI applications such as autonomous vehicles and robotics is challenging because of several factors, including data collection, model architecture design, and real-time inference. In this half-day tutorial we will focus on lessons and challenges encountered in developing state-of-the-art software and hardware solutions. These include tools like Isaac Sim and Isaac Lab, manipulation models such as GR00T and ACT, and hardware such as NVIDIA Jetson Thor for real-time inference when deploying models on real robots.
Through this tutorial, the speakers will demonstrate how attendees can create an end-to-end robotics pipeline that covers data capture and annotation in both simulation and the real world, model fine-tuning, and deployment on robotic systems for real-time inference. Attendees will receive deployment guides for each component of this pipeline along with slides, training datasets, and code repositories for take-home exercises.
Robotics and Physical AI have been a strong and growing topic at CVPR, especially as vision-language models (VLMs) and vision-language-action models (VLAs) have become key research areas in recent years. The community has developed several VLA models such as GR00T, π0, OpenVLA, and SmolVLA, as well as imitation-learning policies such as ACT.
However, building a complete robotics pipeline, from data collection to model training to deployment, remains a challenging multi-disciplinary endeavor. Data collection often requires expensive hardware and software, which has kept many researchers from pursuing this path. Foundation models require careful architecture design and post-training strategies. Deploying models on edge devices demands hardware-aware optimizations to achieve real-time performance.
This tutorial bridges these gaps by providing a hands-on, end-to-end walkthrough of the full Physical AI stack. By the end, attendees will understand the high-level frameworks, tools, and open-source community activities around robotics, embedded devices, and model training, so that researchers, industry partners, and communities worldwide can collaborate more effectively in this complex and growing field.
This schedule is tentative and may be refined as the program is finalized.
Johnny Núñez
Mitesh Patel
Qi Wang
Johnny Núñez
Johnny Núñez, Mitesh Patel, Qi Wang, and all organizers
Mitesh Patel
Qi Wang
Covers GR00T and physical AI foundation models: what they enable, how they connect perception, language, and action, and how research gets translated into usable robotics products and workflows.
Johnny Núñez
Covers deploying physical AI models on Jetson Thor: model optimization, runtime constraints, real-time inference, hardware-aware deployment, and practical lessons for robots outside the lab.
Developer Advocate
NVIDIA
Johnny is a developer advocate at NVIDIA focusing on Physical AI and Robotics. He brings experience in computer vision, edge computing, and robotics, drawing on his work at the University of Barcelona, especially on human-robot-object interactions. He is a key member of the Jetson Research Lab, driving AI and robotics on edge devices.
Sr. Developer Advocate Manager
NVIDIA
Mitesh is a Senior Developer Advocate Manager at NVIDIA. His team creates workflows for GPU-accelerated data science and Generative AI applications. He was previously a Senior Research Scientist at FXPAL and Yahoo! Labs. He holds a PhD in Robotics from the University of Technology Sydney.
Product Manager
NVIDIA
Qi is a Product Manager at NVIDIA focused on embodied AI, robotics, and physical AI foundation models. Qi's work centers on translating frontier AI research into real-world products by driving product strategy, cross-functional execution, and ecosystem partnerships across research and engineering teams. With a background spanning software engineering, autonomous systems, and product leadership, Qi brings a blend of technical depth and product thinking. Qi is particularly passionate about humanoid robotics, generalist AI agents, and building the platforms and workflows that make advanced AI usable in the real world.
Developer Advocate Manager
NVIDIA
Raymond is a developer advocate manager at NVIDIA focusing on robotics and embedded systems. Previously, he was the global lead of the Intel AI evangelist team and co-founded Meta, a Y Combinator-backed augmented reality company that raised over $80M. He holds a PhD and has spoken at TED Talks, SIGGRAPH, CVPR, NeurIPS, and more.
Sr. Technical Product Marketing Manager
NVIDIA
Chitoku is a Senior Technical Product Marketing Manager for the NVIDIA Jetson Edge AI platform. He works closely with the developer community to evangelize pre-trained AI models and SDKs on Jetson, including tutorials on JetBot and JetRacer. He previously worked at Sony Corporation in Tokyo.
Product Lead for Robotics
NVIDIA
Spencer is a product line manager at NVIDIA leading robotics software products. His work centers on open-source simulation frameworks for robot learning, synthetic data generation, and advancing robot autonomy from industrial mobile manipulators to generalist humanoid robots.
* Equal contribution
NVIDIA's reference application for robotic simulation
Robot learning framework built on Isaac Sim
Open foundation model for generalist humanoid robots
Hugging Face's open-source robotics toolkit
Ultimate platform for Physical AI at the edge
Open-source library for optimizing LLM inference