
Nvidia's GPC tokenizes human motion like ChatGPT tokenizes text, letting humanoid robots improvise fall recovery without explicit programming. The research targets Tesla, Figure, and Apptronik.
Nvidia presented a new approach to robotics control at SIGGRAPH 2026. The company's research team built Generative Pretrained Controllers, or GPC, that tokenize human motion the same way large language models tokenize text. A robot trained on this system can improvise fall recovery without being explicitly programmed with a recovery routine.
The method borrows directly from the transformer architecture behind ChatGPT. Human movements are discretized into a vocabulary of motion tokens. A transformer then learns to predict the next token, generating sequential physical behaviors. The GPC was trained on more than 600 hours of motion data.
Traditional controllers require engineers to define reward signals for every task. GPC sidesteps that by creating a general-purpose motion foundation model. Controllers produced by the system are reusable and fine-tunable for new tasks, the researchers said.
Nvidia is pursuing several parallel motion tokenization projects. MotionBricks uses structured multi-head tokenizers trained on roughly 350,000 motion clips and achieves processing speeds up to 15,000 frames per second for real-time animation. Kimodo focuses on text-to-motion generation. AMPLIFY explores how robots can generalize from video data to physical actions.
All feed into Nvidia's Isaac GR00T platform, which serves as the integration layer connecting motion models with simulation tools like Isaac Lab. Researchers train policies in simulation before deploying them in the real world.
The 600 hours of training data is substantial by robotics standards, though in the world of GPT-class language models it is small. Motion tokenization remains in relatively early development, the team noted.
NVDA shares closed at $197.58 on Monday, down 1.25%. AlphaScala's proprietary model rates the stock at 65 out of 100, a moderate score.
Companies building humanoid robots – Tesla's Optimus program, Figure, Apptronik – all need to solve the motion problem. The 15,000-fps processing speed of MotionBricks suggests the latency problem is solvable. Robustness in unpredictable environments remains an open challenge, the researchers said.
Prepared with AlphaScala research tooling and grounded in primary market data: live prices, fundamentals, SEC filings, hedge-fund holdings, and insider activity. Each story is checked against AlphaScala publishing rules before release. Educational coverage, not personalized advice.