Nvidia unveils new GPU designed for long-context inference
Samira Vishwas | September 10, 2025 1:24 AM CST

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPXdesigned for context windows larger than 1 million tokens.
Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.
Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.
The Rubin CPX is slated to be available at the end of 2026.
READ NEXT
-
Mysterious 3i/Atlas from Space, Funny Structure and Co₂ with coma
-
‘Best Bifor’ of open sweets disappears, demand for sweets increased in festivals, but negligence on security
-
The right age and effective methods for children
-
Married men who seek out affairs on cheating sites all have this one thing in common
-
Is it really possible to be ‘Just Friend’ with X? Or it becomes the biggest difficulty of life