NVIDIA Shifts to In-House Server Manufacturing, Reducing OEMs to a Supporting Role

NVIDIA Unveils "Vera Rubin" Product Stack to Transform AI Server Manufacturing

NVIDIA is set to redefine the landscape of high-performance AI server infrastructure with its upcoming "Vera Rubin" product stack, scheduled for volume production as the VR200 line in late 2026. This new approach marks a significant shift from the traditional model, where NVIDIA supplied CPUs and GPUs to original equipment manufacturers (OEMs). Instead, NVIDIA will deliver fully assembled L10 compute trays, each pre-configured with "Vera" CPUs, "Rubin" accelerators, high-speed memory, 800G network interface cards (NICs), robust 110kW power delivery systems, and integrated liquid-cooling solutions. Every unit will undergo comprehensive validation before leaving the factory, ensuring reliability and performance consistency.

Impact on OEMs and System Vendors

This streamlined manufacturing process will prompt system vendors and OEMs to pivot away from designing core server electronics. Their focus will shift toward rack-level assembly, power configuration, installation of rack cooling sidecars, and final system certification. As a result, the traditional avenues for hardware differentiation among OEMs and hyperscalers will narrow, with more value and profit potential moving upstream to certified component manufacturers and large-scale integrators.

Technical Innovations and Industry Implications

The "Rubin" silicon is rumored to feature a thermal design power (TDP) of up to 2.3 kW for the highest-end models, pushing rack-level power consumption beyond 250 kW. This unprecedented power density makes custom cooling solutions less economically viable. NVIDIA’s strategy involves standardizing both cooling and power delivery across two distinct performance tiers, while also increasing network I/O density. These design choices reduce the practicality and cost-effectiveness of third-party replication, further centralizing production with select partners.

Samir Patel Samir is a hardware analyst who follows chipmakers, supply chains, and performance benchmarks closely. His articles provide deep dives into what’s powering the future of computing.