NVIDIA’s next DGX supercomputer is all about generative AI

CEO Jensen Huang made a string of announcements during his Computex keynote, including details about the company’s next DGX supercomputer. Given where the industry is clearly heading, it shouldn’t come as a surprise that the DGX GH200 is largely about helping companies develop generative AI models.
The supercomputer uses a new NVLink Switch System to enable 256 GH200 Grace Hopper superchips to act as a single GPU (each of the chips has an Arm-based Grace CPU and an H100 Tensor Core GPU). This, according to NVIDIA, allows the DGX GH200 to deliver 1 exaflop of performance and to have 144 terabytes of shared memory. The company says that is nearly 500 times as much memory as you’d find in a single DGX A100 system.
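To put those memory figures in perspective, here is a rough back-of-the-envelope sketch using only the numbers NVIDIA quotes above. The function names are made up for illustration, the choice between decimal (1 TB = 1,000 GB) and binary (1 TB = 1,024 GB) units is an assumption, and "nearly 500 times" is treated as an upper bound, so the implied DGX A100 figure is a floor rather than an official spec.

```python
# Figures quoted in the article (NVIDIA's claims, not independent measurements).
SUPERCHIPS = 256                 # GH200 Grace Hopper superchips per DGX GH200
SHARED_MEMORY_TB = 144           # total shared memory claimed by NVIDIA
MEMORY_RATIO_VS_DGX_A100 = 500   # "nearly 500 times", treated as an upper bound


def per_chip_memory_gb(total_tb, chips, gb_per_tb=1000):
    """Average share of the memory pool per superchip, in GB.

    gb_per_tb is an assumption: 1000 for decimal units, 1024 for binary.
    """
    return total_tb * gb_per_tb / chips


def implied_baseline_gb(total_tb, ratio, gb_per_tb=1000):
    """Memory of the comparison system implied by a 'ratio times' claim, in GB."""
    return total_tb * gb_per_tb / ratio


if __name__ == "__main__":
    decimal_share = per_chip_memory_gb(SHARED_MEMORY_TB, SUPERCHIPS)
    binary_share = per_chip_memory_gb(SHARED_MEMORY_TB, SUPERCHIPS, gb_per_tb=1024)
    baseline = implied_baseline_gb(SHARED_MEMORY_TB, MEMORY_RATIO_VS_DGX_A100)

    print(f"Per superchip (decimal units): {decimal_share} GB")
    print(f"Per superchip (binary units):  {binary_share} GB")
    print(f"Implied DGX A100 memory, at least: {baseline} GB")
```

Because the ratio is "nearly" 500, the actual DGX A100 figure NVIDIA has in mind would be somewhat higher than the value this prints.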
For comparison, the Top500 ranking of supercomputers lists only one known exascale system, which reached a performance of nearly 1.2 exaflops on the Linpack benchmark. That is over twice the peak performance of the second-placed system, which is in Japan.
In effect, NVIDIA claims to have developed a supercomputer that can stand alongside the most powerful known system in the world (Meta is building one that it claims will be the fastest AI supercomputer in the world once it’s fully built out). NVIDIA says the architecture of the DGX GH200 provides 10 times more bandwidth than the previous generation, “delivering the power of a massive AI supercomputer with the simplicity of programming a single GPU.”
Some big names are interested in the DGX GH200. Google Cloud, Meta and Microsoft should be among the first companies to gain access to the supercomputer to test how it can handle generative AI workloads. NVIDIA says DGX GH200 supercomputers should be available by the end of 2023.
The company is also building its own supercomputer, Helios, which combines four DGX GH200 systems. NVIDIA expects Helios to be online by the end of the year.
Huang discussed other generative AI developments during his keynote, including one on the gaming front. NVIDIA Avatar Cloud Engine (ACE) for Games is a service developers will be able to tap into to create custom AI models for speech, conversation and animation. NVIDIA says ACE for Games can “give non-playable characters conversational skills so they can respond to questions with lifelike personalities that evolve.”