country_code

Rack ‘n’ Roll: NVIDIA Grace Hopper Systems Gather at GTC

Accelerated systems from 11 vendors packing NVIDIA Grace Hopper Superchips will be on display at the global AI conference.
by
Picture of MGX modular servers

The spirit of software legend Grace Hopper will live on at NVIDIA GTC.

Accelerated systems using powerful processors — named in honor of the pioneer of programming — will be on display at the global AI conference running March 18-21, ready to take computing to the next level.

System makers will show more than 500 servers in multiple configurations across 18 racks, all packing NVIDIA GH200 Grace Hopper Superchips. They’ll form the largest display at NVIDIA’s booth in the San Jose Convention Center, filling the MGX Pavilion.

MGX Speeds Time to Market

NVIDIA MGX is a blueprint for building accelerated servers with any combination of GPUs, CPUs and data processing units (DPUs) for a wide range of AI, high performance computing and NVIDIA Omniverse applications. It’s a modular reference architecture for use across multiple product generations and workloads.

GTC attendees can get an up-close look at MGX models tailored for enterprise, cloud and telco-edge uses, such as generative AI inference, recommenders and data analytics.

The pavilion will showcase accelerated systems packing single and dual GH200 Superchips in 1U and 2U chassis, linked via NVIDIA BlueField-3 DPUs and NVIDIA Quantum-2 400Gb/s InfiniBand networks over LinkX cables and transceivers.

The systems support industry standards for 19- and 21-inch rack enclosures, and many provide E1.S bays for nonvolatile storage.

Grace Hopper in the Spotlight

Here’s a sampler of MGX systems now available:

  • ASRock RACK’s MECAI-GH-200, measuring 420 x 445 x 87mm, accelerates AI and 5G services in constrained spaces at the edge of telco networks.
  • ASUS’s MGX server, the ESC NM2N-E1, slides into a rack that holds up to 32 GH200 processors and supports air- and water-cooled nodes.
  • Foxconn provides a suite of MGX systems, including a 4U model that accommodates up to eight NVIDIA H100 NVL PCIe Tensor Core GPUs.
  • GIGABYTE’s XH23-VG0-MGX can accommodate plenty of storage in its four 2.5-inch Gen5 NVMe hot-swappable bays and two M.2 slots.
  • Inventec’s systems can slot into 19- and 21-inch racks and use three different implementations of liquid cooling.
  • Lenovo supplies a range of 1U, 2U and 4U MGX servers, including models that support direct liquid cooling.
  • Pegatron’s air-cooled AS201-1N0 server packs a BlueField-3 DPU for software-defined, hardware-accelerated networking.
  • QCT can stack 16 of its QuantaGrid D74S-IU systems, each with two GH200 Superchips, into a single QCT QoolRack.
  • Supermicro’s ARS-111GL-NHR with nine hot-swappable fans is part of a portfolio of air- and liquid-cooled GH200 and NVIDIA Grace CPU systems.
  • Wiwynn’s SV7200H, a 1U dual GH200 system, supports a BlueField-3 DPU and a liquid-cooling subsystem that can be remotely managed.
  • Wistron’s MGX servers are 4U GPU systems for AI inference and mixed workloads, supporting up to eight accelerators in one system.

The new servers are in addition to three accelerated systems using MGX announced at COMPUTEX last May — Supermicro’s ARS-221GL-NR using the Grace CPU and QCT’s QuantaGrid S74G-2U and S74GM-2U powered by the GH200.

Grace Hopper Packs Two in One

System builders are adopting the hybrid processor because it packs a punch.

GH200 Superchips combine a high-performance, power-efficient Grace CPU with a muscular NVIDIA H100 GPU. They share hundreds of gigabytes of memory over a fast NVIDIA NVLink-C2C interconnect.

The result is a processor and memory complex well-suited to take on today’s most demanding jobs, such as running large language models. They have the memory and speed needed to link generative AI models to data sources that can improve their accuracy using retrieval-augmented generation, aka RAG.

Recommenders Run 4x Faster

In addition, the GH200 Superchip delivers greater efficiency and up to 4x more performance than using the H100 GPU with traditional CPUs for tasks like making recommendations for online shopping or media streaming.

In its debut on the MLPerf industry benchmarks last November, GH200 systems ran all data center inference tests, extending the already leading performance of H100 GPUs.

In all these ways, GH200 systems are taking to new heights a computing revolution their namesake helped start on the first mainframe computers more than seven decades ago.

Explore generative AI sessions and experiences at NVIDIA GTC, the global conference on AI and accelerated computing, running March 18-21 in San Jose, Calif., and online.

And get the 30,000-foot view from NVIDIA CEO and founder Jensen Huang in his GTC keynote

  • Categories:
  • AI

NVIDIA Powers Over 400 of the World’s 500 Fastest Supercomputers

by

News Highlights:

  • NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list.
  • 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up eight from the previous list.
  • The top eight systems on the Green500 run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies.
  • No. 1 on the Green500, KAIROS, uses a single NVIDIA Grace Hopper Superchip.
  • 376 of the TOP500 systems are interconnected using NVIDIA networking.

NVIDIA technologies power more than 400 of the world’s 500 fastest supercomputers — 81% of the TOP500 — according to the latest rankings released this week at the ISC High Performance conference in Hamburg, Germany.

That’s a gain of 17 systems from the previous list, with the momentum in new deployments: nearly nine of every 10 systems new to the ranking are built on NVIDIA technologies.

That percentage reflects a deliberate preference for machines built for AI, simulation and science together. And it’s compounding: NVIDIA systems across the TOP500 now deliver more than 2x the AI training and nearly 3x the AI inference throughput of every other platform combined.

GPU and networking adoption each hit new highs, with NVIDIA GPUs accelerating a record 238 systems and NVIDIA networking connecting a record 376 — the vast majority on NVIDIA Quantum InfiniBand, the backbone of large-scale AI and high-performance computing, and the rest on Ethernet. 

The trend behind the numbers is bigger than any one list: Accelerated computing is becoming the foundation for the systems taking on the world’s most demanding work, across AI and science.

Updated twice a year, the TOP500 ranks the world’s fastest supercomputers, while the Green500 list measures how much computing each delivers per watt.

A Full-Stack Footprint

NVIDIA’s reach now spans the full system — GPU, networking and, increasingly, the CPU — with NVIDIA Grace CPU adoption reaching 26 systems, up eight from the previous list, with nearly 2.5 million Grace CPUs shipped.

NVIDIA Grace-based machines sit atop both rankings: JUPITER at No. 5 and Alps at No. 10 on the TOP500, and KAIROS at No. 1 on the Green500.

Each pairs an NVIDIA GPU with the Grace CPU in a single NVIDIA Grace Hopper Superchip, letting the two share memory with minimal overhead — a design built for the memory-intensive demands of modern AI.

The NVIDIA Vera CPU, announced earlier this year, builds on the success of Grace, taking CPU performance and energy efficiency to new levels for the most demanding AI workloads in modern data centers — where agents move from answering basic questions to taking actions, running code, using tools and evaluating results.

Topping the Efficiency List

NVIDIA swept the Green500 ranking of the most energy-efficient supercomputers: The top eight all run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies. 

Leading the list is KAIROS, an NVIDIA Grace Hopper system at France’s University of Toulouse, at 73.3 gigaflops per watt — with Grace Hopper systems taking the top four spots, across France, Germany and the U.K.

From Exascale Science to the Next Wave

A record 35 NVIDIA AI HPC supercomputers are in development across Europe — equipping more than 3 million researchers with next-generation infrastructure for continental AI, accelerated science and industrial innovation.

Among these systems is JUPITER, Europe’s fastest supercomputer and its first to reach exascale, at the Jülich Supercomputing Centre in Germany.

JUPITER is mapping the human brain at cellular scale, simulating Earth’s climate and advancing the AI behind next-generation 6G networks.

The newest arrivals to the list run on the NVIDIA Blackwell architecture, with B200 and GB200 systems entering the rankings across Asia, Europe and the U.S. — and the first GB200 systems debuting in Japan.

The buildout is global, from a new AI factory in South Africa to national AI systems in Saudi Arabia, Singapore and Vietnam.

It’s the same story up and down the list: the world’s AI buildout is running on NVIDIA.

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

by

For the past two years, the U.S. National Science Foundation’s National Artificial Intelligence Research Resource (NAIRR) pilot program has driven innovative research across the U.S. for over 700 projects — spanning protein prediction and infectious disease outbreak management. 

NVIDIA contributed to the NAIRR pilot through a cloud-based resource that gives researchers dedicated access to a minimum of four NVIDIA DGX nodes for at least a month. NVIDIA also provided technical support to onboard and assist the researchers throughout their projects. 

With NVIDIA’s AI infrastructure support and DGX reference architecture providing dedicated resources, researchers have collapsed workflow timelines and uncovered groundbreaking technologies that will reshape and advance industries such as healthcare, agriculture and energy. 

The potential for scientific exploration and discovery across the nation through NAIRR is boundless. Learn more about a few NAIRR projects below. 

Physical Simulations With Polymathic AI’s Well Dataset

Simulation-to-real pipelines are becoming increasingly common across industries as a safer, more cost-efficient deployment method. 

Polymathic AI — a coalition of international scientists from Flatiron Institute, Cambridge University and Lawrence Berkeley National Lab — with the help of NVIDIA GPUs and NVIDIA NVLink interconnect technology, is strengthening physical, fluidlike simulations with its large-scale dataset called the “Well.” The dataset will be used to train the largest and most broadly applicable foundation model for fluidlike behavior to date. 

This foundation model, named Walrus, has been made publicly available along with its data, code and pertained weights. 

Polymathic AI’s approach builds on previous work in physics pretraining environments — addressing current limitations in scale and pretraining diversity. The research group also plans to explore scaling laws to help accelerate the development of more powerful foundation models for scientific applications.

University of Michigan’s Fusion Model for Energy Storage

Energy, a foundation of society, requires designing novel and efficient materials for energy storage and conversion.

Researchers at the University of Michigan, led by Professor Venkat Viswanathan in the Department of Aerospace engineering, are developing a model-fusion framework that brings together domain-specific molecular AI and general-purpose large language models. The goal is to help computational scientists more easily explore chemical space, ask chemistry-specific questions in natural language and identify promising materials for next-generation energy technologies. 

The family of molecular foundation models, MIST (the Molecular Insight SMILES Transformers), is designed for discovery and exploration across chemical space. 

MIST models were pretrained on large unlabeled molecular datasets and use a novel tokenizer, Smirk, to better capture nuclear, electronic, geometric, isotopic and stereochemical information from molecular representations. MIST models have been fine-tuned on more than 400 structure-property relationships and can match or exceed state-of-the-art performance across benchmarks spanning electrochemistry, quantum chemistry, physiology and other domains. 

MIST was developed on a 40-GPU NVIDIA DGX cluster the researchers gained as part of a NAIRR allocation and an additional 200,000 NVIDIA GPU hours on ALCF’s Polaris cluster. The team used NVIDIA’s NGC PyTorch container to support reproducible GPU-accelerated development across the different clusters.

Fusing MIST with general-purpose LLMs makes accurate quantum-chemical calculations more broadly accessible and accelerates the design of energy storage and conversion systems needed to enable widespread electrification of transportation, such as in the heavy-duty and aviation sectors.

Boston University’s BEACON AI Pipeline for Infectious Disease Detection 

Infectious diseases can spread rapidly in communities, causing surges in outbreaks. 

Boston University’s Hariri Institute for Computing and the Center on Emerging Infectious Diseases is working to train and evaluate a LLM using NVIDIA accelerated compute, through an AI pipeline to support an outbreak monitoring program called BEACON — Biothreats Emergence, Analysis and Communications Network.  

This LLM is being trained using a large corpus of documents on infectious diseases and epidemic-prone priority pathogens to support the work of field experts and outbreak analysts working on BEACON.

The model will be capable of analyzing online posts of emerging disease outbreaks on a global scale to extract features for downstream categorization and prioritization. BEACON will process signals from a variety of sources — including global disease-tracking platform HealthMap, news and social media feeds, subject-matter experts and individual communications via community boards or social media — to generate concise outbreak reports.  

These comprehensive outbreak analyses can inform clinical practice guidelines for emerging infectious diseases and identify gaps where further data is needed. 

Internationally deployed doctors, government organizations and academic researchers are already using the BEACON model to quickly identify and treat infectious diseases. 

“When you talk to infectious disease experts about what they used to do before we developed this pipeline, it used to take several hours for them to compose a report,” said Ioannis Paschalidis, director of Boston University’s Hariri Institute. “Now, producing a report gets done in roughly two minutes.” 

NAIRR and NVIDIA Across the Nation 

The latest scientific research doesn’t end there. Many other universities — including Harvard, Stanford, Colorado State University and more — are pioneering scientific breakthroughs with the help of NAIRR and NVIDIA. 

With scientists gaining broader access to AI and accelerated computing, innovation for a safer and healthier nation are more tangible than ever. 

Learn more about the NAIRR pilot program and explore how NVIDIA is driving academic research.

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

Mission, Vision and Veritas supercomputers with Vera CPUs to advance materials simulation, scientific AI agents and molecular design.
by

Mission, Vision and Veritas — new Los Alamos National Laboratory (LANL) supercomputers to be built with HPE and NVIDIA — are tapping NVIDIA Vera CPUs to accelerate scientific discovery, unlocking agentic AI for science.

The supercomputers will use the HPE Cray Supercomputing GX5000 architecture with the NVIDIA Vera Rubin platform, combining NVIDIA Vera CPUs, NVIDIA Rubin GPUs and NVIDIA Quantum-X800 InfiniBand networking.

Under the planned configuration, Mission will include NVIDIA Vera Rubin GPU nodes and 2,300 standalone NVIDIA Vera CPUs using the HPE Cray Supercomputing GX240 blade. Veritas will feature approximately 1,150 standalone NVIDIA Vera CPUs to complement NVIDIA Vera Rubin nodes. 

Veritas will arrive alongside Mission and Vision and serve the Laboratory Directed Research and Development program, helping accelerate agentic AI for science. The system will test these technologies for use in larger systems being built out at LANL. 

Researchers are adding a new tool for science with AI agents that can form hypotheses, choose tools, launch simulations, analyze outputs and refine the next step. LANL’s public work on URSA, the Universal Research and Scientific Agent — running on Venado and soon Mission and Vision — points in this direction: a modular, feedback-driven AI framework designed to help scientists brainstorm hypotheses, plan experiments, run simulations and analyze results. 

LANL demonstrated that the Vera CPU delivered 7x higher performance on URSA workloads than the CPUs in the Crossroads x86 supercomputer.

Vera CPU for Agents and Simulation

In LANL’s early testing of NVIDIA Vera CPUs on Branson — an open source Monte Carlo heat transfer simulation tool — Vera outperforms the CPUs used in the Crossroads x86 supercomputer by over 3x. 

These results were made possible by Vera, including its custom Olympus core, LPDDR5 memory and fast on-chip fabric. 

A single Vera CPU outperforms a single socket x86-based CPU by more than 3x while providing more than 4x the memory per core and 6x the memory per node. Ultimately, this means faster  scientific results for LANL.

All of the lab’s supercomputers were codesigned by hardware architects, system software developers, domain scientists, computer scientists and applied mathematicians — helping ensure systems are shaped by real scientific workloads, not abstract benchmarks alone. 

Building on Generations of LANL Systems

Mission, expected to be operational in 2027, will be the fifth Advanced Technology System in the National Nuclear Security Administration’s Advanced Simulation and Computing program and will replace Crossroads for classified national security workloads. 

Vision, also expected to be operational in 2027, will serve as a resource for fundamental science, including materials and nuclear science, energy modeling, biomedical research and AI — letting more scientists test methods, train models and explore ideas before moving into higher-consequence work.

The work extends more than a decade of LANL and NVIDIA’s deep collaboration on CPUs, from Grace to Vera, using extreme codesign for LANL simulation workloads.

The three new supercomputers build on Venado, the HPE Cray EX supercomputer installed at Los Alamos in 2024 with NVIDIA GH200 Grace Hopper Superchips and NVIDIA Grace CPU Superchips. 

Learn more about the NVIDIA Vera CPU.