country_code

Telco GPT: Survey Shows Scale of Industry’s Enthusiasm and Adoption of Generative AI

Generative AI is fueling investments in AI, but the telecom industry remains at the early phase of the AI investment cycle.
by

It’s been five years since the telecommunications industry first deployed 5G networks to drive new performance levels for customers and unlock new value for telcos.

But that industry milestone has been overshadowed by the emergence of generative AI and the swift pace at which telcos are embracing large language models as they seek to transform all parts of their business.

A recent survey of more than 400 telecommunications industry professionals from around the world showed that generative AI is the breakout technology of the year and that enthusiasm and adoption for both generative AI, and AI in general, is booming. In addition, the survey showed that, among respondents, AI is improving both revenues and cost savings.

The generative AI insight is the main highlight in the second edition of NVIDIA’s “State of AI in Telecommunications” survey, which included questions covering a range of AI topics, including infrastructure spending, top use cases, biggest challenges and deployment models.

Survey respondents included C-suite leaders, managers, developers and IT architects from mobile telecoms, fixed and cable companies. The survey was conducted over eight weeks between October and December.

Ramping Up on Generative AI

The survey results show how generative AI went from relative obscurity in 2022 to a key solution within a year. Forty-three percent of respondents reported they were investing in it, showing clear evidence that the telecom industry is enthusiastically embracing the generative AI wave to address a wide variety of business goals.

More broadly, there was a marked increase in interest in adopting AI and growing expectations of success from the technology, especially among industry executives. In the survey, 53% of respondents agreed or strongly agreed that adopting AI will be a source of competitive advantage, compared to 39% who reported the same in 2022. For management respondents, the figure was 56%.

The primary reason for this sustained engagement is because many industry stakeholders expect AI to contribute to their company’s success. Overall, 56% of respondents agreed or strongly agreed that “AI is important to my company’s future success,” with the figure rising to 61% among decision-making management respondents. The overall figure is a 14-point boost over the 42% result from the 2022 survey.

Customer Experience Remains Key Driver of AI Investment 

Telcos are adopting AI and generative AI to address a wide variety of business needs. Overall, 31% of respondents said they invested in at least six AI use cases in 2023, while 40% are planning to scale to six or more use cases in 2024.

But enhancing customer experiences remains the biggest AI opportunity for the telecom industry, with 48% of survey respondents selecting it as their main goal for using the technology. Likewise, some 35% of respondents identified customer experiences as their key AI success story.

For generative AI, 57% are using it to improve customer service and support, 57% to improve employee productivity, 48% for network operations and management, 40% for network planning and design, and 32% for marketing content generation.

Early Phase of AI Investment Cycle

The focus on customer experience is influencing investments. Investing in customer-experience optimization remains the most popular AI use case for 2023 (49% of respondents) and for generative AI investments (57% of respondents).

Telcos are also investing in other AI use cases: security (42%), network predictive maintenance (37%), network planning and operations (34%) and field operations (34%) are notable examples. However, using AI for fraud detection in transactions and payments had the biggest jump in popularity between 2022 and 2023, rising 14 points to 28% of respondents.

Overall, investments in AI are still in an early phase of the investment cycle, although growing strongly. In the survey, 43% of respondents reported an investment of over $1 million in AI in their previous year, 52% reported the same for the current year, and 66% reported their budget for AI infrastructure will increase in the next year.

For those who are already investing in AI, 67% reported that AI adoption has helped them increase revenues, with 19% of respondents noting that this revenue growth is more than 10% in specific business areas. Likewise, 63% reported that AI adoption has helped them reduce costs in specific business areas, with 14% noting that this cost reduction is more than 10%.

Innovation With Partners

While telcos are increasing their investments to improve their internal AI capabilities, partnerships remain critical for the adoption of AI solutions in the industry. This is applicable both for AI models and AI hardware infrastructure.

In the survey, 44% of respondents reported that co-development with partners is their company’s preferred approach to building AI solutions. Some 28% of respondents prefer to use open-source tools, while 25% take an AI-as-a-service approach. For generative AI, 29% of respondents built or customized models with a partner, an understandable conservative approach for the telecom industry with its stringent data protection rules.

For infrastructure, increasingly, many telcos are opting for cloud hosting, although the hybrid model still remains dominant. In the survey, 31% of respondents reported that they run most of their AI workloads in the cloud (44% for hybrid), compared to 21% of respondents in the previous survey (56% for hybrid). This is helping to fuel the growing need for more localized cloud infrastructure.

Download the “State of AI in Telecommunications: 2024 Trends” report for in-depth results and insights.

Explore how AI is transforming telecommunications at NVIDIA GTC, featuring industry leaders including Amdocs, Indosat, KT, Samsung Research, ServiceNow, Singtel, SoftBank, Telconet and Verizon.

Learn more about NVIDIA solutions for telecommunications across customer experience, network operations, sovereign AI factories and more.

  • Categories:
  • AI

NVIDIA Powers Over 400 of the World’s 500 Fastest Supercomputers

by

News Highlights:

  • NVIDIA technology runs 81% of the TOP500 and 90% of the systems new to the list.
  • 26 systems on the TOP500 adopted the NVIDIA Grace CPU, up eight from the previous list.
  • The top eight systems on the Green500 run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies.
  • No. 1 on the Green500, KAIROS, uses a single NVIDIA Grace Hopper Superchip.
  • 376 of the TOP500 systems are interconnected using NVIDIA networking.

NVIDIA technologies power more than 400 of the world’s 500 fastest supercomputers — 81% of the TOP500 — according to the latest rankings released this week at the ISC High Performance conference in Hamburg, Germany.

That’s a gain of 17 systems from the previous list, with the momentum in new deployments: nearly nine of every 10 systems new to the ranking are built on NVIDIA technologies.

That percentage reflects a deliberate preference for machines built for AI, simulation and science together. And it’s compounding: NVIDIA systems across the TOP500 now deliver more than 2x the AI training and nearly 3x the AI inference throughput of every other platform combined.

GPU and networking adoption each hit new highs, with NVIDIA GPUs accelerating a record 238 systems and NVIDIA networking connecting a record 376 — the vast majority on NVIDIA Quantum InfiniBand, the backbone of large-scale AI and high-performance computing, and the rest on Ethernet. 

The trend behind the numbers is bigger than any one list: Accelerated computing is becoming the foundation for the systems taking on the world’s most demanding work, across AI and science.

Updated twice a year, the TOP500 ranks the world’s fastest supercomputers, while the Green500 list measures how much computing each delivers per watt.

A Full-Stack Footprint

NVIDIA’s reach now spans the full system — GPU, networking and, increasingly, the CPU — with NVIDIA Grace CPU adoption reaching 26 systems, up eight from the previous list, with nearly 2.5 million Grace CPUs shipped.

NVIDIA Grace-based machines sit atop both rankings: JUPITER at No. 5 and Alps at No. 10 on the TOP500, and KAIROS at No. 1 on the Green500.

Each pairs an NVIDIA GPU with the Grace CPU in a single NVIDIA Grace Hopper Superchip, letting the two share memory with minimal overhead — a design built for the memory-intensive demands of modern AI.

The NVIDIA Vera CPU, announced earlier this year, builds on the success of Grace, taking CPU performance and energy efficiency to new levels for the most demanding AI workloads in modern data centers — where agents move from answering basic questions to taking actions, running code, using tools and evaluating results.

Topping the Efficiency List

NVIDIA swept the Green500 ranking of the most energy-efficient supercomputers: The top eight all run on NVIDIA GPUs and nine of the top 10 use NVIDIA technologies. 

Leading the list is KAIROS, an NVIDIA Grace Hopper system at France’s University of Toulouse, at 73.3 gigaflops per watt — with Grace Hopper systems taking the top four spots, across France, Germany and the U.K.

From Exascale Science to the Next Wave

A record 35 NVIDIA AI HPC supercomputers are in development across Europe — equipping more than 3 million researchers with next-generation infrastructure for continental AI, accelerated science and industrial innovation.

Among these systems is JUPITER, Europe’s fastest supercomputer and its first to reach exascale, at the Jülich Supercomputing Centre in Germany.

JUPITER is mapping the human brain at cellular scale, simulating Earth’s climate and advancing the AI behind next-generation 6G networks.

The newest arrivals to the list run on the NVIDIA Blackwell architecture, with B200 and GB200 systems entering the rankings across Asia, Europe and the U.S. — and the first GB200 systems debuting in Japan.

The buildout is global, from a new AI factory in South Africa to national AI systems in Saudi Arabia, Singapore and Vietnam.

It’s the same story up and down the list: the world’s AI buildout is running on NVIDIA.

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

by

For the past two years, the U.S. National Science Foundation’s National Artificial Intelligence Research Resource (NAIRR) pilot program has driven innovative research across the U.S. for over 700 projects — spanning protein prediction and infectious disease outbreak management. 

NVIDIA contributed to the NAIRR pilot through a cloud-based resource that gives researchers dedicated access to a minimum of four NVIDIA DGX nodes for at least a month. NVIDIA also provided technical support to onboard and assist the researchers throughout their projects. 

With NVIDIA’s AI infrastructure support and DGX reference architecture providing dedicated resources, researchers have collapsed workflow timelines and uncovered groundbreaking technologies that will reshape and advance industries such as healthcare, agriculture and energy. 

The potential for scientific exploration and discovery across the nation through NAIRR is boundless. Learn more about a few NAIRR projects below. 

Physical Simulations With Polymathic AI’s Well Dataset

Simulation-to-real pipelines are becoming increasingly common across industries as a safer, more cost-efficient deployment method. 

Polymathic AI — a coalition of international scientists from Flatiron Institute, Cambridge University and Lawrence Berkeley National Lab — with the help of NVIDIA GPUs and NVIDIA NVLink interconnect technology, is strengthening physical, fluidlike simulations with its large-scale dataset called the “Well.” The dataset will be used to train the largest and most broadly applicable foundation model for fluidlike behavior to date. 

This foundation model, named Walrus, has been made publicly available along with its data, code and pertained weights. 

Polymathic AI’s approach builds on previous work in physics pretraining environments — addressing current limitations in scale and pretraining diversity. The research group also plans to explore scaling laws to help accelerate the development of more powerful foundation models for scientific applications.

University of Michigan’s Fusion Model for Energy Storage

Energy, a foundation of society, requires designing novel and efficient materials for energy storage and conversion.

Researchers at the University of Michigan, led by Professor Venkat Viswanathan in the Department of Aerospace engineering, are developing a model-fusion framework that brings together domain-specific molecular AI and general-purpose large language models. The goal is to help computational scientists more easily explore chemical space, ask chemistry-specific questions in natural language and identify promising materials for next-generation energy technologies. 

The family of molecular foundation models, MIST (the Molecular Insight SMILES Transformers), is designed for discovery and exploration across chemical space. 

MIST models were pretrained on large unlabeled molecular datasets and use a novel tokenizer, Smirk, to better capture nuclear, electronic, geometric, isotopic and stereochemical information from molecular representations. MIST models have been fine-tuned on more than 400 structure-property relationships and can match or exceed state-of-the-art performance across benchmarks spanning electrochemistry, quantum chemistry, physiology and other domains. 

MIST was developed on a 40-GPU NVIDIA DGX cluster the researchers gained as part of a NAIRR allocation and an additional 200,000 NVIDIA GPU hours on ALCF’s Polaris cluster. The team used NVIDIA’s NGC PyTorch container to support reproducible GPU-accelerated development across the different clusters.

Fusing MIST with general-purpose LLMs makes accurate quantum-chemical calculations more broadly accessible and accelerates the design of energy storage and conversion systems needed to enable widespread electrification of transportation, such as in the heavy-duty and aviation sectors.

Boston University’s BEACON AI Pipeline for Infectious Disease Detection 

Infectious diseases can spread rapidly in communities, causing surges in outbreaks. 

Boston University’s Hariri Institute for Computing and the Center on Emerging Infectious Diseases is working to train and evaluate a LLM using NVIDIA accelerated compute, through an AI pipeline to support an outbreak monitoring program called BEACON — Biothreats Emergence, Analysis and Communications Network.  

This LLM is being trained using a large corpus of documents on infectious diseases and epidemic-prone priority pathogens to support the work of field experts and outbreak analysts working on BEACON.

The model will be capable of analyzing online posts of emerging disease outbreaks on a global scale to extract features for downstream categorization and prioritization. BEACON will process signals from a variety of sources — including global disease-tracking platform HealthMap, news and social media feeds, subject-matter experts and individual communications via community boards or social media — to generate concise outbreak reports.  

These comprehensive outbreak analyses can inform clinical practice guidelines for emerging infectious diseases and identify gaps where further data is needed. 

Internationally deployed doctors, government organizations and academic researchers are already using the BEACON model to quickly identify and treat infectious diseases. 

“When you talk to infectious disease experts about what they used to do before we developed this pipeline, it used to take several hours for them to compose a report,” said Ioannis Paschalidis, director of Boston University’s Hariri Institute. “Now, producing a report gets done in roughly two minutes.” 

NAIRR and NVIDIA Across the Nation 

The latest scientific research doesn’t end there. Many other universities — including Harvard, Stanford, Colorado State University and more — are pioneering scientific breakthroughs with the help of NAIRR and NVIDIA. 

With scientists gaining broader access to AI and accelerated computing, innovation for a safer and healthier nation are more tangible than ever. 

Learn more about the NAIRR pilot program and explore how NVIDIA is driving academic research.

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

Mission, Vision and Veritas supercomputers with Vera CPUs to advance materials simulation, scientific AI agents and molecular design.
by

Mission, Vision and Veritas — new Los Alamos National Laboratory (LANL) supercomputers to be built with HPE and NVIDIA — are tapping NVIDIA Vera CPUs to accelerate scientific discovery, unlocking agentic AI for science.

The supercomputers will use the HPE Cray Supercomputing GX5000 architecture with the NVIDIA Vera Rubin platform, combining NVIDIA Vera CPUs, NVIDIA Rubin GPUs and NVIDIA Quantum-X800 InfiniBand networking.

Under the planned configuration, Mission will include NVIDIA Vera Rubin GPU nodes and 2,300 standalone NVIDIA Vera CPUs using the HPE Cray Supercomputing GX240 blade. Veritas will feature approximately 1,150 standalone NVIDIA Vera CPUs to complement NVIDIA Vera Rubin nodes. 

Veritas will arrive alongside Mission and Vision and serve the Laboratory Directed Research and Development program, helping accelerate agentic AI for science. The system will test these technologies for use in larger systems being built out at LANL. 

Researchers are adding a new tool for science with AI agents that can form hypotheses, choose tools, launch simulations, analyze outputs and refine the next step. LANL’s public work on URSA, the Universal Research and Scientific Agent — running on Venado and soon Mission and Vision — points in this direction: a modular, feedback-driven AI framework designed to help scientists brainstorm hypotheses, plan experiments, run simulations and analyze results. 

LANL demonstrated that the Vera CPU delivered 7x higher performance on URSA workloads than the CPUs in the Crossroads x86 supercomputer.

Vera CPU for Agents and Simulation

In LANL’s early testing of NVIDIA Vera CPUs on Branson — an open source Monte Carlo heat transfer simulation tool — Vera outperforms the CPUs used in the Crossroads x86 supercomputer by over 3x. 

These results were made possible by Vera, including its custom Olympus core, LPDDR5 memory and fast on-chip fabric. 

A single Vera CPU outperforms a single socket x86-based CPU by more than 3x while providing more than 4x the memory per core and 6x the memory per node. Ultimately, this means faster  scientific results for LANL.

All of the lab’s supercomputers were codesigned by hardware architects, system software developers, domain scientists, computer scientists and applied mathematicians — helping ensure systems are shaped by real scientific workloads, not abstract benchmarks alone. 

Building on Generations of LANL Systems

Mission, expected to be operational in 2027, will be the fifth Advanced Technology System in the National Nuclear Security Administration’s Advanced Simulation and Computing program and will replace Crossroads for classified national security workloads. 

Vision, also expected to be operational in 2027, will serve as a resource for fundamental science, including materials and nuclear science, energy modeling, biomedical research and AI — letting more scientists test methods, train models and explore ideas before moving into higher-consequence work.

The work extends more than a decade of LANL and NVIDIA’s deep collaboration on CPUs, from Grace to Vera, using extreme codesign for LANL simulation workloads.

The three new supercomputers build on Venado, the HPE Cray EX supercomputer installed at Los Alamos in 2024 with NVIDIA GH200 Grace Hopper Superchips and NVIDIA Grace CPU Superchips. 

Learn more about the NVIDIA Vera CPU.