country_code

What Is Federated Learning?

Federated learning is a way to develop and validate AI models from diverse data sources while mitigating the risk of compromising data security or privacy, as the data never leaves individual sites.
by Nicola Rieke
AI healthcare

Editor’s note: On April 16, 2024, we updated our original post on federated learning, which was first published October 13, 2019. 

The key to becoming a medical specialist, in any discipline, is experience.

Knowing how to interpret symptoms, which move to make next in critical situations, and which treatment to provide — it all comes down to the training you’ve had and the opportunities you’ve had to apply it.

For AI algorithms, experience comes in the form of large, varied, high-quality datasets. But such datasets have traditionally proved hard to come by, especially in the area of healthcare.

Federated learning is a way to develop and validate accurate, generalizable AI models from diverse data sources while mitigating the risk of compromising data security or privacy. It enables AI models to be built with a consortium of data providers without the data ever leaving individual sites.

Medical institutions have had to rely on their own data sources, which can be biased by, for example, patient demographics, the instruments used or clinical specializations. Or they’ve needed to pool data from other institutions to gather all of the information they need, which requires managing regulatory issues.

Federated learning makes it possible for AI algorithms to gain experience from a vast range of data located at different sites.

The approach enables several organizations to collaborate on the development of models, but without needing to directly share sensitive clinical data with each other.

Over the course of several training iterations the shared models get exposed to a significantly wider range of data than what any single organization possesses in-house.

Federated learning is gaining traction beyond healthcare, moving into financial services, cybersecurity, transportation, high performance computing, energy, drug discovery and other fields.

Frameworks such as NVIDIA FLARE (NVFlare) have enabled enterprises to collaborate by contributing data through federated learning for model improvements.

NVFlare, an open-source federated learning framework that’s widely adopted across various applications, offers a diverse range of examples of machine learning and deep learning algorithms. It includes robust security features, advanced privacy protection techniques and a flexible system architecture — building trust among users.

How Federated Learning Works 

The main concept of federated learning is to train models locally without sharing data, only the model parameters.

The aggregator starts with an initial global model and broadcasts the model parameters to all clients. The client node receives the global model parameters and starts training the received model on local data. Then, the newly trained local model is sent back to the aggregator node. Only model parameters, no private data, are shared with the aggregator.

The aggregator node will perform aggregation, such as weighted average, to produce a new global model. That new global model will be broadcast again by repeating the first step until convergence, or until it’s reached the max number of rounds.

AI algorithms deployed in medical scenarios ultimately need to reach clinical-grade accuracy. Largely speaking, this means that they meet, or exceed, the gold standard for the application to which they’re applied.

To be considered an expert in a particular medical field, you generally need to have clocked 15 years on the job. Such an expert has probably read around 15,000 cases in a year, which adds up to around 225,000 over their career.

When you consider rare diseases, which affect around one in 2,000 people, even an expert with three decades’ experience will have only seen roughly 100 cases of a particular condition.

To train models that meet the same grade as medical experts, the AI algorithms need to be fed a large number of cases. And these examples need to sufficiently represent the clinical environment in which they’ll be used.

But currently the largest open dataset contains 100,000 cases.

And it’s not only the amount of data that counts. It also needs to be very diverse and incorporate samples from patients of different genders, ages, demographics and environmental exposures.

Individual healthcare institutes may have archives containing hundreds of thousands of records and images, but these data sources are typically kept siloed. This is largely because health data is private and cannot be used without the necessary patient consent and ethical approval.

Federated learning decentralizes deep learning by removing the need to pool data into a single location. Instead, the model is trained in multiple iterations at different sites.

For example, say three hospitals decide to team up and build a model to help automatically analyze brain tumor images.

If they chose to work with a client-server federated approach, a centralized server would maintain the global deep neural network and each participating hospital would be given a copy to train on their own dataset.

Once the model had been trained locally for a couple of iterations, the participants would send their updated version of the model back to the centralized server and keep their dataset within their own secure infrastructure.

The central server would then aggregate the contributions from all of the participants. The updated parameters would then be shared with the participating institutes, so that they could continue local training.

A centralized-server approach to federated learning.

If one of the hospitals decided it wanted to leave the training team, this would not halt the training of the model, as it’s not reliant on any specific data. Similarly, a new hospital could choose to join the initiative at any time.

This is just one of many approaches to federated learning. The common thread through all approaches is that every participant gains global knowledge from local data — everybody wins.

Why Federated Learning?

Federated learning still requires careful implementation to ensure that patient data is kept secure. But it has the potential to tackle some of the challenges faced by approaches that require the pooling of sensitive clinical data.

For federated learning, clinical data doesn’t need to be taken outside an institution’s own security measures. Every participant keeps control of its own clinical data.

As this makes it harder to extract sensitive patient information, federated learning opens up the possibility for teams to build larger, more diverse datasets for training their AI algorithms.

Implementing a federated learning approach also encourages different hospitals, healthcare institutions and research centers to collaborate on building a model that could benefit them all.

How Federated Learning Can Transform Industries

Federated learning could revolutionize how AI models are trained, with the benefits also filtering out into the wider healthcare ecosystem.

Larger hospital networks would be able to work better together and benefit from access to secure, cross-institutional data. While smaller community and rural hospitals would enjoy access to expert-level AI algorithms.

It could bring AI to the point of care, enabling large volumes of diverse data from across different organizations to be included in model development, while complying with local governance of the clinical data.

Clinicians would have access to more robust AI algorithms, based on data that represents a wider demographic of patients for a particular clinical area or from rare cases that they would not have come across locally. They’d also be able to contribute back to the continued training of these algorithms whenever they disagreed with the outputs.

Healthcare startups could bring cutting-edge innovations to market faster, thanks to a secure approach to learning from more diverse algorithms.

Meanwhile, research institutions would be able to direct their work toward actual clinical needs, based on a wide variety of real-world data, rather than the limited supply of open datasets.

Large-scale federated learning projects are now starting, hoping to improve drug discovery and bring AI benefits to the point of care.

MELLODDY, a drug-discovery consortium based in the U.K., aims to demonstrate how federated learning techniques could give pharmaceutical partners the best of both worlds: the ability to leverage the world’s largest collaborative drug compound dataset for AI training without sacrificing data privacy.

King’s College London is hoping that its work with federated learning, as part of its London Medical Imaging and Artificial Intelligence Centre for Value-Based Healthcare project, could lead to breakthroughs in classifying stroke and neurological impairments, determining the underlying causes of cancers, and recommending the best treatment for patients.

In the context of financial services, federated learning can be applied to train a model using data from several banks to estimate individual transaction risk scores while keeping personal information locally at the banks.

Fraud detection is an important federated learning use case for banking and insurance. Institutions can harness data from user accounts and fraud cases to create better fraud-detection models without sacrificing user data privacy.

This can be challenging without federated learning, considering data privacy protection laws such as the EU’s GRPR, China’s PIPL and the recent EU AI Act, which prohibits cross-border data sharing. With federated learning, financial institutions can comply with these laws and regulations while using rich, private datasets for better, safer outcomes.

NVFlare can be used with XGBoost and Kaggle’s Credit Card Fraud Detection dataset for securing credit card transactions and with graph neural networks (GNNs) for financial transaction classification.

Federated learning is also applicable in use cases such as federated data analytics on edge medical devices, cross-board data training with autonomous vehicle models and drug discovery. Driven by data privacy regulations, the need to build better models with more private data, as well as the generative AI boom, the adoption of federal learning is accelerating.

Learn more about NVFlare. Explore more about federated learning on related NVIDIA technical blogs. And discover the science behind the approach, in this paper.

 

AI Is a 5-Layer Cake

by Jensen Huang

Now Live: The World’s Most Powerful AI Factory for Pharmaceutical Discovery and Development

by Rory Kelleher

NVIDIA Brings AI-Powered Cybersecurity to World’s Critical Infrastructure

Akamai, Forescout, Palo Alto Networks, Siemens and Xage Security integrate NVIDIA accelerated computing and AI to advance OT cybersecurity.
by Itay Ozery

As technologies and systems become more digitalized and connected across the world, operational technology (OT) environments and industrial control systems (ICS) — from energy and manufacturing to transportation and utilities — are increasingly depending on enterprise networks and the cloud. This expands OT and ICS capabilities — but also their exposure to cyber threats.

Unlike traditional IT environments that manage data and applications, OT systems control real-world processes where cyber incidents can have immediate consequences for safety, availability and operational continuity.

Many of these systems were originally designed for reliability and longevity, not for today’s threat techniques. This can widen the gap between modern attacks and existing defenses. Even as OT and ICS environments modernize with improved automation, connectivity and analytics, most were not built to withstand adaptive, software-driven cyberattacks that evolve in real time.

NVIDIA is collaborating with leading cybersecurity providers Akamai, Forescout, Palo Alto Networks and Xage Security, as well as industrial automation innovator Siemens, to bring accelerated computing and AI to OT cybersecurity, advancing real-time threat detection and response across critical infrastructure.

These efforts represent a fundamental shift in OT and ICS cybersecurity, where security is embedded into and distributed across infrastructure, enforced at the edge and coordinated through centralized, AI-driven intelligence, bringing modern cybersecurity to the systems that keep the physical world running.

Forescout and NVIDIA Bring Zero Trust to OT and ICS Environments

Zero trust is a security model that removes implicit trust from networks. Every user, device and workload must be continuously verified and authorized, regardless of where it originates.

While zero trust has been widely adopted to secure enterprise IT environments, applying its principles to OT environments has traditionally been difficult. Legacy devices, proprietary protocols and safety-critical operations limit the use of intrusive controls or AI-driven enforcement, even as increased connectivity to IT and cloud environments expands the attack surface.

Forescout is working with NVIDIA to make zero trust practical for OT. Forescout provides continuous, agentless discovery and classification of OT, internet of things and IT assets, delivering real-time risk assessment and policy-based enforcement. With deep visibility into network activity, Forescout applies network segmentation to contain lateral movement and enforce zero trust controls precisely where they matter most, without impacting operations.

At the industrial edge, NVIDIA BlueField DPUs run security services on dedicated hardware, keeping protection separate from operational systems so critical processes remain unaffected.

Siemens and Palo Alto Networks Embed Security Into Industrial Automation

Industrial automation environments demand consistent performance, low latency and high availability — requirements that traditional IT security tools often struggle to meet.

At the S4x26 security conference, Siemens will demonstrate its AI-ready Industrial Automation DataCenter, a unified, holistic solution that consolidates decades of cross-industry automation expertise into one robust IT/OT platform. The future-proof solution contains all the core elements of an edge data center such as computing based on virtualization, data archiving and reporting, resilient disaster recovery solutions, and a robust cybersecurity architecture in accordance with IEC 62443. Through the integration of NVIDIA BlueField, it is uniquely possible to deliver a truly AI-ready, zero-trust solution tailored for the demands on industrial automation.

Prisma AIRS AI Runtime Security delivers deep visibility into industrial traffic and continuous monitoring for abnormal behavior. By running these security services on NVIDIA BlueField, inspection and enforcement happen directly at the infrastructure level, closer to the workloads. This AI-powered approach strengthens security coverage and drives greater operational uptime where it matters most.

Akamai Extends Segmentation to OT and ICS With NVIDIA

Akamai Technologies has extended the Akamai Guardicore Platform to now run on NVIDIA BlueField, enabling agentless segmentation — the ability to isolate applications, devices or workloads into tightly controlled security zones — and the ability to enforce zero-trust policies directly at the edge. This removes the need for agents that may not be compatible with legacy OT systems or safety-certified devices.

Segmentation is enforced at full network speed directly within the infrastructure, without introducing latency or disrupting time-sensitive workloads in centralized data centers or remote edge locations. This helps contain threats quickly, limit their spread and keep mission-critical operations running smoothly.

Xage Security Protects the Energy Infrastructure That Powers AI With NVIDIA

As AI scales into a pillar of critical infrastructure, securing the energy systems that power AI factories is as essential as securing the compute itself.

Modern energy supply chains are complex, distributed and deeply interconnected with AI operations, and they operate largely within the operational technology domain. In this environment, cyber-physical systems, legacy assets and real-time controls demand security approaches purpose-built for critical infrastructure protection.

Xage Security is working with NVIDIA to help address this need by bringing zero-trust security to both energy infrastructure and the AI systems it supports. At S4x26, Xage will demonstrate a new integration running on NVIDIA BlueField, showing how zero trust enforcement can be embedded directly into energy and AI infrastructure environments.

Xage already protects about 60% of U.S. midstream pipeline infrastructure and works with utilities and energy operators worldwide. By combining Xage’s distributed, identity-based security platform with NVIDIA BlueField, operators can protect energy assets, manage third-party access and secure AI-driven operations at scale without compromising performance, reliability or resilience.

A New Class of OT Cybersecurity

Across these environments, a consistent OT cybersecurity architecture is taking shape. Security services run at the edge on NVIDIA BlueField DPUs, close to the operational systems they protect. By executing inspection and enforcement on dedicated, hardware-isolated infrastructure, BlueField enables continuous protection without disrupting time-sensitive operations.

OT data generated at the edge is sent to centralized AI factories, where it’s analyzed across many sites to identify patterns, anomalies and emerging threats. In addition, security actions are enforced locally at the edge, while insights are shared centrally — creating a coordinated defense that improves visibility, accelerates response and scales protection consistently across OT and IT environments.

This architecture helps detect and contain threats faster while strengthening resilience across distributed environments, maintaining consistent performance and protecting uptime.

The result is a new standard for securing critical infrastructure — where AI-driven protection and operational excellence move forward together.

NVIDIA-powered OT cybersecurity solutions are delivered through a global ecosystem of trusted partners. Read this OT cybersecurity use case and solution overview for more.

Join NVIDIA at S4x26, running Feb. 24–26 in Miami, to see how accelerated computing and AI are transforming cybersecurity for OT and critical infrastructure.