Akamai Inference Cloud deploys Nvidia AI Grid
Nvidia GTC 2026 is turning out to be a coming-out party for Akamai's Inference Cloud, the first global implementation of Nvidia AI Grid, which routes AI workloads across edge networks for lower latency, lower cost and improved performance.
Akamai has been advancing its Inference Cloud in recent weeks. Drawing on its expertise in distributed compute, Akamai first expanded into offering cloud workloads and has now focused on extending inference from data centers to the edge.
At Nvidia GTC 2026, Akamai said it has integrated Nvidia's infrastructure into its own to create a distributed grid for AI inference. The latest evolution of Akamai Inference Cloud, which received a shoutout in Nvidia CEO Jensen Huang's keynote, is the first to operationalize AI Grid. Nvidia GTC 2026 features a broad vision for AI inference and how it applies to agentic workloads.
"Our AI Grid intelligent orchestration gives AI factories a way to scale inference outward leveraging the same distributed architecture that revolutionized content delivery to route AI workloads across 4,400 locations, at the right cost, at the right time," said Adam Karon, Chief Operating Officer of Akamai's cloud unit.
The tight partnership with Nvidia and the focus on edge and inference put Akamai in a good position, given that its network is globally distributed. Akamai is winning workloads with caching and routing orchestration, egress allowances and 4,400 edge locations.
Akamai's bet is that Inference Cloud will gain AI workloads due to lower latency and costs and be seen as an enabler for AI agents.
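The routing idea described above can be sketched in code. The snippet below is a hypothetical illustration of latency- and cost-aware edge selection, not Akamai's actual orchestration logic; the location names, metrics and weights are all assumptions for the sake of the example.

```python
# Hypothetical sketch: pick an inference location by weighing measured
# latency against per-token cost. All names and numbers are illustrative.
from dataclasses import dataclass

@dataclass
class EdgeLocation:
    name: str
    latency_ms: float   # round-trip latency from the client to this site
    cost_per_1k: float  # inference cost (USD) per 1,000 tokens at this site

def route(locations, latency_weight=0.7, cost_weight=0.3):
    """Return the location minimizing a weighted latency/cost score."""
    def score(loc):
        # Scale cost into a range comparable to milliseconds of latency.
        return latency_weight * loc.latency_ms + cost_weight * loc.cost_per_1k * 100
    return min(locations, key=score)

sites = [
    EdgeLocation("frankfurt-edge", latency_ms=12, cost_per_1k=0.40),
    EdgeLocation("virginia-core", latency_ms=95, cost_per_1k=0.25),
]
best = route(sites)
print(best.name)  # the nearby edge site wins despite a higher per-token cost
```

A real orchestrator would add capacity, model availability and data-residency constraints to the score, but the trade-off it balances is the same one Akamai is pitching: serve the query close to the user when latency dominates, fall back to cheaper central capacity when it does not.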
Scaling Inference Cloud
The company's Inference Cloud is built in partnership with Nvidia and has a distributed architecture that uses Nvidia's Blackwell AI infrastructure. Akamai announced Inference Cloud in October and rolled out a series of updates in the months leading up to Nvidia GTC.
Inference Cloud is the latest step in a cloud plan that formed in 2023 and expanded with the help of Nvidia.
- Akamai's IaaS vision: Become your cloud alternative
- Akamai launches Gecko, aims to combine cloud compute, edge networks
- Akamai launches Akamai App Platform as it scales cloud business
Here’s a look at how Akamai has scaled Inference Cloud in recent months.
Inference Cloud launch. Inference Cloud pairs Nvidia RTX PRO Servers, featuring Nvidia RTX PRO 6000 Blackwell Server Edition GPUs, Nvidia BlueField-3 DPUs and Nvidia AI Enterprise software, with Akamai's distributed cloud computing infrastructure and global edge network.
Akamai said Inference Cloud was designed to extend AI factories to enable various agentic use cases and physical AI.
Use cases surge. In November, Akamai said the early use cases showing demand for Inference Cloud were 8K video workflows, live video intelligence, recommendation engines, assistive agents and AI-powered fitting room experiences.
Partnerships added. Akamai said it was partnering with Visa to bring identity, user recognition and security controls to agentic commerce. The company also said it launched a partner program for independent software vendors.
Scale and GPU acquisitions. Akamai said in March that it had acquired thousands of Nvidia Blackwell GPUs to scale its Inference Cloud. The company also outlined the technical details of a $200 million services agreement inked with a US tech company. Akamai disclosed the deal on its fourth quarter earnings call.
"We believe the AI market is entering a critical transition point, the first inning of a long game to come, where inference or the execution of queries against a trained model is the new frontier. This requires purpose-built infrastructure to enable distributed low-latency, globally scalable AI at the edge with response times measured in a few tens of milliseconds," said Akamai CEO Dr. Tom Leighton.