OpenAI, Broadcom unveil first AI inference chip

Published June 24, 2026

OpenAI and Broadcom launched their first AI accelerator called Jalapeno with plans to deploy by the end of the year. OpenAI's custom silicon effort is part of a plan to deliver a vertically integrated stack for LLMs.

In a statement, OpenAI said early testing indicates Jalapeno will deliver better performance per watt relative to the current state-of-the-art chips.

Jalapeno kicks off what's a multi-generation custom silicon plan for AI. According to OpenAI, the chip was built from a blank slate and "informed by its roadmap of models, kernels, serving systems, and product needs." Broadcom and Celestica will integrate the Jalapeno with board, rack systems, networking and production systems.

OpenAI said its chip will work with all LLMs. The company said:

"Jalapeño is designed with flexibility to work with all LLMs guided by OpenAI’s insights into the inference needs of current and future AI models across the industry. Engineering samples of the Jalapeño chip are running ML workloads in the lab at production target frequency and power, including GPT‑5.3‑Codex‑Spark."

OpenAI didn't provide performance data, but promised a detailed technical report in the months ahead.

Key points:

The architecture of the chip reduces data movement and balances compute, memory and networking.
Broadcom's silicon includes the Tomahawk networking chips.
OpenAI's bet is that it can be more efficient if it owns the stack top to bottom.
"The goal is to combine the power and throughput of today’s leading AI accelerators with latency closer to the fastest specialized inference systems, making Jalapeño well suited for interactive LLM products at scale," according to OpenAI.
Jalapeno was co-developed in about nine months. OpenAI noted that it used its own models to accelerate the design and optimization process.

A few observations:

OpenAI is following a well-worn playbook of building infrastructure to support its business. The effort rhymes with what Google, Meta and Amazon have done. Should OpenAI compete with Apple in the future (the company has a heavy dose of Google and Apple envy) custom silicon could be a differentiator.
The big question is whether custom silicon can pay off fast enough for OpenAI’s business model.
It’s notable that OpenAI noted that its custom silicon would work with any LLM. Could OpenAI become an infrastructure provider?
The diversification away from Nvidia continues.

Constellation Research analyst Holger Mueller said:

"LLM vendors have to bring down costs for AI and the place to partner is with the vendor (Broadcom) who has helped the leader in AI cost - Google. The pedigree is there and now OpenAI has to get the chip built. Installing Jalapeno in data centers will not be a challenge given OpenAI's market and purchasing power. This is a shot in front of the bow for Nvidia."

Goldman Sachs Q2: The AI takeaways for the top, and bottom lines

Data to Decisions

Goldman Sachs' stellar second quarter results provided a multi-layer view of AI's impact on revenue growth and productivity. Executives laid out a nuanced view of the AI boom....

Larry Dignan

Insights News

July 14, 2026

AWS fleshes out Security Hub with AI workload protection, Microsoft Azure support

Digital Safety, Privacy & Cybersecurity

Amazon Web Services outlined multi-cloud security management for Microsoft Azure as well as a set of GuardDuty tools to protect individual workloads. ...

Larry Dignan

Insights News

July 14, 2026

Bank of America's AI usage: 400,000 prompts a day, more than 300 use cases approved

Future of Work

Bank of America CEO Brian Moynihan said AI is driving productivity gains with more than 300 use cases approved with quot;new capabilities coming on every week.quot;...

Larry Dignan

Insights News

July 14, 2026

Is a FINRA-ish group the answer for frontier AI standards?

Data to Decisions

Dennis Hassabis, CEO of Google DeepMind, argued in a post on X that frontier AI models need a standard for new releases and FINRA (Financial Industry Regulatory Authority) serves a...

Larry Dignan

Insights News

July 14, 2026

IBM's Q2 falls short as budgets shift to server, storage, memory

Board Strategy

IBM said its second quarter earnings and revenue will fall short of expectations due to weaker demand for its software and infrastructure. CEO Arvind Krishna said customers shifted...

Larry Dignan

Insights News

July 14, 2026

Startups to know: Baz brings governance, planning, specs to AI-generated code

Future of Work

Baz, an agentic coding and AI code review platform, is aiming to move quality upstream in the AI software development lifecycle. Here's what you need to know....

Larry Dignan

Published

June 24, 2026

Author

Larry Dignan

OpenAI, Broadcom unveil first AI inference chip

Goldman Sachs Q2: The AI takeaways for the top, and bottom lines

AWS fleshes out Security Hub with AI workload protection, Microsoft Azure support

Bank of America's AI usage: 400,000 prompts a day, more than 300 use cases approved

Is a FINRA-ish group the answer for frontier AI standards?

IBM's Q2 falls short as budgets shift to server, storage, memory

Startups to know: Baz brings governance, planning, specs to AI-generated code

Published

Author

Research

Analyst Services

Videos

Communities

Events

Insights Live

OpenAI, Broadcom unveil first AI inference chip

Results

Published

Author

Business Themes

Vendors

Audience Role

Hot Topics

Related Blog Posts