OpenAI's First Custom Chip: 'Jalapeño,' Built With Broadcom for LLM Inference
OpenAI and Broadcom unveiled Jalapeño on June 24, 2026 — OpenAI's first custom AI accelerator, designed in-house for LLM inference and manufactured by Broadcom. Co-developed from design to tape-out in nine months, it reportedly cuts inference costs ~50% versus typical AI GPUs, with first deployment targeted by end of 2026. Here's what it means for OpenAI's Nvidia reliance.

OpenAI just moved into silicon. On June 24, 2026, OpenAI and Broadcom unveiled “Jalapeño,” OpenAI’s first custom AI accelerator — an “Intelligence Processor” architected in-house specifically for LLM inference and built by Broadcom. The chip reportedly delivers around 50% cost savings versus typical AI GPUs, and it’s the first of a multi-generation compute platform the two companies are building together.
Fast-moving story. Performance, cost and timeline figures below reflect the June 24, 2026 announcement and company statements. Real-world results depend on deployment at scale — treat the numbers as point-in-time vendor claims until independently benchmarked.
At a glance
| Detail | |
|---|---|
| Name | Jalapeño — OpenAI’s first “Intelligence Processor” |
| Partner / manufacturer | Broadcom |
| Purpose | LLM inference (serving ChatGPT and other models) |
| Cost claim | ~50% cheaper than typical AI GPUs (per Broadcom CEO Hock Tan) |
| Design cycle | Concept to tape-out in ~9 months |
| First deployment | Targeted by end of 2026, expanding in following years |
| Strategic goal | Reduce reliance on external GPU suppliers (e.g., Nvidia) |
What happened
OpenAI and Broadcom jointly announced Jalapeño, described as an accelerator “architected around OpenAI’s vision for the future of LLM inference” — and the first AI accelerator in a multi-generation platform the companies plan to build together. The chips will be manufactured by Broadcom and used by OpenAI for inference: the compute-intensive job of serving models to users in ChatGPT and other applications.
According to Broadcom CEO Hock Tan, the accelerator is showing cost savings of roughly 50% compared with typical AI GPUs. The companies are aiming for initial deployment by the end of 2026, “expanding in the years ahead,” and tie it to bringing gigawatt-scale data centers online with Microsoft and other partners.
The nine-month sprint
One of the most striking claims is speed. Jalapeño went from initial design to manufacturing tape-out in just nine months — what the companies call one of the fastest ASIC development cycles ever for high-performance advanced semiconductors.
They credit deep software-hardware co-development: OpenAI’s engineering teams shaping the architecture around its own models, Broadcom’s silicon-implementation expertise, and — notably — the use of OpenAI’s own models to accelerate parts of the design and optimization process. In other words, AI helping design the chip that will run AI.
Why inference is the target
OpenAI didn’t aim its first chip at training — it aimed at inference, and that’s the tell. Training is bursty and research-driven; inference is the relentless, every- request cost of running a product used by hundreds of millions of people. Shaving ~50% off the cost of serving each token compounds enormously at OpenAI’s scale.
By designing the accelerator narrowly around how its own models actually run, OpenAI can strip out general-purpose overhead that a do-everything GPU has to carry — trading flexibility for efficiency on the one workload that dominates its bill.
Why it matters
1. Less dependence on a single supplier. Custom silicon is OpenAI’s clearest move yet to reduce reliance on external hardware providers like Nvidia for its core infrastructure — diversifying supply and gaining leverage on price and roadmap.
2. Pressure on GPU pricing power. A credible, ~50%-cheaper inference chip from one of the largest AI buyers puts the incumbent’s pricing power on notice. Even if OpenAI keeps buying GPUs, having its own option changes the negotiation.
3. A bet on owning the stack. Pairing custom chips with gigawatt-scale data centers signals OpenAI wants to control more of its compute destiny end-to-end — the same vertical-integration playbook the largest hyperscalers have run with their own in-house accelerators.
Bottom line
Jalapeño is OpenAI’s first real step into hardware: an in-house-designed, Broadcom-built inference chip, taped out in a remarkable nine months, claiming ~50% cost savings over typical AI GPUs, with first deployment targeted for end of 2026. The strategic message is louder than the spec sheet — OpenAI wants to own more of its compute and lean less on outside suppliers. The real test comes when these chips run production traffic at scale; until then, the cost and performance claims are the company’s to prove.
Sources: OpenAI and Broadcom announcements and press coverage from June 24, 2026, including CNBC, Bloomberg and The Decoder. Performance and cost figures are company- stated and point-in-time; await independent benchmarks before relying on them.
Related articles

AI-Rally Jitters: Tech Stocks Tumble as Micron Earnings Become the Big Test
Tech stocks slid hard around June 24, 2026, with the Nasdaq 100 on track to shed over $1 trillion as Nvidia, AMD and Tesla led the selling. Investors treated Micron's Q3 report — due June 24 — as a pulse check on whether the AI memory boom can keep justifying record valuations. Here's what moved and why.

Global Tech Selloff Deepens: Chip and AI Stocks Lead a Worldwide Rout
A broad tech rout swept global markets on June 23, 2026. South Korea's Kospi crashed ~10% on Samsung and SK Hynix, triggering a trading halt, before the selling hit Europe and the US — where memory and AI-chip names led the Nasdaq and S&P 500 lower. Here's what moved, why, and what to watch.

Micron Crushes It: Record $41.5B Quarter Reignites the AI Chip Trade
Micron reported blockbuster fiscal Q3 2026 results on June 24, 2026 — record revenue of $41.46 billion (up ~346% YoY) and $25.11 non-GAAP EPS, both crushing estimates — then guided to a stunning ~$50 billion next quarter. Shares jumped ~15%, SK Hynix and Samsung rallied, and the whole HBM/AI memory trade got its confidence back. Here's the breakdown.
Have a project or an idea?
We don't just write about software — we build it. Tell us what you're working on and we'll get back within 1–2 business days.