OpenAI Builds Its Own AI Chip in Latest Challenge to Nvidia | eWeek

OpenAI Builds Its Own AI Chip in Latest Challenge to Nvidia

Introducing the Jalapeno AI chip from OpenAI.

Image: OpenAI

Jun 25, 2026
4 minute read
eWeek content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More

OpenAI wants to build more than AI models. Now it wants to build the chips that run them.

On Wednesday, OpenAI and Broadcom unveiled “Jalapeño,” the first custom artificial intelligence chip developed as part of a broader effort to create specialized infrastructure for the next generation of AI systems. The new processor is designed specifically for inference, the stage where AI models generate responses for users after training is complete. 

OpenAI says Jalapeño was built from the ground up to handle the demands of large language models, including the systems powering ChatGPT, Codex, API services, and future AI agents.

According to OpenAI, engineering samples of the chip are already running machine learning workloads in testing environments, including the company's GPT-5.3-Codex-Spark model. The company said early testing suggests the processor delivers significantly better performance per watt than current state-of-the-art alternatives.

A push to build the full AI stack

Jalapeño represents more than a new piece of hardware for OpenAI.

The company has increasingly emphasized its ambition to control more layers of the AI ecosystem, from models and products to infrastructure. OpenAI said the chip is part of a long-term strategy to build "the full stack" behind its AI services, allowing tighter integration between software and hardware.

In a statement, OpenAI President and Co-Founder Greg Brockman said, "Jalapeño is part of our long-term full-stack infrastructure strategy to make compute more abundant, resulting in AI which is faster, more reliable, more affordable for people and businesses, and can be used to solve more important problems."

The company argues that designing its own hardware allows it to optimize every layer of the system, from chip architecture and networking to deployment software and user-facing products, around the specific requirements of frontier AI models.

Built in just nine months

One of the most notable aspects of the project is the speed at which it was completed.

OpenAI and Broadcom said Jalapeño progressed from initial design to manufacturing tape-out in approximately nine months, a timeline the companies believe may represent the fastest ASIC development cycle achieved for a high-performance advanced semiconductor.

OpenAI said the same AI systems used by customers today helped engineers improve parts of the chip development process, offering a glimpse into how AI could increasingly contribute to the design of future computing hardware.

Reading between the lines

Strip away the branding, and Jalapeño says something specific about where OpenAI thinks the AI business is actually won or lost: not in training bigger models, but in the unglamorous cost of running them at scale, over and over, for hundreds of millions of users. 

Training a model happens once. Inference happens every single time someone uses it. As ChatGPT and Codex usage keep climbing, that per-query cost adds up fast, and it's the part of the business OpenAI has the least control over today.

It's also a hedge against a very specific kind of risk: dependency. OpenAI is preparing for an IPO that could value the company in the trillion-dollar range, and investors backing that kind of valuation will want to see a path to real margins, not just usage growth.

A company that has to buy all its compute from one supplier, at that supplier's prices, has a much harder time controlling its own economics. Building proprietary silicon doesn't eliminate that exposure, but it gives OpenAI a second lever to pull.

Advertisement

What it means for the industry

For OpenAI's customers, businesses building on the API, developers using Codex, and everyday ChatGPT users, the promise is cheaper, faster, more available AI. Lower inference costs are exactly the kind of thing that doesn't show up as a flashy new feature but quietly makes products more responsive and more affordable to run at scale.

For Broadcom, this cements its position as the company hyperscalers call when they want custom silicon without building an in-house chip team from nothing. Broadcom already does this kind of design work for Google, and its stock is up about 10% so far this year and roughly sevenfold since the end of 2022. 

According to CNBC, it climbed further following the announcement.

For Nvidia, it's another data point in a trend that's been building for a while. Google has its TPUs, Amazon has Trainium, and now OpenAI has Jalapeño. None of these companies is abandoning Nvidia GPUs outright; demanding work like pre-training will likely continue to rely on Nvidia hardware for the foreseeable future, but the list of customers willing to build alternatives keeps growing. 

The caveats

It's worth being clear about what hasn't happened yet.

OpenAI has not published final performance benchmarks; the company says a detailed technical report is still months away. "Engineering samples" running in a lab are not the same as chips operating at scale in a live data center handling real user traffic. 

First-generation custom chips have a track record of underdelivering on their early promises; even experienced chipmakers often need a few hardware revisions before a design meets its specs in production.

Timing is also looser than the announcement might suggest. Broadcom's CEO described 2026 as a year for small prototype runs, with meaningful scale not arriving until 2027 and full-tilt production not expected until early 2028. That's a multi-year runway before Jalapeño meaningfully changes OpenAI's cost structure, plenty of time for Nvidia, Google, or AMD to ship their own next-generation answers.

Also read: OpenAI’s Patch the Planet program offers funding and technical support for open-source security projects, including work on software supply chains, memory safety, and critical infrastructure.

Aminu Abdullahi

Aminu Abdullahi is an experienced B2B technology and finance writer and award-winning public speaker. He is the co-author of the e-book, The Ultimate Creativity Playbook, and has written for various publications, including TechRepublic, eWEEK, Enterprise Networking Planet, eSecurity Planet, CIO Insight, Enterprise Storage Forum, IT Business Edge, Webopedia, Software Pundit, Geekflare and more.

eWeek Logo

eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site's focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.