Pull to refresh
Logo
Daily Brief
Following
Why Ranks Sign Up
The race to build non-Nvidia AI inference chips

The race to build non-Nvidia AI inference chips

Money Moves

A wave of startups raises billions on bets that custom silicon can run frontier AI models faster and cheaper than GPUs

May 13th, 2026: Fractile closes $220M Series B

Overview

Nvidia sells roughly four out of every five chips running today's large AI models. Investors are now writing nine-figure checks on the bet that the workload coming next, running those models rather than training them, will move to different silicon.

On May 13, London-based Fractile closed a $220 million Series B led by Accel, Factorial Funds and Founders Fund. It joins Groq, Cerebras, Etched and a half-dozen other startups promising 10x to 25x gains on inference cost or speed. Frontier labs including Anthropic are shopping for alternatives, giving the challengers a credible path to revenue if their chips ship.

Why it matters

If these chips work, the cost of running frontier AI drops sharply, and Nvidia's near-monopoly on the most profitable workload in computing weakens.

Questions about this story

No questions yet — be the first to ask.

Play on this story Voices Debate Predict

Key Indicators

$220M
Fractile Series B
Round led by Accel, Factorial Funds and Founders Fund, with Conviction, Gigascale, Felicis and 8VC.
25x
Claimed inference speedup
Fractile says its first chip will run frontier models 25 times faster than current GPUs.
~90%
Claimed cost reduction
Fractile targets running frontier inference at roughly one-tenth the cost of GPU-based systems.
2027
Fractile launch year
Capital funds chip tape-out, software stack and early customer integrations ahead of a 2027 product launch.
~80%
Nvidia data-center AI share
Industry estimate of Nvidia's share of AI accelerator revenue — the share challengers are trying to chip away at.

Voices

Curated perspectives — historical figures and your fellow readers.

Ambrose Bierce

Ambrose Bierce

(1842-1914) · Gilded Age · wit

Fictional AI pastiche — not real quote.

"Nine-figure fortunes staked on the certainty that the king of a hill will presently be displaced — this is not investment but rather the ancient ritual of ambitious men paying to watch other ambitious men fail. That four in five chips bear one maker's mark is called a monopoly by the envious and an ecosystem by the beneficiary; that investors now wager two hundred millions on the word "inference" suggests they have mastered the vocabulary of the future without troubling themselves to understand it."

Ever wondered what historical figures would say about today's headlines?

Sign up to generate historical perspectives on this story.

Play

Exploring all sides of a story is often best achieved with Play.

Log in to play. Track your picks, climb the leaderboards. Log in Sign Up
Predict 5 ways this could play out. Contrarian picks score more — points lock when the scenario resolves. Log in to play
Timeline Five events from this story — drag them oldest to newest. Log in to play
Connections Sixteen names from the news. Find the four hidden groups of four. Log in to play

People Involved

Organizations Involved

Timeline

May 2016 May 2026

7 events Latest: May 13th, 2026 · 1 month ago
Tap a bar to jump to that date
  1. Fractile closes $220M Series B

    Latest Funding

    London-based Fractile raises $220M led by Accel, Factorial Funds and Founders Fund. Capital funds chip tape-out and software ahead of a 2027 commercial launch targeting 25x faster, ~90% cheaper frontier inference.

  2. Anthropic, Amazon deepen Trainium partnership

    Customer deal

    Amazon and Anthropic announce an expanded deal that puts more Claude inference on Amazon's Trainium chips. It is the clearest signal yet that frontier labs are willing to move workloads off Nvidia.

  3. Cerebras files for IPO

    Corporate filing

    Cerebras Systems files S-1 paperwork to go public, the first major Nvidia challenger to attempt the public markets. The offering is later delayed by regulatory review.

  4. Groq raises $640M at $2.8B valuation

    Funding

    Groq closes a major growth round led by BlackRock to scale its inference cloud and Language Processing Unit chips.

  5. Etched raises $120M for transformer-only chip

    Funding

    Etched closes a $120M round to build Sohu, a chip designed to run only transformer-architecture models. The bet: betting on a single architecture buys huge efficiency gains.

  6. Fractile founded in London

    Company formation

    Walid Mehri co-founds Fractile to build chips designed for AI inference, drawing on neural-network hardware research from Oxford.

  7. Google reveals the TPU at I/O

    Technology

    Google announces it has been running custom AI silicon, the Tensor Processing Unit, in production. It is the first public proof that purpose-built chips can outperform GPUs on AI workloads.

Historical Context

3 moments from history that rhyme with this story — and how they unfolded.

May 2016

Google launches the TPU (2016)

At Google I/O, Google revealed it had been running a custom AI chip called the Tensor Processing Unit in its data centers since 2015. It was the first time a major operator publicly claimed that purpose-built silicon could beat Nvidia GPUs on AI workloads at scale.

Then

The TPU opened a credible alternative path for AI compute and validated the thesis that workload-specific chips could compete with general-purpose GPUs.

Now

TPUs became central to Google's AI infrastructure and inspired a generation of custom-silicon efforts at Amazon (Trainium, Inferentia), Microsoft (Maia) and Meta (MTIA). It also made the inference chip startup category investable.

Why this matters now

Every Fractile, Groq and Etched pitch deck traces back to the TPU's central claim: GPUs are not the right shape for AI, and a purpose-built chip can win. The 2026 funding race is the venture-backed extension of that 2016 idea.

1995-2002

The 1990s graphics chip wars

Through the late 1990s, a crowded field of graphics chip makers, including 3dfx, ATI, Matrox, S3, Trident and a young Nvidia, fought to define the PC 3D graphics market. Each company pitched a different architecture and a different bet on what gamers and developers would adopt.

Then

Pricing collapsed, marginal players failed, and the market consolidated faster than investors expected. 3dfx, the early leader, went bankrupt by 2002.

Now

Two winners, Nvidia and ATI (later AMD), emerged with durable share. The lesson: in chip categories with high R&D costs and software lock-in, late-cycle consolidation is brutal and most well-funded entrants do not survive.

Why this matters now

Today's inference chip field looks structurally similar: many funded entrants, competing architectures, no clear winner, and a software moat held by the incumbent. The 1990s suggest the next five years will end with two or three survivors, not ten.

November 2019

Amazon launches AWS Inferentia (2019)

Amazon unveiled Inferentia, a custom inference chip built in-house for AWS, and later added Trainium for training. The chips were aimed at lowering Amazon's own compute costs and offering customers a cheaper alternative to Nvidia inside AWS.

Then

Inferentia gained limited adoption initially as customers stuck with familiar GPU tooling.

Now

By the mid-2020s, Trainium and Inferentia were central to AWS's AI pitch and underpinned the deeper Amazon-Anthropic partnership announced in 2024. Custom hyperscaler silicon proved viable at scale.

Why this matters now

Hyperscalers can and do build their own inference chips. That sets a ceiling on how much of the inference market the independent startups can address: the biggest buyers may bring the workload in-house rather than buy from Fractile or Groq.

Sources

(4)