Meta Was Running on Google’s AI – Until Google Said No

Google’s capacity limits in March 2026 forced Meta to ration AI tokens and accelerate its own Muse Spark moderation model

Jun 29, 2026

2 min read

Key Takeaways

Meta relied on Google’s Gemini for ads, moderation, and coding tools internally.
Google capped Meta’s compute access around March 2026 due to unprecedented demand.
Meta now invests up to $600 billion in data centers to reduce external AI dependency.

Behind Meta’s open-source AI announcements, a quieter story was unfolding. Customer service, advertiser chatbots, scam detection, content moderation, internal coding tools — all running, at least partly, on Google’s Gemini models. Then around March 2026, according to the Financial Times, Google told Meta it couldn’t supply the compute Meta wanted. The AI self-sufficiency narrative cracked like a phone screen on day one.

When Your Rival’s AI Keeps the Lights On

Meta’s internal AI stack relied heavily on Google’s Gemini — and the dependency ran deeper than anyone outside the company knew.

The scope of Meta’s Gemini dependency is striking. Sources told the Financial Times that Gemini was chosen because it was “performing better than Meta’s own models” for specific internal tasks. Meta also ran Anthropic’s Claude alongside Gemini and Llama — hedging bets across providers rather than committing to a single external source.

Here’s what the reporting reveals:

Gemini powered customer service, ad tools, coding assistance, and harmful content takedowns across Meta’s platforms
Google imposed capacity limits around March 2026 after Meta requested more compute than available
Meta’s demand was unusually high compared to other Google Cloud customers
Employees were told to conserve “AI tokens” — the units measuring how much model capacity each request consumes
Meta began shifting workloads to Muse Spark, its own internal model built for content moderation

The Token Wall Nobody Talked About

Global AI compute demand is now growing faster than even the largest infrastructure providers can build to meet it.

This isn’t Google punishing Meta. It’s structural. Google Cloud‘s AI backlog now exceeds $460 billion, according to Alphabet’s recent results. Its models process over 16 billion tokens per minute — up 60% quarter-over-quarter. Even after enormous data center investment, demand is outrunning supply. It’s the GPU shortage during the crypto boom, except now the scarce resource is inference capacity for models that half the Fortune 500 wants running simultaneously.

Meta’s response has been swift — possibly forced. The company is investing up to $600 billion in US data center capacity through 2028, building custom MTIA chips with Broadcom. This mirrors the ambition behind the Stargate Project, as the race to control AI infrastructure accelerates across the industry. Separately, per CNBC, Meta is replacing third-party human moderation vendors like Accenture and Concentrix with AI systems. Muse Spark is absorbing moderation duties that Gemini once handled.

The uncomfortable reality: “AI-native” is doing heavy lifting as a marketing term. Behind the keynotes, the biggest platforms are buying critical AI capacity from the same companies they’re racing against. Meta builds toward self-sufficiency, but Meta’s token-rationing memo is this era’s equivalent of discovering your favorite restaurant sources its signature bread from the shop next door. What happens next depends entirely on who controls the ovens — and right now, that list is very short.