sdlcnext.com
← All posts
moonshots ai-infrastructure semiconductors cerebras google-io

Moonshots Ep. 256: Money Can't Buy the Frontier

Cerebras IPO'd at $95B, Karpathy joined Anthropic, Google spent $190B in capex. Episode 256's actual lesson is that the moat is tacit knowledge, not capital.


Viewpoint

Cerebras IPO’d at a $95 billion market cap, up 68 percent on day one, the biggest US tech IPO since Uber in 2019. Andrej Karpathy publicly job-hunted on a podcast, then joined Anthropic three weeks later. Google announced $180 to $190 billion of 2026 capex at I/O. Every one of those moves looks like a different story. They are the same story.

Episode 256 is the panel’s annual Google I/O recap with Cerebras CEO Andrew Feldman in studio. Sundar dropped numbers that should have been the headline: 3.2 quadrillion tokens a month, 7x year-over-year, 13 products with over a billion users, more than a million TPUs across multi-site training clusters. Inside the same week, a federal jury threw out Elon’s OpenAI suit in two hours of deliberation. Karpathy moved labs. Andrew rang the NASDAQ bell. The throughline Feldman kept pulling the panel back to is unsexy and almost impossible to fake. The frontier is gated by tacit knowledge that cannot be bought.

Capital is the easy part now

OpenAI raised $120 billion in cash. Anthropic is on a similar trajectory. Google is putting $190 billion of capex into the ground in a single year. Cerebras came public with $5.5 billion in fresh equity on top of a $20 billion OpenAI supply deal signed in December 2025 and an AWS term sheet in March. The war chests are no longer different in kind. The labs that were called cooked a year and a half ago are sitting on more cash than the ones doing the cooking.

That changes the question. If money is no longer the constraint, then the next two years get decided by something else. Feldman’s framing is direct: “Money and the acquisition of talent isn’t enough.” His proof points are uncomfortable. Intel said no to Apple’s cell-phone chip and destroyed tens of billions of shareholder value chasing the consequence. NVIDIA failed at Snapdragon, failed at Northbridge, and succeeded beyond expectations at exactly one thing. The Yankees and the big-money NFL teams don’t always win. There is something inside an organization that has to be built and cannot be hired in.

Capital abundance versus tacit-knowledge scarcity stacked vertically

Cerebras’ 18-month failure proves it

The Cerebras founding story is the cleanest version of the argument anyone made on the show. The team made two contrarian bets in 2015: AI would need dedicated silicon the way graphics needed GPUs, and the right answer was a clean-sheet design rather than a GPU derivative. Both bets were laughed at. The team then committed to a chip the size of a dinner plate, 58 times larger than any chip ever fabbed, stuffed with 40 to 50 gigabytes of SRAM. Four years. Roughly $450 million. Even Gene Amdahl had failed at this before.

The part Feldman lingered on is the 18-month stretch where they were spending $8 million a month and could not solve a single packaging problem. Board meeting every six weeks. Still cannot solve it. $100 million in the hole. $120 million. $140 million. They solved it in 2018. Seven years later TSMC and NVIDIA hit the same coefficient-of-thermal-expansion wall on the CoWoS process and pushed B200 18 months late. Pioneering work surfaces problems early. The grit to keep paying $8 million a month into a black box is the part of the moat that doesn’t get described well in pitch decks.

The world ignored the result. First-generation Cerebras systems shipped 12 units. Second generation, 300 to 350. Third generation, many thousands, then a $20 billion OpenAI deal. The inflection was not their engineering. It was the inference market arriving in late 2024 and early 2025 and finding them already sitting at the only available answer.

The talent gravity well

Karpathy joining Anthropic is the clip the panel kept circling. He sat on No Priors and openly job-hunted: “If you’re outside of that frontier lab, your judgment fundamentally will start to drift because you’re not part of what’s coming down the line.” Three weeks later he was on Anthropic’s pre-training team running a Claude-accelerates-Claude initiative.

Dave’s read landed hardest. Every other OpenAI co-founder has either raised billions or runs a frontier lab. Karpathy was the last one without access to the big machine, and an open-source auto-research repo was no substitute. Feldman generalised it to hardware: if you are not building silicon engaged at a fundamental level with one of Google, Anthropic, or OpenAI, your hardware will drift away from what they need just as surely as a researcher’s instincts drift away from the curve. Salim noted the timing. Karpathy was announced on the same day as Google I/O. The choice of Anthropic over OpenAI or DeepMind is its own data point about where the smartest people think the inference-train compounding loop is tightest.

The labs that win do not win because they bought Karpathy. They win because there was a place for him to go that he wanted to be at. That is also tacit and almost impossible to manufacture.

Talent gravity flowing toward the three labs with the tightest compounding loops

Terra Fab is a 20-year project, not a 5-year one

Elon’s Terra Fab pitch is to produce 50 times the chips on the planet, outstripping TSMC. Feldman, who knows this domain better than anyone on the panel, was respectful and clear: “It is not a five- or ten-year project in my humble view. I’ve been wrong before, but I put this at a 15- or 20-year project.”

The reason is the part that gets glossed over. Even with the exact same ASML equipment, Samsung is not at the same node as TSMC. The amount of received wisdom and learning that gets baked into a fab across generations cannot be bought from the equipment vendor. The packaging supply chain that surrounds the fab matters as much as the fab itself. RDL deposition at ASE. Materials manufactured in Japan, including Kyocera. The US lost this whole ecosystem in the 90s when IBM and Global Foundries walked away from manufacturing, and nobody noticed because the silicon-as-virtual-resource framing won the VC narrative between 2007 and 2016.

Cerebras has committed its 3-nanometer design to TSMC. Some components are at Samsung. Intel is not yet a credible option for them, and Feldman likes Lip-Bu Tan personally. That is what the actual frontier looks like from inside a company that has shipped real wafers. The romance of pouring capital into a sovereign fab cannot rebuild a multi-generational packaging supply chain in five years.

The fab supply chain: lithography, packaging, materials, deposition

Google bought time but not the lead

Sundar’s keynote was the most aggressive full-stack performance Google has ever delivered. Gemini at 900 million monthly active users, more than doubled in a year. NanoBanana past 50 billion images. SynthID on 100 billion images and videos plus 60,000 years of audio, now adopted by OpenAI, Kakao, and Eleven Labs. Gemini Omni as the only credible American frontier multimodal model after OpenAI cut Sora and Anthropic walked away from anything that is not code. Gemini for Science with WeatherNext hurricane prediction and Isomorphic Labs running multiple preclinical immune and cancer programs.

The candid part nobody on the panel danced around: Silicon Valley scuttlebutt is that it is now a two-horse race between OpenAI and Anthropic for raw capability, and Gemini 3.5 Flash is solidly mid. Sundar compared Flash to 3.1 Pro instead of to frontier models. He picked throughput versus performance as the axis because that is the most flattering frame. Pro is another month out. Google DeepMind engineers are reportedly desperate for Claude Code access. Anti-gravity 2.0 is a Cursor fast-follow, Gemini Spark is an OpenClaude fast-follow, and the universal cart is a shot at Amazon retail that Amazon will not adopt.

What Google does have, and what nobody else can match, is the integration. Over a billion users on 13 different products. Search, Gmail, Sheets, YouTube, Android, Pixel. The Microsoft-versus-Netscape lesson Dave pointed to is exactly right. Google exists because the FTC stopped Microsoft from tying IE to Windows. Now Google is doing the same tying move, perfectly legally, with Gemini integrated across the entire stack. The capital and the distribution buy time. They have not bought the lead.

Sam saw the corner first, again

The line Feldman dropped that the other guests stopped to absorb was about Sam Altman locking down data center capacity in space “last year and the year before, when all the other foundation labs didn’t see it.” That is reporting from someone with quite a lot of visibility into who is buying compute and why.

Alex pulled the thread. If Elon’s Dyson swarm is 500,000 to a million satellites with a Starship launch every couple of minutes, then heavy launch becomes part of the AI compute stack the same way fabs are. Bezos and Blue Origin’s New Glenn is closer to Falcon 9 and does not hold a candle to Starship on cost per kilogram. A multi-channel strategy for Sam is SpaceX, Blue Origin, and a long-game bet that fabs themselves end up in low Earth orbit and on the moon. If you believe the singularity exponential, you do not need to be addicted to terrestrial heavy launch five to ten years from now.

Feldman’s softer take on orbital compute: production silicon in space is seven to ten years out. Communication and clustering are the hard parts, not radiation. Wafer-scale fault tolerance maps neatly to ionising-radiation hardening. Cerebras already shuts down faulty cores and routes around them. The advantage is real. The timeline is not 2027.

What’s actually being decided

Karpathy moved because the curve is steepest where the tacit knowledge is densest. Cerebras IPO’d because they grit-paid $8 million a month into an unsolved problem for 18 months and emerged with the only chip on Earth that ships 1,000 tokens per second on Kimi K2 across a handful of wafers. Google is throwing $190 billion of capex at a stack that catches up on infrastructure but does not yet catch up on capability. Sam is buying spectrum, orbit, and option value years before everyone else realises those are the constrained inputs.

The honest counterargument is that capital still matters and that some of these gaps close with brute force. NVIDIA built the Bay Area’s most expensive talent machine and nobody is going to catch them on aggregate GPU throughput this decade. TSMC is a $50 billion, six-year project per fab, and that is exactly why the lead compounds. Recursive self-improvement from frontier labs designing chips that train the next generation of frontier models could collapse the timelines that the panel quoted in years.

Both can be true. Capital is necessary and not sufficient. The teams that win the next two years are the ones who built the institutional muscle to keep paying $8 million a month into an unsolved problem and saw, before anyone else, that the input nobody was buying was the one about to become scarce.

What Episode 256 actually argued is that the frontier has stopped being a leaderboard. It is a question of which organisations have the patience, the supply-chain depth, and the talent gravity to compound for a decade. Andrew Feldman just got rewarded for compounding. The next decade rewards the same trait at higher stakes.


Sources

Comments

Loading comments…

Leave a comment