The tech takeoff: Altimeter's Brad Gerstner on Nvidia, the Cerebras IPO and the future of AI
By CNBC Television
Key Concepts
- Inference: The process of running a trained AI model to generate predictions or "tokens."
- Tokens: The fundamental units of output in AI models; the production and consumption of these tokens represent the core of AI intelligence.
- Wafer-Scale Engine: A specialized chip architecture (pioneered by Cerebras) that places memory directly on the chip to overcome the "memory wall."
- Memory Wall: A technical bottleneck where the speed of moving data between memory and compute limits overall performance.
- FLOPS (Floating Point Operations Per Second): A measure of computer performance, critical for the pre-training phase of Large Language Models (LLMs).
- Latency: The time delay between a request and the generation of an AI response; low latency is critical for real-time AI applications.
1. Market Overview and NVIDIA’s Performance
The market is currently experiencing significant momentum, with the S&P 500 and NASDAQ hitting new highs and the Dow Jones reclaiming the 50,000 level. NVIDIA, a dominant force in the AI sector, has reached a new record high.
- Valuation and Growth: Despite being a market leader, NVIDIA’s stock remained stagnant at approximately $180 for six months, trading at a relatively low multiple (14–15x GAAP earnings) compared to the broader semiconductor sector.
- Market Sentiment: Analysts remain highly optimistic, with price targets from firms like UBS ($275) and Cantor Fitzgerald ($350).
- China Market: While there is interest in NVIDIA’s ability to sell H200 chips in China, experts argue this is "de minimis" (negligible) to NVIDIA’s overall growth, which is driven by massive demand for Blackwell and Vera Rubin architectures.
2. The Shift from Training to Inference
A major theme discussed is the industry's transition from "pre-training" LLMs to "inference."
- The Industrial Revolution of AI: Experts compare the current AI boom to an industrial revolution, suggesting that the demand for inference will grow by a "billion-fold."
- The Role of Inference: As AI agents and complex reasoning models (like those using "chain of reasoning") become standard, the demand for low-latency token production has exploded.
- Economic Reality: Five major S&P 500 companies recently noted that the cost of compute and token production is becoming a material factor in their enterprise profitability.
3. Competitive Landscape: Cerebras vs. NVIDIA
The discussion highlights the IPO of Cerebras as a significant event, representing a different approach to AI hardware.
- Differentiation: While NVIDIA dominates with its GPU clusters, Cerebras focuses on "inference factories" using wafer-scale chips.
- The Memory Wall: Cerebras’s competitive advantage lies in its ability to place memory directly on the wafer-scale chip, solving the "memory wall" problem that limits traditional architectures.
- Market Capacity: The speakers argue that the AI market is large enough to support multiple winners. The primary constraint for both NVIDIA and Cerebras is not necessarily competition, but the physical availability of power and data center infrastructure to support the massive compute demand.
4. Investment Philosophy and Strategy
- Hiding in Plain Sight: The speakers emphasize that NVIDIA was a "buy" for months before the recent rally, yet many investors failed to capitalize on it.
- Long-term Conviction: The investment in Cerebras, made 8–9 years ago, was considered a "long shot" based on the thesis that the industry would eventually require specialized hardware to handle the massive data movement required for AI.
- Market Efficiency: The speakers suggest that if a company delivers on its performance promises, the market will eventually correct its valuation, or the company will be forced to buy back its own shares.
5. Notable Quotes
- On the scale of AI growth: "Inference is about ready because of chain of reasoning... It's about to go up by a billion times." — Jensen Huang (referenced in clip)
- On the nature of AI: "There is no intelligence... no consumer ChatGPT... no enterprise intelligence... without the production of tokens." — Panelist
- On market competition: "In an industrial revolution... it's not going to be one winner. There's going to be many big winners." — Panelist
Synthesis and Conclusion
The current market environment is defined by a massive shift toward AI inference, moving beyond the initial phase of model pre-training. While NVIDIA remains the dominant player, the emergence of companies like Cerebras highlights a race to solve the "memory wall" and reduce the cost of compute. The primary bottleneck for the industry is no longer just software capability, but the physical infrastructure—specifically power and data center capacity—required to sustain the exponential growth of token production. Investors are encouraged to look for companies that provide essential, purpose-built solutions to these scaling challenges.
Chat with this Video
AI-PoweredLoad the transcript when you're ready to chat so the initial page stays lighter.