Silicon Valley’s $2.1B Bet on the Next Superintelligence | ReflectionAI, Misha Laskin

Key Concepts

Artificial Super Intelligence (ASI): A hypothetical intelligence that surpasses human intelligence across virtually all fields.
Move 37: A pivotal moment in the AlphaGo vs. Lee Sedol match where AlphaGo made a seemingly erroneous move that later proved to be a stroke of genius, demonstrating AI's potential for creativity and novel strategy.
Autonomous Coding: The ability of an AI system to independently write and complete code from a given task, seen as a key to unlocking broader superintelligence.
Reinforcement Learning (RL): A type of machine learning where an agent learns to make decisions by taking actions in an environment to maximize a reward.
Large Language Models (LLMs): AI models trained on vast amounts of text data, capable of understanding and generating human-like language.
RLHF (Reinforcement Learning from Human Feedback): A technique used to align AI models with human preferences and values.
System One and System Two Thinking: A cognitive framework where System One is fast, intuitive, and emotional, while System Two is slow, deliberate, and logical.
Foundational Ingredients: Core scientific principles or technologies that enable subsequent advancements.

The Dawn of Artificial Super Intelligence and the Role of Autonomous Coding

The speaker, Misha, CEO and co-founder of Reflection, posits that humanity is on the cusp of Artificial Super Intelligence (ASI), a future where AI systems will surpass human capabilities across all domains. This vision is deeply informed by the "Move 37" incident during the AlphaGo match against Lee Sedol. This move, initially perceived as a mistake, was later revealed to be a profoundly creative and superior strategy, demonstrating AI's capacity for discovering novel approaches beyond human comprehension. Misha draws a parallel between this event and the potential for future AI systems to generate "move 37s" across all fields of knowledge work, expanding human creativity and understanding.

The Path to Super Intelligence: Autonomous Coding as the Key

Reflection's core belief is that solving the problem of autonomous coding is the most direct path to achieving broader superintelligence. This approach stems from the realization that two key ingredients have matured: Large Language Models (LLMs) for their broad general capabilities and Reinforcement Learning (RL) for scaling autonomy. The combination of these technologies, Misha argues, can lead to highly capable, intelligent, and autonomous systems, rather than just co-pilots or chat assistants.

From Physics to AI: A Journey of Foundational Discovery

Misha's personal journey into AI is rooted in a fascination with physics and its role as a foundational science. He was driven by the desire to work on the "root node" of scientific breakthroughs that enable future technologies, citing examples like the transistor enabling computers and relativity enabling GPS. However, during his PhD, he observed that impactful physics research was often decades old, leading him to seek a more dynamic field. The emergence of deep learning and the AlphaGo breakthrough in AI presented this opportunity. He self-taught AI for several months, recognizing deep learning and reinforcement learning as the foundational ingredients for the next generation of impactful technologies.

The Physics of Entrepreneurship: Identifying Core Principles

Misha draws a strong analogy between entrepreneurship and scientific research, emphasizing the importance of identifying and focusing on core principles. In research, this means reducing complex problems like autonomy to a few guiding principles. In company building, it involves identifying the one or two fundamental problems that truly move the needle across research, product, and customer needs. This "physics-like" framework helps in prioritizing efforts and avoiding getting lost in numerous potential solutions.

Reflection's Approach: Autonomy, Real-World Evaluation, and Focused Teams

Unlike large labs that might focus on building more capable chatbots, Reflection's primary interest, even before their time at DeepMind, has been autonomy. They believe that evaluating AI's effectiveness requires real-world interaction with customers and product development. This led them to form a smaller, more focused team to move faster and maintain a close coupling with product and customers, enabling them to steer research effectively. Misha highlights the difficulty of course-correcting large, established organizations, making a startup approach more agile for their specific mission.

The Era of Scaling: Simple Ideas, Great Craftsmanship

A key insight from building large-scale models like Gemini is the shift from complex, sophisticated ideas to simple ideas implemented with great detail and craftsmanship. In contrast to older AI systems like IBM's Deep Blue, which relied on complex tree structures, training LLMs involves simple objectives like predicting the next token. The success lies in the meticulous execution and robust infrastructure to support these massive models. Misha emphasizes that RLHF, a technique he and his co-founder led, utilizes relatively simple algorithms compared to earlier RL research.

Autonomous Coding: Transcending Software Engineering

Misha presents an unconventional perspective on the importance of autonomous coding. While many see its utility for software engineers, Reflection believes coding will transcend software engineering and touch every other category of computer-based work. The reasoning is that coding is the most intuitive interface for LLMs, which lack the evolved spatial reasoning of humans. As LLMs interact with software, it will likely be through programmatic interfaces rather than traditional UIs. Solving autonomous coding, therefore, is seen as solving intelligence on a computer.

The Future of Work: Architects of an AI Workforce

Misha views the impact of superintelligent AI not as a zero-sum game but as an expansion of human capability. He anticipates that AI will "lift everything up," enabling an order of magnitude more creation in computer-based work and eventually physical work. In this future, humans will transition from executors to architects, managing an AI workforce. The critical skill will be asking the right questions and designing projects correctly, as AI will handle the execution. This is analogous to the current challenge of identifying the right problems to solve in entrepreneurship and research.

The Art of Asking the Right Questions

The ability to ask the right questions is paramount, especially in a future with highly competent AI. Misha illustrates this with examples: discovering AI breakthroughs requires asking the correct questions, and the shift from optimizing training steps in RL to inventing LLMs demonstrates a fundamental change in problem framing. He admits that consistently picking the "right thing" is challenging, but frameworks like clarity of thought, often achieved through writing and iterative revision, are crucial. Discussing ideas critically with trusted individuals who challenge assumptions is also vital.

Startup Challenges: Clarity, Talent, and Motivation

Misha outlines key challenges for early-stage startups:

Clarity of Direction: Reducing a blank slate into a focused, directed path that aligns with the long-term mission.
Attracting Talent: The best way to build a stellar team is to hire stellar people. Early hires must be high-caliber individuals with whom there is great trust, as "good people beget good people."
Motivation: This requires both an ambitious, compelling long-term mission (like building super intelligence) and clarity on the short-term strategy and "wedge" into that mission. Startups must articulate why their approach is correct when alternatives exist.

Dealing with Setbacks: Care, Momentum, and Learning

Misha's approach to setbacks is rooted in deeply caring about the mission and the people involved. He views setbacks not as failures but as learning opportunities, especially in research where uncertainty is inherent. The key is to persist as long as the problem is interesting and the approach is fruitful, adapting based on new evidence. He emphasizes the importance of momentum, taking action, and making progress. For startups, customer feedback that reveals unexpected value is not a setback but an acceleration of learning.

The Impact of People: Surrounding Yourself with the Right Network

A crucial piece of advice Misha offers is the underappreciated importance of surrounding yourself with the right people. Just as in personal life, high-quality friends are essential, so too are high-quality colleagues in professional endeavors. He credits his ability to learn quickly and develop his thinking to being in Peter Thiel's lab at Berkeley, surrounded by talented individuals, despite not being an AI person at the time. Demonstrating genuine desire through action, not just words, is key to gaining access to influential people. Persistence and a clear, concrete contribution can open doors.