"okay, but I want Gemini3 to perform 10x for my specific use case" - Here is how
By AI Jason
Share:
Key Concepts
- Reasoning Models: AI models (like Gemini 3) that utilize "reasoning tokens" to process information, requiring concise, direct prompts rather than overly complex, context-heavy ones.
- Distributional Convergence: The tendency of AI models to default to "safe," generic, and statistically common outputs based on their training data.
- Convergent Defaults: The specific, often suboptimal, default behaviors a model exhibits for a given task.
- Prompt Altitude: The level of abstraction in a prompt; finding the balance between being too specific (which leads to overfitting) and too vague (which leads to generic results).
- Iterative Loop: A systematic process of testing, identifying root causes of poor output, and refining prompts to steer model behavior.
1. Understanding Gemini 3 and Reasoning Models
Gemini 3 is classified as a reasoning model. Unlike traditional LLMs that benefit from "fully packed" prompts containing extensive context, reasoning models perform better with concise, clear instructions.
- The Over-Prompting Trap: Providing overly complex prompts can cause the model to overanalyze variables or become constrained by the specific process outlined in the prompt, leading to degraded performance.
- Steerability: Despite needing conciseness, Gemini 3 is highly sensitive to simple keywords. Adding a single stylistic reference (e.g., "linear style") can fundamentally transform the output quality.
2. The "Distributional Convergence" Framework
The video highlights a methodology popularized by Anthropic for improving front-end design outputs, which can be applied to any task (debugging, data analysis, etc.).
- The Problem: Models default to "safe" design choices (e.g., boring fonts like Roboto or Inter, generic blue/purple color schemes) because these patterns dominate the training data.
- The Solution: Identify these "convergent defaults" and provide concrete, high-quality alternatives.
3. Three-Step Iterative Process for Prompt Engineering
To achieve high-quality, consistent outputs, follow this systematic loop:
- Identify Convergent Defaults: Run a "bare minimum" prompt to see the model's default behavior. Identify specific areas where the output falls short (e.g., typography, layout, or schema errors).
- Find the Root Cause: Instead of just telling the model what not to do, ask it (or use a secondary model) to explain why it made a specific choice.
- Example: If the model sets a width to "0," ask it why. It may reveal a misunderstanding of the underlying technical schema (e.g., expecting dynamic resizing that doesn't exist in the target format).
- Structure Guidance at the Right Altitude: Avoid listing step-by-step instructions (which leads to overfitting). Instead, provide the reasoning behind the desired behavior.
- Example: Instead of listing every property to include, instruct the model: "Only output properties that impact styling; never output metadata like seed or version numbers."
4. Real-World Application: Excalidraw Wireframes
The speaker demonstrates this process by training an agent to generate high-quality Excalidraw wireframes:
- Initial Failure: The model produced incorrect JSON schemas and invalid coordinate formats.
- Debugging: By using "debug mode" and cross-referencing with other models (like GPT), the speaker identified that the model was using incorrect properties for text elements.
- Refinement: The prompt was updated to provide the correct logic for text alignment and container sizing, resulting in accurate, professional-grade wireframes.
5. Notable Quotes and Perspectives
- "The more prompts you give Gemini, the worse the performance." — On the nature of reasoning models.
- "Once the model improves one aspect of the design, it generally starts to improve across all other behaviors." — On the ripple effect of iterative prompt refinement.
- "Don't just list out specific steps... articulate the reasoning behind the behavior." — On maintaining the correct prompt altitude.
6. Synthesis and Takeaways
- Quality over Quantity: Stop "stuffing" prompts with context. Focus on identifying the specific "default" behaviors you dislike and replacing them with clear, reasoned alternatives.
- Iterative Refinement: Treat prompt engineering as a loop. Test, debug the root cause, and refine the guidance.
- Domain Knowledge is Essential: To effectively steer a model, you must understand the underlying domain (e.g., the JSON schema of Excalidraw or CSS best practices).
- Tooling: The speaker recommends using the Anthropic "Front-end Design" skill (available in the Claude Code plugin marketplace) as a template for how to structure professional-grade system prompts.
Chat with this Video
AI-PoweredLoad the transcript when you're ready to chat so the initial page stays lighter.