📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google reveals that 85% of developers use AI coding tools, but the real value lies in configuring the surrounding system, not the AI model itself. The paper advocates focusing on harness and context engineering over chasing the latest models.

A new Google whitepaper titled The New SDLC With Vibe Coding emphasizes that the AI model accounts for only around 10% of the overall system behavior. The paper argues that the real value in AI-driven development lies in configuring the harness and engineering context, which together determine 90% of outcomes. This shift has significant implications for how organizations approach AI integration and development strategies.The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the dominant factor in AI system performance is how the AI is configured and integrated, not the model itself. It presents evidence from public benchmarks showing that small adjustments to the harness—such as prompts, tools, and rules—can dramatically improve performance, even with the same model. The authors distinguish between vibe coding, characterized by minimal structure and review, and agentic engineering, which involves formal specifications, verification, and oversight, leading to more reliable outcomes. They argue that costs are primarily driven by configuration and context management, not the AI models, which are relatively inexpensive and rapidly evolving. This perspective shifts the focus from chasing the latest models to investing in system design, tooling, and context engineering to achieve better and more cost-effective results.

At a glance

reportWhen: published early 2026

The developmentThe Google whitepaper highlights that in AI software development, the AI model contributes only about 10% to system behavior, shifting focus to harness and context engineering.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Implications for AI Development Strategies

This whitepaper challenges the common narrative that the AI model itself is the primary driver of system performance. Instead, it highlights that configuration, harness design, and context management are the key factors. Organizations that focus on improving these areas can achieve greater reliability and cost efficiency than those merely upgrading to newer models. This insight encourages a strategic shift towards system engineering in AI projects, emphasizing verification, tooling, and context architecture. As a result, companies can reduce costs, improve security, and enhance AI reliability by investing in these areas instead of solely chasing model improvements.

Amazon

AI system configuration tools

As an affiliate, we earn on qualifying purchases.

Background on AI System Design and Evolving Practices

Historically, AI development emphasized model innovation, with organizations racing to adopt the latest architectures and training techniques. However, recent trends show widespread adoption of AI coding agents—by early 2026, 85% of professional developers use them regularly, and 41% generate most code with AI. The whitepaper builds on this trend, arguing that the focus should shift from model performance to how AI systems are configured via prompts, tools, and rules. Previous approaches, like vibe coding, relied on minimal oversight, leading to inefficiencies and higher costs over time. The new paradigm, called agentic engineering, integrates formal verification, testing, and context management, making AI development more disciplined and cost-effective.

“The biggest shift in software engineering isn’t a new language or framework; it’s moving from writing code to expressing intent and trusting machines to interpret that intent.”
— Addy Osmani

Amazon

AI prompt engineering software

As an affiliate, we earn on qualifying purchases.

Unclear Aspects of Implementation and Industry Adoption

It remains unclear how quickly organizations will adopt the recommended focus on harness and context engineering over model chasing. The long-term impact on AI development costs and security practices is still being evaluated, and some industry players may resist shifting away from model-centric strategies.

Amazon

AI development system engineering tools

As an affiliate, we earn on qualifying purchases.

Next Steps for Organizations and Developers

Organizations should prioritize building and refining their harnesses, including prompts, tools, and verification processes. Future research and industry practice are likely to focus on developing standardized frameworks for context engineering and cost-effective system design. Monitoring how these strategies influence AI reliability, security, and operational costs will be essential over the coming months.

Amazon

AI testing and verification software

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system behavior?

According to the whitepaper, the model’s output is heavily influenced by how it is configured—through prompts, tools, and rules—making the surrounding system the dominant factor.

How can organizations improve their AI systems based on this insight?

Focusing on designing better harnesses, including prompts, context management, and verification processes, can lead to more reliable and cost-effective AI outcomes.

Does this mean AI models are less important?

No, but the whitepaper emphasizes that the value of models is limited without proper system configuration. The real gains come from how models are integrated and controlled.

What are the risks of focusing less on models?

While shifting focus can reduce costs and improve security, it requires investment in system engineering and may slow down rapid model updates or innovations.

Will this change how AI tools are developed and sold?

Potentially, as companies may prioritize developing better harnesses, tooling, and verification frameworks rather than solely pushing new models.

Source: ThorstenMeyerAI.com

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

Cornford and Cross Team

The model is only 10%