Y Combinator•May 8, 2026

AI Agents Software Development Productivity Prompt Engineering Artificial Intelligence Entrepreneurship Y Combinator

Thin Harness, Fat Skills: The New Way To Build Software

YouTube · 57lDpTwiW6g

Quick Read

Gary Tan, YC CEO, details how he shipped hundreds of thousands of lines of code in months after a 13-year hiatus, leveraging AI agents and a "thin harness, fat skills" approach to achieve 400x productivity.

●AI agents enable unprecedented software development speed, allowing one person to ship hundreds of thousands of lines of code.

●"Token maxing" and comprehensive AI-driven testing are critical for building robust, high-quality AI-generated software.

●The future of building involves simple code harnesses orchestrating powerful, markdown-defined AI skills, with humans providing essential agency.

Summary

Gary Tan, CEO of Y Combinator, shares his unexpected return to software development after 13 years, building popular open-source projects like Gary's List and GStack. He attributes his ability to ship hundreds of thousands of lines of code while running YC full-time to a new paradigm of AI-assisted engineering. This involves "token maxing" (feeding LLMs extensive context), using AI for comprehensive testing, and adopting a "thin harness, fat skills" philosophy where simple, robust code (harness) orchestrates powerful, markdown-driven AI agents (skills). He emphasizes the importance of human agency in guiding AI and views this era as a "Homebrew Computer Club" moment for personal AI, urging builders to embrace the new tools despite their current "brittle" nature.

This episode provides a practical blueprint for developers and founders to dramatically increase their output using AI agents. It challenges traditional notions of coding productivity, highlights the strategic importance of "token maxing" and comprehensive AI-driven testing, and frames the current AI development landscape as a pivotal moment akin to the personal computer revolution, emphasizing the need for human control over AI tools.

Takeaways

❖Gary Tan, after a 13-year coding break, shipped hundreds of thousands of lines of code in months by leveraging AI agents, achieving 400x his previous output.
❖The "Thin Harness, Fat Skills" paradigm advocates for minimal, stable code (harness) to orchestrate powerful, markdown-based AI agents (skills).
❖"Token maxing" – providing LLMs with maximum context and allowing them to "boil the ocean" – is crucial for comprehensive research and complete code generation.
❖AI agents can achieve 80-90% test coverage, automating the least enjoyable part of coding and significantly improving software quality.
❖The current AI development environment is compared to the "Homebrew Computer Club" era, requiring builders to be "mechanics" for their powerful but sometimes brittle tools.
❖Embracing personal AI and learning to write effective prompts is essential for maintaining control over one's tools, rather than being controlled by corporate algorithms.
❖Founders should view token spend as "rent" – a critical investment for competitive advantage, not an expense to economize.

Insights

1400x Productivity Boost with AI Agents

Gary Tan, after a 13-year hiatus from coding, returned to building and achieved a 400x increase in logical lines of code output compared to his previous coding peak. This was accomplished by directing 15 AI agents simultaneously, primarily using Claude Code and OpenClaw, to generate and test code.

It was 13 years of not coding and then suddenly boom, I'm doing about 400x the amount of work... I was directing you know 15 agents at a time to do so.

2The "Thin Harness, Fat Skills" Architectural Philosophy

The core idea is to build a "thin harness" – a minimal, robust code loop that handles user input and orchestrates LLM actions – and "fat skills" – complex, context-rich instructions written in markdown that leverage the LLM's latent space for flexible, intelligent execution. This separates deterministic actions (code) from nuanced, knowledge-based tasks (markdown/LLM).

A harness is the core loop that takes the user input gives it to the LLM runs what the LLM does... What we should spending all our time doing is thinking about what markdown should there be?

3"Token Maxing" for Comprehensive AI-Driven Work

To achieve high-quality, complete results with AI agents, it's essential to "token max" – provide the LLM with the most extensive possible context, allowing it to "boil the ocean" in terms of research or code generation. This means feeding it dozens of sources, cross-referencing information, and accepting higher token costs for superior output.

What if you absolutely boiled the ocean... if there is incremental work that makes something more complete more awesome... you should token max.

4AI Automates Comprehensive Testing and QA

AI agents can achieve 80-90% test coverage (unit, integration, end-to-end) for generated code, automating the typically tedious and time-consuming aspects of software development. This significantly improves code quality and allows human developers to focus on higher-level tasks.

I can get to 100% test coverage... hitting 80 to 90% is usually the best practice... the machine doesn't care, it'll just do it.

5The "Ferrari with a Wrench" Analogy for Current AI Tools

Using advanced AI tools like OpenClaw is likened to driving a Ferrari: exhilarating and powerful, enabling unprecedented feats. However, like a Ferrari, these tools are currently brittle and will break down, requiring the user to be a "mechanic" and fix them manually. This highlights the need for deep technical skill alongside AI adoption.

Using OpenClaw these days is like driving a Ferrari... it's a Ferrari that will break down... you need to get out with your wrench and pop the hood and like fix it.

Bottom Line

The current AI development phase is akin to the "Homebrew Computer Club" era, where powerful but unpolished tools (like the Apple I) required users to be highly technical "kit car Ferrari" builders to get them running.

So What?

This implies that early adopters and those willing to get their hands dirty with current AI tools will gain a significant advantage, shaping the future of software development.

Impact

Develop tools, educational content, or communities that simplify the "mechanic" aspect of AI agent development, making these powerful tools more accessible to a broader audience without sacrificing control.

The defining question for the personal AI revolution is whether individuals will control their own AI tools and data, or if corporate entities will dictate the experience, similar to social media feeds.

So What?

This is a call to action for users to learn prompt engineering and take ownership of their AI agents to prevent a future where personal AI is centrally controlled and optimized for external business models.

Impact

Build open-source, privacy-preserving personal AI frameworks and educational platforms that empower individuals to customize and control their AI agents, fostering a decentralized AI ecosystem.

Opportunities

AI-Powered Investigative Journalism Platform

A platform that uses agentic retrieval (Perplexity, X, Grok APIs) to "boil the ocean" on any topic, ingesting vast amounts of internet data, cross-referencing sources, and generating detailed, fully sourced long-form articles and reports. It acts as an investigative journalist, not just a publishing tool.

Source: Gary's List, which functions as an agentic newsroom for California politics.

AI Agent Workflow Orchestration and Skill Repository

A system (like GStack) that allows users to define and chain AI "skills" (e.g., CEO skill, Designer skill, QA skill) using markdown, enabling complex, multi-agent workflows for software development, product planning, or other knowledge work. It would manage agent interactions, context passing, and provide a human-in-the-loop interface.

Source: GStack, which Gary built to automate his development process.

AI-Driven Automated QA and End-to-End Testing Service

A service that leverages AI agents (e.g., wrapping Playwright) to automatically perform comprehensive QA, including end-to-end, integration, and unit testing, for new features or entire codebases. It would interpret context, simulate user flows, and report bugs, drastically reducing manual testing effort.

Source: Gary's experience building automated QA for Gary's List using Playwright and Claude Code.

Key Concepts

Thin Harness, Fat Skills

This architectural principle suggests building minimal, stable code "harnesses" that handle core loops and deterministic actions, while delegating complex, context-dependent, and evolving tasks to "fat skills" defined in markdown for AI agents. The harness provides structure, and the skills provide flexible, intelligent execution.

10x Check / 10-Star Experience

A product and design exercise, inspired by Brian Chesky, that pushes beyond conventional improvements to imagine a 10x more ambitious solution or a "10-star" user experience. Applied to AI, it encourages thinking about how AI can deliver disproportionately higher value (10x value for 2x effort) by leveraging its latent space capabilities.

Lessons

Embrace "token maxing": Don't economize on token spend when using LLMs for critical tasks; invest in providing maximum context to achieve more complete and higher-quality outputs.
Adopt the "Thin Harness, Fat Skills" architecture: Design your AI-powered applications with minimal, stable code harnesses and powerful, markdown-defined AI skills for flexibility and scalability.
Integrate AI for comprehensive testing: Leverage AI agents to automate unit, integration, and end-to-end testing, aiming for 80-90% coverage to ensure robustness and free up human developers.
Cultivate "mechanic" skills: Be prepared to debug and fix AI agent failures, as current tools are powerful but brittle; this hands-on approach is crucial for early adopters.
Learn prompt engineering: Actively develop your ability to write effective prompts and define AI agent behaviors to maintain control over your personal AI tools and avoid reliance on opaque corporate algorithms.

GStack AI Agent Development Workflow

Define Core Idea (Office Hours Skill): Start with a high-level product idea, clarifying "who is it for, what does it do, and what's the impact?"

Strategic Planning (CEO Skill): Apply the "10x Check" and "10-Star Experience" mental models to envision the platonic ideal and most ambitious version of the feature, aiming for 10x value for 2x effort.

Architectural Design (ASCII Diagrams): Before coding, instruct the AI to generate ASCII diagrams of data flows, state machines, dependency graphs, and user flows to load context and ensure completeness.

Developer Experience & UI Review (Designer/Developer Skills): If applicable, run design and developer experience reviews to ensure usability and ease of integration.

Code Generation & Testing (Plange/Codex Skills): Use AI agents (like Claude Code) to generate code, prioritizing 80-90% test coverage (unit, integration, end-to-end) to prevent "slop."

Expert Review & Debugging (Codex/Claude Switch): For complex problems or persistent bugs, switch between different AI agents (e.g., Codex for "200 IQ nearly nonverbal CTO" problems, Claude for "ADHD CEO" oversight) to find solutions.

Human-in-the-Loop QA (Browse/QA Skill): Manually test UI or data mutations using an AI-powered blackbox browser (like Browse) while providing human feedback, as the human operator's understanding remains irreplaceable.

Notable Moments

Gary Tan's return to coding after 13 years, shipping hundreds of thousands of lines of code while running YC full-time.

This demonstrates a paradigm shift in developer productivity enabled by AI agents, challenging the notion that such output is impossible for a single individual, especially one with a demanding executive role.

The realization that AI can automate comprehensive software testing (80-90% coverage), a task typically disliked by developers.

This highlights AI's potential to improve software quality and free human developers from tedious work, allowing them to focus on more creative or strategic tasks.

The comparison of current AI development to the "Homebrew Computer Club" era and "kit car Ferrari" phase.

This frames the present as a foundational moment for personal AI, emphasizing that early adopters who are willing to grapple with imperfect tools will be at the forefront of a new technological revolution.

Quotes

"Will you have control over your own tools or will your tools have control over you?"

Gary Tan

"Using OpenClaw these days is like driving a Ferrari and it's like exhilarating. It's insane... But then it's also like a Ferrari and that you better be a mechanic. like it's a Ferrari that will break down on the side of the road... and you need to get out with your wrench and pop the hood and like fix it."

Gary Tan

"If you token max like that's actually the coolest thing you can do now and it's not just in you know generating articles it's not you know it's clearly in uh writing code right I think now it's it's going to permeate every part of society like every thing that we would call knowledge work could be token maxed."

Gary Tan

"I just want the machine to do the stuff that I don't want to do."

Gary Tan

"You could have infinite time by borrowing the time from the machines."

Host

Q&A

Related Episodes

85 South• Mar 23, 2026

DAPHNIQUE SPRINGS | ENGLISH MAJORS | SEASON 3 | EP 10

"Comedian Daphnique Springs joins the hosts for a freewheeling discussion on everything from her childhood fish-selling hustle and the 'Florida Man' phenomenon to deep-sea conspiracies, the future of AI, and parenting advice for children with ADHD."

ComedyEntrepreneurshipFlorida Man+2

Google TechTalks• Feb 2, 2026

Code Health Guardian

"This talk introduces a comprehensive model for understanding and managing code complexity, arguing for its objective nature and the critical role of human understanding in the AI era to maintain software health."

Software EngineeringArtificial IntelligenceProductivity

Matthew Cox | Inside True Crime• May 10, 2026

Wes Watson Student Arrested as a Firefighter

"A former firefighter and paramedic recounts his harrowing descent into drug addiction, multiple brushes with the law, and ultimate recovery, culminating in a successful business partnership and a transformed life."

Addiction RecoveryPersonal TransformationEntrepreneurship+2

The Joe Budden Podcast• May 7, 2026

The Joe Budden Podcast Episode 926 | Let’s See That Snapper

"The hosts dissect the collapse of Spirit Airlines, the predatory nature of music contracts, and the weaponization of private sexual preferences in public disputes, all while celebrating the Knicks' playoff dominance."

NBA PlayoffsCelebrity LawsuitsDomestic Violence+2

Thin Harness, Fat Skills: The New Way To Build Software

Quick Read

Summary

Takeaways

Insights

1400x Productivity Boost with AI Agents

2The "Thin Harness, Fat Skills" Architectural Philosophy

3"Token Maxing" for Comprehensive AI-Driven Work

4AI Automates Comprehensive Testing and QA

5The "Ferrari with a Wrench" Analogy for Current AI Tools

Bottom Line

Opportunities

AI-Powered Investigative Journalism Platform

AI Agent Workflow Orchestration and Skill Repository

AI-Driven Automated QA and End-to-End Testing Service

Key Concepts

Thin Harness, Fat Skills

10x Check / 10-Star Experience

Lessons

GStack AI Agent Development Workflow

Notable Moments

Gary Tan's return to coding after 13 years, shipping hundreds of thousands of lines of code while running YC full-time.

The realization that AI can automate comprehensive software testing (80-90% coverage), a task typically disliked by developers.

The comparison of current AI development to the "Homebrew Computer Club" era and "kit car Ferrari" phase.

Quotes

Q&A

Recent Questions

Related Episodes

DAPHNIQUE SPRINGS | ENGLISH MAJORS | SEASON 3 | EP 10

Code Health Guardian

Wes Watson Student Arrested as a Firefighter

The Joe Budden Podcast Episode 926 | Let’s See That Snapper