Agent Personas

Research a person. Extract their philosophy. Generate an AI persona. Test it.

How it works

Name → 3 research agents → Synthesize → Confirm → Persona folder → Test

Input

A name. Optionally a domain and GitHub handle.

/agent-persona Kent C. Dodds

Research

Three research agents run in parallel, each covering one area:

Background & Philosophy
GitHub & Open Source
Content & Teaching
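
The parallel research step can be sketched roughly as below. This is a minimal illustration, not the tool's actual implementation: the `research` coroutine is a hypothetical stub standing in for a real search agent, and the shape of the returned dictionaries is an assumption.

```python
import asyncio

# The three research areas named above.
RESEARCH_AREAS = [
    "Background & Philosophy",
    "GitHub & Open Source",
    "Content & Teaching",
]


async def research(name: str, area: str) -> dict:
    """Hypothetical stand-in for one research agent.

    A real agent would search the web here; this stub only yields control
    to the event loop and returns an empty placeholder.
    """
    await asyncio.sleep(0)
    return {"area": area, "subject": name, "notes": []}


async def research_person(name: str) -> list:
    # gather() runs the three coroutines concurrently and returns their
    # results in the same order as RESEARCH_AREAS.
    return await asyncio.gather(
        *(research(name, area) for area in RESEARCH_AREAS)
    )


findings = asyncio.run(research_person("Kent C. Dodds"))
for finding in findings:
    print(finding["area"])
```

Running the agents concurrently means the research step takes about as long as the slowest agent, not the sum of all three.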

Output

A persona folder with philosophy, workflows, principles, and calibration tests.

~/.claude/{slug}/
  {slug}.md              Core philosophy · 5–10 principles
  workflows/
    {workflow}.md         Concrete workflows they follow
  principles/
    {principle}.md        Deep principle areas
  tests/
    calibration.md        Evaluation questions
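
As a rough sketch of how the layout above could be derived from a name: the slug rule here (ASCII, lowercase, hyphen-separated) is an assumption, since the page does not say how `{slug}` is computed, and `slugify`, `persona_paths`, and the example workflow and principle names are hypothetical.

```python
import re
import unicodedata
from pathlib import Path


def slugify(name: str) -> str:
    """Assumed slug rule: strip accents, lowercase, join words with hyphens."""
    ascii_name = (
        unicodedata.normalize("NFKD", name)
        .encode("ascii", "ignore")
        .decode("ascii")
    )
    return re.sub(r"[^a-z0-9]+", "-", ascii_name.lower()).strip("-")


def persona_paths(name, workflows, principles, root=Path.home() / ".claude"):
    """List the files a persona folder would contain, following the tree above."""
    slug = slugify(name)
    base = root / slug
    paths = [base / f"{slug}.md"]  # core philosophy
    paths += [base / "workflows" / f"{w}.md" for w in workflows]
    paths += [base / "principles" / f"{p}.md" for p in principles]
    paths.append(base / "tests" / "calibration.md")  # evaluation questions
    return paths


# Hypothetical workflow and principle names, for illustration only.
paths = persona_paths(
    "Kent C. Dodds",
    workflows=["example-workflow"],
    principles=["example-principle"],
)
for path in paths:
    print(path)
```

The function only computes paths; it does not create any files or directories.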

Personas

Guillermo Rauch: Next.js · Vercel · DX (10 principles · 8 tests · 98%)
ElevenLabs Founders: Voice AI · Audio ML (10 principles · 8 tests · 98%)
Evan You: Vue.js · Vite · Tooling (10 principles · 8 tests · 98%)
Rich Harris: Svelte · Compilers · Web (10 principles · 8 tests · 100%)
Theo Browne: T3 Stack · TypeScript · Full-stack (10 principles · 8 tests · 100%)
Tanner Linsley: TanStack · Headless UI · Type Safety (10 principles · 8 tests · 98%)
Simon Willison: Python · SQLite · AI Tools (10 principles · 8 tests · 100%)
DHH: Rails · Monoliths · Hotwire (10 principles · 8 tests · 100%)
Pieter Levels: Indie Hacking · PHP · Solo Founder (10 principles · 8 tests · 100%)
Kent C. Dodds: React Testing · Epic Web · Education (10 principles · 8 tests · 100%)
Julian Lehr: Product Strategy · Storytelling (10 principles · 8 tests · 87%)
Antoine van der Lee: iOS · Swift (10 principles · 8 tests · 92%)
Emil Kowalski: Design Engineering (10 principles · 8 tests · 100%)
Mitchell Hashimoto: Systems · Terminal · Zig (10 principles · 8 tests · 100%)
Boris Cherny: TypeScript · Claude Code (10 principles · 8 tests · 100%)
Dan Abramov: React · Mental Models · Redux (10 principles · 8 tests · 100%)
Kent Beck: TDD · XP · Refactoring (10 principles · 8 tests · 100%)
Jason Fried: Product · Basecamp · REWORK (10 principles · 8 tests · 100%)

Task-Based Evaluations

Eval 1: Set Up a New Web Project

Question: I'm starting a brand new SaaS product — a project management tool for small teams. It's a CRUD app with auth, real-time updates, and a dashboard. What would you recommend and why?

Personas tested: 5/5 · Pass rate: 100% · Total score: 50/50

All personas gave characteristically different stack recommendations while maintaining technical competence: DHH (Rails + Hotwire), Guillermo (Next.js + Vercel), Pieter (PHP + SQLite), Rich (SvelteKit), Theo (Next.js + tRPC). Baseline Claude always recommended Next.js + Supabase + Vercel.

Eval 2: Build a Task List App (L3 Generative)

Task: Build a simple task list with add/delete/mark complete functionality. Make it production-ready.

Implementations: 6/6 · Lines of code: 11,900+ · Files created: 83

Same task, six radically different production-ready implementations: DHH (Rails + Hotwire, server-rendered), Guillermo (Next.js, optimistic UI), Rich (SvelteKit, progressive enhancement), Dan (React mental models, CDN-only), Kent Beck (TDD process, git history), Jason (single 6KB file, radical simplicity).

Try DHH's Demo → Try Guillermo's Demo → Try Rich's Demo → Try Dan's Demo → Try Kent's Demo → Try Jason's Demo →

Each demo uses localStorage so you can interact with it live. Every implementation solves the same problem differently.

Evaluation Framework

Each persona is evaluated using a dual assertion model: differentiation (does the persona answer differently than baseline Claude?) and competence (is the answer technically correct?). An eval passes only if both scores are 100%.
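
The pass rule can be expressed as a short sketch. The class and function names are hypothetical, the assertion counts in the example are made up for illustration, and counting assertions as met/total is an assumption about how the two scores are computed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssertionScore:
    met: int    # assertions satisfied
    total: int  # assertions checked

    @property
    def perfect(self) -> bool:
        # 100% means every assertion was met (and at least one was checked).
        return self.total > 0 and self.met == self.total


@dataclass(frozen=True)
class EvalResult:
    persona: str
    differentiation: AssertionScore  # differs from baseline Claude?
    competence: AssertionScore       # technically correct?

    @property
    def passed(self) -> bool:
        # Dual assertion model: both scores must be 100%.
        return self.differentiation.perfect and self.competence.perfect


def pass_rate(results) -> float:
    """Fraction of evals that passed both assertion sets."""
    return sum(r.passed for r in results) / len(results)


# Illustrative data only; these are not results from the evals above.
results = [
    EvalResult("persona-a", AssertionScore(5, 5), AssertionScore(5, 5)),
    EvalResult("persona-b", AssertionScore(5, 5), AssertionScore(4, 5)),
]
print([r.passed for r in results], pass_rate(results))
```

Under this rule a persona that is distinctive but makes one technical error fails the eval, as does one that is correct but indistinguishable from the baseline.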

Evaluation levels: L1 (Philosophy Q&A) → L2 (Code review/architecture) → L3 (Build something) → L4 (Cross-persona contrast) → L5 (Multi-turn consistency)