How does GAIA make Claude output trustworthy?

GAIA encodes your coding standards as enforceable rules loaded automatically every session. Six coding principles, guardrails against technical debt patterns, scoped skills that apply project conventions at write time, test-driven development with Playwright E2E tests written before code, and a code-review-audit agent that scans every PR for security, performance, and architecture issues before merge.

How does GAIA manage Claude's context window?

GAIA uses three intercepts to keep Claude's working surface lean: path-scoped rules that load only when you edit matching files, LSP-backed symbol search that returns one canonical definition instead of dozens of grep matches, and periodic knowledge audits that prune stale entries and over-budget rules. Token costs stay predictable as the project grows.

How do I report a bug in GAIA?

Run /gaia forensics. It turns any failure into a structured bug report: redacted, classified, and ready to file. User-configuration issues return the fix inline. Probable GAIA bugs file directly to GitHub with one prompt, with the same body byte-for-byte, so the maintainer can reproduce and fix them fast.

How does GAIA keep a Claude Code project healthy over time?

Run /gaia fitness. It audits your entire Claude integration, including hooks, rules, skills, CLAUDE.md, settings, and GAIA install fitness, then grades each area A+ to F and heals what it can on a branch. You review the diff and commit when satisfied. The check protocol survives /update-gaia.

How does GAIA automate project maintenance?

GAIA CI handles dependency updates (patch and minor auto-merge with codemods, major routes to a review PR), daily security audits with pnpm audit, wiki sync whenever app code changes, and monthly stale branch cleanup. Claude handles the safe cases automatically. Humans only see what it cannot safely recover from, with a hard $5 cost cap per run.

What agentic design patterns does GAIA implement?

GAIA implements 17 of the 29 patterns in the agentic design taxonomy: durable planning artifacts requiring user approval, five-tier memory and knowledge base, resource-aware model routing (Sonnet and Haiku for mechanical work, Opus for deep reasoning), session isolation with git worktrees, and six human-in-the-loop checkpoints between Claude's intent and its impact on the codebase.

The Discipline of GAIA

GAIA's discipline is structural. Claude works inside it.

Hooks intercept the tool call. Rules shape the writing. Gates hold the commit. Audits guard the merge. CI keeps the project current.

Discipline is leverage. Each layer turns Claude's capability into work that ships.

Trustworthy by default

You can't trust output you can't predict. Without enforceable conventions, Claude reverts to its training distribution, an average of every codebase on the internet, bad code and all.

GAIA's coding style is what you actually want Claude matching. With GAIA, Claude writes code that follows best practices on day one, and can't ship code that doesn't.

Coding principles
Six coding principles encoded as rules and loaded automatically. Every session starts from the same disciplined baseline. See the principles below.
Guardrails against technical debt
Rules block debt-accumulating patterns from being written in the first place: untyped objects, untested components, hardcoded strings, a11y gaps.
Bundled skills wired in for write-time quality
Scoped skills load on demand when Claude edits matching files. They apply project conventions without re-deriving them each session.
Test-driven development
Specs become Playwright E2E tests before code is written. The first task in any feature is turning red tests green.
Code review before every merge
The code-review-audit agent scans for security, performance, architecture, code smells, and antipatterns. Extendable with custom rules.

Load on demand

Three intercepts keep Claude's working surface lean. Rules sit dormant until you touch matching files. Symbol queries return one answer, not every match. Periodic audits prune what accumulates.

Session start

Scoped rules

Each rule declares the file patterns it cares about via paths: frontmatter. The Tailwind rule loads when you edit a .tsx or .css file. The Storybook rule loads only on .stories.tsx files. The rest stay dormant until matching files are touched.

.claude/rules/tailwind.md

---

paths:

- 'app/**/*.tsx'

- 'app/**/*.css'

---

Every lookup

Symbol search

Serena MCP gives Claude LSP-backed code intelligence. A query for createSession returns one canonical definition. A grep returns every line that mentions the name, and Claude reads each match to triage. Symbol search returns the answer, not the noise.

serena vs grep

grep

serena

matches

read tax

46 ln

type-resolved

yes

imports

guess

trace

Periodic sweep

Knowledge audit

/gaia audit sweeps memory, the wiki, and autoloaded files for cross-store duplication, stale entries, and rules that drift over budget. Two stages: a report with sha256 drift checks, then a mechanical apply. Anything that changed since the report gets skipped.

KNOWLEDGE-2026-05-12.md

Stores scanned84 files

Duplicates6

Over-budget1

Stale3

Actions queued11

State lives on disk

Long Claude sessions bloat. Context piles up each turn, memory accumulates between turns, every new session starts cold and burns tokens re-briefing. GAIA's four workflow commands write durable artifacts to disk, then read them on demand. State persists. The conversation stays small.

/gaia specimmutable

SPEC-007·feat: card grid layout

Intent

Browse cards in a grid. Selection opens a detail pane.

UATs

✓3 cols at viewport ≥ 1024px
✓Enter on focus opens detail
✓Esc closes the detail pane

clarifications.answered: 5 of 5

One question at a time. Discovery state lives in a draft cache, then becomes an immutable artifact. The next step reads the file, not the chat.

/gaia plansubagent

.gaia/local/plans/spec-007-card-grid/

README.mdtask graph

ORCHESTRATOR.mdphase order

KICKOFF.mdfresh-session entry

task-types.mdphase 1

task-grid.mdphase 2

task-detail.mdphase 2

SUMMARY.mdfindings ledger

return payload: file paths only

Planning runs in a spawned subagent. Your main session sees file paths, not the plan. Execution starts cold from KICKOFF.md.

/gaia handoffsynthesis

HANDOFF-2026-05-12-card-grid.md

Branch: feat/card-grid

Context: ported to CSS subgrid

Accomplishments

· Grid shipped (a1b2c3d)
· Detail pane wired

Gaps

· Mobile breakpoint @ 640px open

ref: @app/Cards.tsx:42

Synthesized from the conversation, not dumped. References stay as pointers to file lines, never inlined.

/gaia pickup≤ 15 lines

Branch: feat/card-grid (clean)

Last handoff: 2026-05-12 (12m ago)

Context: ported to CSS subgrid

State

· Grid shipped (a1b2c3d)
· Detail pane wired

Open

· Mobile breakpoint @ 640px

suggested next: pin breakpoint behavior

Reads the handoff, prints 15 lines, archives it after commit. No transcript replay.

Agentic design

GAIA wires design patterns into hooks, agents, rules, commands, and wiki conventions, so they run the same way every session, every agent, every model variant. Not emergent behavior. Not prompts you repeat. The discipline lives in the project.

Workflow control

Planning

Plans are durable artifacts requiring user approval, produced before any work begins: task graph, execution playbook, kickoff prompt.

Memory & Knowledge Base

Five tiers so Claude stops relearning the codebase: wiki, hot cache, handoff snapshots, per-agent memory, cross-project memory.

Resource-Aware Optimization

Mechanical work runs on Sonnet or Haiku. Heavier reasoning routes to Opus when the cost is merited.

Session Isolation

Sub-agents run in fresh contexts so shared state can't corrupt their work. Git-worktree branches add filesystem-level isolation when needed.

Context engineering

Symbol-Aware Retrieval

Serena MCP gives Claude LSP-backed symbol search. A query returns the one definition, not every line that mentions the name.

Skill Activation

Skills load by trigger, not by default. They sit on disk without burning a token until the matching phrase appears.

Knowledge Retrieval

Claude pulls facts from the wiki on demand. A hot cache primes at session start; deeper questions trigger explicit retrieval.

Routing

Path-scoped rules auto-load only when Claude is editing matching files. Conditional hooks route commands to the right gate.

Tooling and safety

Human-in-the-Loop

Six checkpoints between Claude's intent and impact. Quality gate, code-review audit, plan approval, phase gates, destructive-command block, merge confirmation.

Guardrails & Safety

Defense in depth. Secrets blocked from read or write. Debt patterns rejected at the source. The audit covers XSS, SSRF, IDOR, and dep vulns.

Parallelization

Independent work runs concurrently. Audit subagents fan out from a single dispatch. Plan phases run in parallel where dependencies allow.

Tool Use

A curated React-specific tool surface. Linting, types, unit and E2E tests, visual regression, mocking, the GitHub CLI, the wiki.

Discipline, in six principles

Think Before Coding: Surface assumptions. Ask when unclear.
Simplicity First: Write the minimum that works.
Surgical Changes: Touch only what is needed.
Goal-Driven Execution: Define done. Loop until verified.
Always Use TDD: Tests before code, always.
Always Verify Work: Pass the gates before reporting done.

Four from Karpathy · Two added by GAIA

A second brain for Claude

With an Obsidian wiki, Claude understands what you are actually building, not just the code in front of it. Product features, user flows, design rationale, business decisions, architecture, and dependencies all live as focused markdown pages committed to git, not CLAUDE.md, not chat history, not machine-local memory.

GAIA ships with the claude-obsidian integration wired up. The vault structure, the ingestion commands, and the maintenance skills are configured before you write your first page.

auth / login-flow.mdobsidian

tagsauth, frontend, security

status✓ shipped

related[[oauth-providers]] [[session-store]]

updated2026-05-12

Authentication flow

The login route lives in [[routes/auth]] and uses [[session-store]] to persist tokens. See [[security-headers]] for CSP requirements.

Token rotation

Tokens rotate every 24h via [[cron/rotate-tokens]].

Linked from

[[product-flows]] · [[onboarding]] · [[security-audit-2026-04]]

Self-maintaining knowledge base
GAIA keeps the wiki current without you managing it. Commits are evaluated for wiki-worthy content and synced automatically. Consolidation passes catch redundancy and reversed decisions before they compound. Lint sweeps flag orphan pages, dead links, and drift. None of this is a feature you invoke. It is a discipline you get.
Token costs don’t balloon
Token costs do not balloon as the wiki gets richer. Claude only reads the specific information a task needs, so a project with 1,000 pages of context is no more expensive to work in than one with 100.
Why Obsidian?
A local markdown vault means the project’s knowledge persists, compounds, and stays yours, not held in a vendor’s database or trapped in chat history.
Domain isolation is mandatory
Technical, branding, and business knowledge are kept siloed, and Claude does not cross-load between them unless a task genuinely spans more than one.

Maintenance, automated

GAIA's continuous integration layer keeps the project healthy between sessions. The wiki stays in sync with the code. Dependencies stay current and tested. Security findings get patched. Stale branches don't pile up.

Claude handles the safe cases. Humans only see the ones it can't recover from.

WhatWhenAction

Patch / minor dependency bumps

Necessary code migrations and codemods are implemented in the same PR.

weeklyauto-merge

Major dependency bumps

Routes to a separate review-required PR so the upgrade gets human eyes.

weeklyreview

High / critical security findings

pnpm audit. High and critical findings open an issue plus a single-package patch PR.

dailyauto-merge

Wiki sync on app changes

When app code changes, the wiki sync runs and opens a labeled PR.

on commitauto-merge

Wiki rewrite > 25%

A run that rewrites more than 25% of the wiki holds for human review.

on commitreview

Stale branches > 30d

Branches merged more than 30 days ago. Setup also offers repo-level delete-on-merge.

monthlyauto-delete

Auto-merge on green, auto-revert on failure: Every workflow opens a labeled PR and auto-merges on green CI. If post-merge CI fails, the bot opens one revert PR. The hard cap is one revert attempt. A second failure escalates to a priority issue and the bot stops.
Cost-capped by design: GAIA CI runs against your Claude Code Pro/Max subscription or your Anthropic API key. A $5 hard ceiling per run caps API spend.

Upgrades that land finished

All Dependabot and Renovate do is open a PR with a version bump, leaving the actual upgrade work for a human to figure out.

GAIA makes sure Claude does that work. Codemods, source migrations, conflict resolution, all resolved before the PR opens. What you review is a finished change, not half-done work.

Dependabot · Renovate

gaia-react/gaia#1247deps

chore(deps): bump react-router 6.30 → 7.0

-"react-router": "^6.30.0"

+"react-router": "^7.0.0"

0 of 4 checks · awaiting human

· read changelog
· update call sites
· run codemods
· fix breakages

GAIA Update Deps

gaia-react/gaia#1247deps

chore(deps): bump react-router 6.30 → 7.0

-import {useNavigate} from '@remix-run/react'

+import {useNavigate} from 'react-router'

~codemod applied · v7-route-types · 14 files

typechecktests 47/47buildaudit

semver: minorauto-merge on green

Codemods and migrations applied: When a new dependency version requires a codemod or breaking-change migration, GAIA makes sure Claude handles it. The PR includes the bump and the migration together.
Call sites updated when APIs change: When the new version moves or renames public API surface, Update Deps updates the call sites. Grep-replace-with-context work, automated.
Conflicts resolved before the PR opens: When two simultaneous upgrades touch the same code path, Update Deps resolves the overlap before opening the PR. No conflicting PRs that fight at merge.
Iterates until the upgrade lands clean: Runs the upgrade, runs the suite, sees what breaks, fixes it, and runs the suite again. The PR doesn't open until the upgrade is working.

Drift happens

As a project grows, its Claude integration drifts. New hooks, new rules, edited skills, a CLAUDE.md that creeps past its budget, settings that collect redundant permissions. Every bit of that drift makes Claude a little less effective and a little more expensive to run.

/gaia fitness audits the whole surface, grades it F to A+, and heals what it can, keeping your Claude integration lean and current as the project grows.

$ gaia fitnessOverallB+

Hook integrityA+

Skill / command / agent frontmatter1 warningB+

Rule hygieneA+

CLAUDE.md hygiene1 infoA

Settings hygieneA+

GAIA-install fitness1 infoA

Wiki fitnessA+

warning.claude/commands/deploy.md — description frontmatter is missing

infoCLAUDE.md — over the size budget; consider splitting

infoGAIA v1.2.0 installed, v1.2.3 available · /update-gaia

healed 1 findingbranch chore/gaia-fitness-2026-05-12-1430 · review and commit

Heals on a branch, waits for your commit: Fixes land on a branch (a fresh chore/gaia-fitness branch if you’re on main, your current branch otherwise). You review the diff and the report, then commit when you’re satisfied.
Won’t touch an unsafe tree: Detached HEAD, or a rebase, merge, or cherry-pick in progress: it runs triage, prints the grades, and stops. No mutation on a state it can’t safely touch.
Yours to extend: Add project-specific graded check categories. /gaia fitness runs whatever you define alongside the built-in seven.
Survives /update-gaia: The check protocol three-way-merges on upgrade, so the checks you add to it carry forward.

Bugs happen

GAIA is open source, written by one developer (so far). Contributors and sponsors welcome.

/gaia forensics turns “this didn’t work” into a complete report: redacted, classified, ready to file. User-config issues return the fix inline. Probable GAIA bugs file straight to my GitHub with one prompt. Same body byte-for-byte, so I can fix them fast.

github.com/gaia-react/gaia#1247

bugforensics

Plan orchestrator exit 1 during phase 2

gaia-user-3091 opened · 12m ago

Stack

Error: orchestrator exit 1

at gaia/skills/plan/orchestrator.ts:284

Environment

home<redacted>

shellzsh

key[REDACTED_API_KEY]

Context

Running /gaia plan on a feature rework. Exit during the plan-phase-2 quality gate.

— filed via /gaia forensics —

— body byte-identical to local report —

A professional frontend stack

Industry-standard tools, each a default in serious React projects. Refined over years. The same tools humans rely on to ship reliable code. Claude works with them, just faster.

Package management runs on pnpm. Faster installs. Smaller node_modules. Stricter dependency resolution that blocks phantom imports and the easiest path for npm supply-chain attacks.

Foundation