AI-Augmented Development Factory Workflow

Sabbatical Independent Builder 2025–2026

Situation

AI-assisted development tools can generate enormous amounts of code. The problem is not generation speed — it’s quality control, consistency, and the compounding cost of skipping steps. Without discipline, AI-assisted development produces code that passes a quick read but fails under review, breaks in CI, or doesn’t match what the product actually needs.

During my sabbatical, I was building across 100+ repositories. I needed a workflow that would let me ship fast with AI assistance while maintaining the quality standards I’d expect from a team I managed.

Decision

I codified a repeatable development loop as a set of slash commands in Claude Code:

/plan-story → /implement-story → /verify → /demo → /pr

Each step enforces a specific gate:

Quality constraints enforced throughout:

Risk

The risk was over-process. A solo developer adding five mandatory steps to every feature sounds like bureaucracy. The workflow could slow me down more than it helped, especially for small changes.

I accepted this risk because the alternative — shipping fast without gates — had already failed. Early in the sabbatical, I shipped features that looked correct but had subtle bugs, missing edge cases, or product mismatches that required rework. The rework cost more than the gates would have.

The other risk: the workflow was designed for me, working with AI. It might not transfer to a team context. I accepted this because the principles (plan before code, test before implementation, review before CI, demo before merge) are universal — only the tooling is specific.

Change

The Factory Workflow let me ship 22 PRDs across billeisenhauer-app in roughly two weeks with consistent quality. Across all sabbatical projects, it became my default operating rhythm.

Specific outcomes:

The workflow also produced transferable judgment about AI-assisted development: the AI is excellent at generating code from a clear plan and test spec, but poor at deciding what to build, what to skip, and when the product is actually done. The human gates (plan, review, demo) are where the judgment lives.

What This Demonstrates