Published on September 22, 2025

From Shower Ideas to Production: Autonomous AI Agents

TL;DR >> I've solved the idea-to-code friction by running autonomous AI agents in VMs. Every shower idea gets spec'd and implemented automatically using loop.sh orchestration, specialized subagents, and proper tooling. <<

Topics:

AUTOMATION

TOOLS

WORKFLOW

Every great project starts with a crazy idea in the shower.

The problem? Getting from lightbulb moment to working code takes weeks of stop-start development, lost context, and soul-crushing context switching.

I’ve solved this by running 100% autonomous AI agents in VMs. Every shower idea gets spec’d and implemented without me touching a keyboard.

It sounds like science fiction. It’s surprisingly straightforward.

# The Philosophy: Maximize Autonomous Execution

The core insight is simple: AI agents don’t need supervision—they need structure. Give them clear objectives, proper tooling, and robust error handling, and they’ll outperform traditional development workflows.

This isn’t about replacing developers. It’s about eliminating the friction between ideas and implementation. When you remove the overhead of project setup, environment configuration, and todo management, you can focus on the creative and strategic work that actually matters.

# The Stack: Three Tools, Maximum Impact

WisprFlow + Claude: Every idea gets spec’d in a tmux session using Termius. WisprFlow captures the voice-to-text brain dump, Claude structures it into actionable requirements. This conversation becomes the project foundation.

loop.sh: The orchestration engine. This bash script runs Claude Code in fully autonomous mode (e.g. dangerously-skip-commits flag), handling task selection, implementation, git commits, and error recovery. It’s designed to run for hours without intervention.

Specialized Subagents: Two critical pieces—task-master for todo file management and git-master for version control. These handle the mundane but essential work that keeps projects moving forward.

# How It Actually Works

Here’s the real workflow:

Ideation Phase: Voice dump the entire concept to WisprFlow while walking or in the shower (VoiceInk works too on Mac). No structure needed—just stream of consciousness.
Specification: Claude takes the voice transcript and creates a comprehensive project spec with architecture decisions, technology choices, and a structured todo file.
Autonomous Implementation: Launch loop.sh pointing at the todo file. The agent runs for hours, selecting tasks, implementing features, handling errors, and committing progress.
Rate Limit Management: When the 5-hour Claude limit hits, the system gracefully stops with clear restart instructions. No lost work, no confused state.
Resume and Repeat: Restart the loop when limits reset. The agent picks up exactly where it left off using session continuity.

# Production Patterns That Emerged

Spec Driven Development: The voice-to-spec pipeline becomes the single source of truth. Every project starts with a comprehensive specification that includes architecture decisions, tech stack choices, and acceptance criteria. This upfront investment pays massive dividends when the agent hits implementation—no ambiguity, no mid-flight architecture changes, no scope creep.

Failure Recovery: The system distinguishes between recoverable errors (missing files, syntax issues) and hard failures (rate limits, fundamental blockers). It attempts recovery for the former and gracefully stops for the latter.

Task Management Mastery: The task-master subagent becomes a productivity multiplier. It maintains perfect todo file hygiene, breaks down complex features into atomic tasks, and provides intelligent task selection based on current context and dependencies. No human ever has to think about what to work on next—the system handles prioritization, estimation, and progress tracking autonomously.

Git Discipline: Every completed task gets committed with meaningful messages. This creates a clean history and enables easy rollbacks if the agent takes a wrong turn.

Cost Tracking: Real-time monitoring of API costs and execution time. You know exactly what each feature costs to implement.

# What Works (And What Doesn’t)

Wins:

Complex refactoring that would take days gets done in hours
Consistent code quality through enforced patterns
Zero context switching between projects
Complete project history with detailed commit messages

Limitations:

Requires well-structured initial specs
Works best with established tech stacks
Can get stuck on ambiguous requirements
Still needs human oversight for architectural decisions

# The Bigger Picture

This approach fundamentally changes how I think about project work. Instead of batching development into dedicated coding sessions, every idea gets immediate implementation. The latency between concept and working prototype drops from weeks to hours.

The key insight: AI agents excel at execution, but struggle with ambiguity. Give them clear objectives and robust tooling, and they’ll outperform traditional workflows. Leave them to figure out requirements or handle edge cases, and they’ll burn API credits spinning their wheels.

# Implementation Details

The loop.sh script handles the orchestration:

# Autonomous execution with subagent delegation
$CLAUDE_CMD \
    -p "Use task-master subagent to review $TODO_FILE and select the next task. Implement completely. Use git-master subagent to commit changes." \
    --append-system-prompt "You are an autonomous coding agent operating without human supervision..."

Key behaviors encoded in the system prompt:

Always use specialized subagents for todo and git management
Never ask for confirmation—make decisions and execute
Document blockers but keep moving forward
Update task status immediately when starting/completing work
Make reasonable assumptions when facing ambiguity

# Results and Next Steps

Six months in, I’ve shipped every idea and side project. The removal of development friction unlocks a different kind of productivity—ideas flow directly into working code.

The next evolution involves multi-agent collaboration. Instead of a single agent working through a todo list, imagine specialized agents for frontend, backend, testing, and documentation working in parallel on different aspects of the same project.

But even the current single-agent approach represents a fundamental shift. When implementation becomes as frictionless as having an idea, the bottleneck moves from execution to creativity.

And that’s exactly where it should be.

The loop.sh script and subagent configurations are available as open-source tools. This isn’t about keeping the approach secret—it’s about proving that autonomous development is practical today with existing tools.

Nikola Balić

Building go-to-market engines for AI-driven products with purpose. Worked with innovative startups like Numarics, Codeanywhere, Daytona, and Steel on growth strategies and market positioning. Faculty at University of Split, researching AI adoption patterns and developer tools.

# The Philosophy: Maximize Autonomous Execution

# The Stack: Three Tools, Maximum Impact

# How It Actually Works

# Production Patterns That Emerged

# What Works (And What Doesn’t)

# The Bigger Picture

# Implementation Details

# Results and Next Steps

Engineers Building with AI

Related Articles

Designing CLI Tools for AI Agents

What Makes a Great Coding Agent

From Bash Script to AI-Native Go CLI in One Session

Nikola Balić