Grind Mode
Tackle large QA tasks with parallel agents that plan and execute over hours.
Grind mode is designed for large QA tasks that require planning, coordination, and extended execution time. Instead of a single agent processing your prompt, Grind uses a planner that breaks your task into subtasks and delegates them to parallel workers.
When to Use Grind
| Use Standard Mode | Use Grind Mode |
|---|---|
| Single bug fix | Multi-file bug investigation |
| Generate a few tests | Comprehensive test coverage across a codebase |
| Quick code review | Full security or performance audit |
| Simple patch | Cross-cutting refactors and regression fixes |
How Grind Works
Phase 1: Planning
The agent investigates your codebase — reading files, running builds, checking tests — and produces a structured plan with phases and subtasks.
The plan is presented for your review before any code changes are made.
Plan Approval
You control when and how plans are approved:
- Manual approval — Review the plan and approve or reject it
- Auto-accept — Plans execute automatically without waiting for approval
When a plan is awaiting approval, you can:
- Approve — The agent begins execution
- Reject — The agent revises its approach
- Send a message with additional context or corrections
Phase 2: Execution
Once approved, the planner delegates subtasks to parallel workers:
- 2-8 workers run simultaneously on different parts of the codebase
- Each worker is scoped to specific directories to avoid conflicts
- Workers can be assigned different model tiers based on task complexity
- Sub-planners can recursively delegate for deeply complex tasks (up to 3 levels)
Verification
After all workers complete, the agent:
- Runs the build to verify compilation
- Runs tests and compares against the baseline
- Runs linters and checks for regressions
- Commits and pushes changes
Enabling Grind Mode
- Open the Agent page in the dashboard
- Toggle Grind in the input bar
- Select a time limit (e.g., 3 hours, 5 hours, 10 hours, or until done)
- Choose your plan approval preference (manual or auto-accept)
- Send your prompt
Sandbox Environment
Grind sessions run in an isolated sandbox with a full desktop environment. The Sandbox Panel gives you:
- Desktop view — Live stream of the agent's virtual desktop via VNC
- File changes — Real-time view of modified files
- Plan phases — Track progress through the execution plan
- Terminal — See command output
This is especially useful for E2E testing and UI validation, as the agent can interact with web applications visually and verify behavior in a real browser.
Tips
- Be specific — Detailed prompts produce better plans. Include what to test, expected behavior, edge cases, and constraints.
- Start with auto-accept off — Review the first few plans manually to calibrate quality, then switch to auto-accept for trusted workflows.
- Use for coverage goals — Grind is ideal for "add tests for all untested modules" or "fix all flaky tests in this repo" type tasks.