Subagent Fan-Out - Neo Agent

Some jobs are naturally repetitive — produce a tailored summary for each company on a list, sweep ten devices for the same condition, or research a question from several angles at once. Subagent Fan-Out lets the agent split the work into pieces and tackle them in parallel instead of one after another. When all the pieces are done, the agent comes back with the results bundled together.

Subagent Fan-Out is available on every agent. There’s nothing to enable — the agent decides on its own when a given job is a good fit for fanning out.

When It Helps

Fan-out shines when the pieces of work are independent — each piece doesn’t need to know what the others are doing.

One output per item — a tailored note per company, a check per device, a summary per ticket cluster.
Multiple angles on the same question — investigate a topic from several sides at once and aggregate the findings.
Bulk audits — sweep many systems for the same condition without waiting for each one in turn.

When It Doesn’t Fit

Workload shape	Why fan-out hurts
Building something that has to fit together	A coherent app, a single long document, or a refactor across files — parallel pieces make conflicting choices and don’t compose.
Sequential work	If step 2 depends on step 1, there’s nothing to parallelise. A regular agent run is the right shape.
A handful of API calls	If the work is “make ten quick calls and combine the results,” a single agent with the right tools is cheaper than spinning up parallel runs.
Writes against shared records	Two parallel runs updating the same ticket or the same document can step on each other. Keep writes single-threaded.

If you want to nudge the agent toward (or away from) fanning out on a particular workflow, say so in the agent’s custom instructions — e.g. “when generating per-company summaries, do them in parallel.”

Safety

Control	Behaviour
Capped parallelism	An agent can’t spin up more than 10 parallel pieces in one fan-out, and an agent run can fan out at most 5 times — so a single run is capped at 50 subagents total. If a job genuinely needs more, the agent splits it into batches; if it would exceed the 5-wave limit the run fails clearly rather than spawning indefinitely.
No runaway runs	Each parallel piece has a runtime ceiling. Anything that overshoots is stopped and reported back as a timeout rather than quietly burning resources.
No nesting	A piece that’s running in fan-out can’t fan out again. This keeps the worst-case cost and complexity bounded.

Billing

Each subagent counts as its own agent execution. The cost of a fan-out run is the parent agent’s normal cost plus one execution per subagent — same rules you already know from Billing & Credits. How that breaks down depends on how the parent agent is set up:

Parent agent type	What each run costs
Triggered (runs on a ticket or time entry)	A flat cost per execution based on the agent’s most expensive tool — usually 1 credit, 2 if the agent uses Standard tools, 3 if it uses Advanced tools. Each subagent costs that same flat amount.
Scheduled (runs on a timer or manually from the dashboard)	Billed per tool the agent actually used — 1 / 2 / 3 credits per Basic / Standard / Advanced tool call. Each subagent is billed the same way, against the tools it called.

Worked example. A Triggered agent at the Standard tier dispatches 10 subagents. Cost = 2 (parent) + 10 × 2 (subagents) = 22 credits. A Scheduled agent makes 1 Basic tool call, then dispatches 3 subagents, each of which makes 2 Basic + 1 Standard call. Cost = 1 + 3 × (2 + 2) = 13 credits.

Failed or timed-out subagents are still billed — they consumed work even if they didn’t return a useful result. The agent’s own self-assessment of execution quality (visible on the execution log) is separate from the credit charge.

Fan-out is a power tool. The agent reaches for it only when the job has the shape of “do the same thing N times” — most workflows still run as a single agent.

​When It Helps

​When It Doesn’t Fit

​Safety

​Billing

When It Helps

When It Doesn’t Fit

Safety

Billing