Looking for feedback: local AI agent that executes tasks (planning → code → execution → verification)

Posted by Outside-System-3698@reddit | LocalLLaMA | View on Reddit | 7 comments

Hi all,

I’m experimenting with an agent design and would like some feedback.

I’ve been working on a local-first AI engineering agent called ZERO.

Instead of just generating text, the system actually executes tasks:

Requirement → Planning → Code → Execution → Verification

Current demos:

  1. Requirement demo:

Takes a requirement.txt and produces structured outputs like:

- project_summary.txt

- implementation_plan.txt

- acceptance_checklist.txt

  1. Mini build demo:

Takes requirement + input data and:

- generates Python code (number_stats.py)

- executes it

- produces verified output (stats_result.txt)

Everything runs locally, with visible artifacts and task state.

This is more of an engineering agent runtime than a chatbot.

Repo (with demos and execution traces):

https://github.com/setsuna701031/ZERO_AI

Would love feedback on:

- Does this approach to agent loops make sense?

- Where would you draw the boundary between planning and execution?

- What would you prioritize next (reliability vs capability)?