Vibe Coding to Agentic Engineering
Core Concept
“Vibe coding” — free-form prompt → code, with the human reviewing each output — is the toy phase of AI-assisted development.
“Agentic engineering” — structured workflows for AI agents across the full SDLC — is the mature form.
The shift moves the programmer’s job from writing and reviewing code to designing the workflow and defining “good enough.”
“A lot of time was spent discussing how to move from vibe coding to agentic engineering, by structuring workflows for AI agents across the SDLC. For example, building agent prompts for ‘ility requirements’ (reliability, flexibility, stability, scalability), running them in sequence for code review. The questions we’re all facing are ‘What should the workflow be?’ and ‘how do we know when it’s good enough?’” — Robbie McKinstry, DevGuide conference takeaway (May 2026). See source.
The Shift
| Vibe Coding | Agentic Engineering | |
|---|---|---|
| Unit of work | Prompt → code | Workflow → artifact |
| Human role | Author + reviewer | Workflow designer + evaluator |
| Quality mechanism | ”Looks right to me” | Defined acceptance criteria run as agent prompts |
| SDLC posture | Single-stage | Multi-stage, composable |
| Failure mode | Output looks plausible but breaks in prod | Workflow gaps — unevaluated dimensions slip through |
| Skill being developed | Prompt crafting | Specification, evaluation, workflow design |
The concrete example from the source: instead of one prompt that writes code, you have a pipeline of agent prompts. One prompt checks reliability. Another checks flexibility. Another stability, another scalability. Each is a distinct evaluation stage. The human composes the pipeline and defines the pass criteria.
The Two Open Questions
Per the source, the entire industry is wrestling with:
-
“What should the workflow be?” — There is no canonical SDLC-for-agents yet. Every shop is building its own. The room of Tailscale/Sentry/Honeycomb founders didn’t have a shared answer.
-
“How do we know when it’s good enough?” — Code that “compiles and passes tests” used to be the floor. Now it’s the ceiling of vibe coding. The new floor needs explicit ility coverage, security review, performance budgets, operational fitness — each as an evaluation stage with a defined pass threshold.
Whoever answers these questions cleanly first will define the practice. This is why the SDLC-for-agents space is currently a land grab.
The Role Survival Paradox
The same DevGuide consensus held that code review by humans goes away, but programming as a profession does not. This is not a contradiction — it’s a role shift:
- The programmer no longer reviews code line-by-line. Agents do that, in parallel, across more dimensions than any human could hold.
- The programmer does design the workflow, define the acceptance criteria, and evaluate the agent’s output at the artifact level.
The job moves upstream from “author of code” to “architect of the production system that authors code.” See Bottleneck-Shifts-Upstream.
How to Apply
For any AI-assisted development effort, ask:
- What stages does this work need? (Discovery, design, test gen, implementation, review, QA, deploy.)
- What’s the agent prompt for each stage? (Including the implicit “ility” prompts: reliability, security, performance, accessibility, etc.)
- What’s the acceptance criterion at each stage? (Concrete pass/fail, not “looks good.”)
- Where are the human checkpoints, and what’s the human evaluating? (The artifact, not the code.)
- What’s reused vs. bespoke per project? (Workflows compound when they’re reusable.)
If you can’t answer 1–4, you’re still vibe coding regardless of which model you’re using.
Where I’ve Seen It
- Pipeline skills (Stage 1–7): Discovery → Architecture → Gameplan → Test Generation → Implementation → Review → QA Plan. This is agentic engineering in practice at small scale. Each stage has a defined prompt, output artifact, and acceptance criterion. Runs via pipeline-autopilot for autonomous use. The Pipeline skills are the answer to “what should the workflow be?” for one engineer’s productivity.
- Trabian’s Mesh (May 9 demo): Matt Dean’s framing — AI at the construction layer, deterministic execution at runtime — is the same idea applied to ops/operational workflows. See Build-AI-Run-Deterministic.
- A skeptic-to-believer arc: an engineer moved from “AI is bullshit” to a substantive agentic-engineering use case in 60 minutes after picking a concrete observability workflow. The shift from vibe (free-form AI experimentation) to structured (specific scoped workflows) was the unlock.
- DORA Elite teams: a downstream signal that structured workflows produce predictable throughput. Elite teams aren’t faster because they vibe-code better; they’re faster because their workflow is structured.
- WCP Cloud existence: WCP exists because agentic engineering requires context continuity across sessions. Without WCP, every agent session restarts cold and the workflow can’t compound.
Related Patterns
- Build-AI-Run-Deterministic — closest sibling. Agentic engineering operates AI at the build/spec layer, with deterministic runtime. Same architectural insight expressed at a different scale.
- AI-Ready-Engineering — the prerequisites. Code health, TDD, and supervisory workflows are the substrate. Fowler/Tornhill argue: AI amplifies whatever you already have. Without AI-ready engineering, agentic engineering degrades into vibe coding wearing a workflow costume.
- Augmentation-Over-Automation — agentic engineering is augmentation applied to the SDLC. The agents do the 80% (per-stage execution); humans do the 20% (workflow design, evaluation).
- Capability-Autonomy-Risk-Triangle — agentic engineering is what you build inside the triangle’s safe corner: high capability per stage, low autonomy across stages (human decides advancement), low risk because each stage is checked.
- Toy-Dismissal-Trap — “vibe coding” was the toy phase. Don’t dismiss; raise ambition. The mature form is agentic engineering. Same trap shape as Excel/VBA → no-code → agentic platforms.
- Bottleneck-Shifts-Upstream — the role-survival paradox. Programming moves upstream from code authorship to workflow design and acceptance evaluation.
- AI-Planning-Inflection-Point — directly addresses the “how do we know when it’s good enough?” question. The inflection point is when more planning stops producing signal; at that point, ship a slice and iterate on real feedback.
- Solve-The-Meta-Problem — agentic engineering is the meta-problem solution for code production. Don’t solve the project; solve the system that solves projects.
- Iteration-Speed-Is-The-Strategy — agentic engineering compounds iteration speed because the workflow is reusable across projects.
- Defibrillator-Demo — the social mechanism for shifting an org from “AI is hype” to “we need to invest in agentic engineering.” Step-change demos do what incremental ones can’t.
Cross-References
- Capability-Autonomy-Risk-Triangle — sister atomic from the same source