The Ultimate Agentic Engineering Workflow: Building Features with Multi-Agent Systems | Tom Karels

The Ultimate Agentic Engineering Workflow: Building Features with Multi-Agent Systems

YouTube

This video presents an advanced look into a modern agentic engineering workflow, showcasing how to build complex features using autonomous AI agents. The creator details their current toolkit, which centers on GPT-5.5 extra high fast as the primary model, Cursor for the integrated development environment, Greptile for automated code reviews, and Wispr Flow for streamlined voice-to-text prompting. The demonstration focuses on building an artifacts feature for the Pluto app, an enterprise-focused AI agent platform. This feature allows the AI to generate and preview interactive web components like HTML, React, and SVG files directly within the chat interface. The core of the process highlights the symbiotic relationship between human developers and AI sub-agents. The workflow involves delegating research and coding tasks to autonomous entities, followed by a rigorous, automated verification process using stacked pull requests. A key innovation discussed is the use of a recursive feedback loop where Greptile identifies bugs or security vulnerabilities, and Cursor automatically addresses these issues through multiple iterations until a perfect score is achieved. This approach significantly reduces manual debugging time while ensuring code quality through consistent architectural patterns and modular service layers.

AI Agents Agentic Engineering Cursor IDE

This video provides a deep dive into high-level agentic engineering, demonstrating how to build complex interactive features by orchestrating multiple AI agents and specialized development tools. Viewers will learn a specific workflow using GPT-5.5, Cursor, Greptile, and Wispr Flow to create an interactive artifacts preview system within a live application. The content focuses on moving beyond simple text-based AI assistance toward a fully autonomous cycle of research, generation, automated review, and recursive debugging.

Key Takeaways

Primary Stack: Effective agentic engineering currently benefits from high-speed models like GPT-5.5 integrated into Cursor, combined with specialized review tools like Greptile.
The Feedback Loop: Recursive cycles where one AI agent generates code and another critiques it (using tools like Greploop) are essential for achieving production-ready code quality.
Small, Stacked PRs: Breaking down large feature updates into multiple small, manageable Pull Requests (under 1,000 lines) prevents AI models from hallucinating or overlooking critical errors.
Sub-Agent Delegation: Complex tasks should be offloaded to specialized sub-agents to keep the main chat thread unblocked and maintain clear focus.

Timestamps

00:00

The Evolved Agentic WorkflowIntroduction to the latest changes in the creator's agentic development process.

00:42

My Tech StackBreakdown of the primary tools: GPT-5.5, Cursor, Greptile, and Wispr Flow.

01:50

Project Overview: Pluto ArtifactsDefining the new feature being built: an interactive artifacts system like Claude's.

04:50

The Greploop MechanismExplanation of the automated recursive code review and feedback loop.

09:05

Service Layer ArchitectureHow to structure code so that AI agents can read and write it more effectively.

12:12

The 5-PR Rollout StrategyWhy breaking features into small, manageable chunks is critical for AI agents.

16:00

Debugging the Artifacts FeatureIterating through UI issues and dark mode themes using automated feedback.

21:10

Autonomous Review CycleDemonstrating agents fixing security bugs and logic defects without human intervention.

Target Audience

Software engineers, AI researchers, and tech entrepreneurs interested in leveraging autonomous AI agents to accelerate software development and improve code quality.

Use Cases

-Implementing automated code review pipelines using AI sub-agents.
-Designing complex interactive features for web applications using generative models.
-Setting up a recursive AI debugging loop to handle edge cases and security flaws.
-Architecting modular service layers to facilitate easier AI-driven code modification.
-Scaling development capacity by delegating research tasks to specialized AI agents.

Key Topics