Discussion about this post

User's avatar
Leonidas Raghav's avatar

Great article! I work on enterprise agents, and this piece lays out really well the issues and strategies my team has run into. We’ve found there’s a spectrum between generalisability and reliability: workflows are precise and more robust but rigid and domain-specific (almost like traditional software), while agent-based setups are more flexible but prone to errors and reliability issues.

What’s worked well for us is actually to start agent-first: give the model a set of tools and observe how it behaves. That exploration phase surfaces the paths that really matter, which can then be hardened into workflows for better reliability and cost savings, while still keeping agent flexibility for edge cases. In practice, this can strike a good balance between generalisability and reliability.

Expand full comment
1 more comment...

No posts