Sick of AI Agent Frameworks

15 points by ailover 4 days ago | 7 comments

i've been using loads of ai agent frameworks (crewai, langgraph, autogen) and i just don't get why they are so popular. i can literally do everything what they offer spending a few hours with cursor, and better. if i want easy agents i want a platform at a higher abstraction layer where agents are literally ai generated for me and offer robust scalable methods around it like agent workflows, not frameworks that i can generate the code for with my ai. i've been looking at two agent platforms phidata (although still requires quite some coding and workarounds) and omega . ai (more do it for you kind of platform). both pretty impressive and getting some serious agents up, now have a marketing agent doing my daily blogs, social media posts and twitter replies. what are some other non-framework ai agent platforms you use? looking to try out more

bsenftner 2 days ago | next |

These "frameworks" are useless, and as you say you can do what they offer in a few hours and better. So, stop using them. It's not a popularity contest.

Case in point: I don't use any frameworks whatsoever. I wrote a conversational AI Agent that helps me write AI Agents, integrated that into an office suite, unleashed that at the law office where I'm CTO, and we currently have just shy of 900 agents created by me, the attorneys and their staff. They support new client interviews, legal research, document authoring, case financial modeling, and pretty much whatever the staff needs to help them do what they already did before AI, just now they have idiot savant help.

Everything is based upon chat completion with most using structured output that is I/O with the office suite's internal data structures. The AI Agents act as virtual co-workers inside the office software used by staff, and they each personalize their agents to their needs.

If I tried to do this with these frameworks, I'd still be dinking around with their abstractions and lack of documentation.

2024user 21 hours ago | root | parent |

Can you provide some more info? How did you integrate the agents into an office suite? Is that a bespoke office suite?

bsenftner 19 hours ago | root | parent |

I used open source implementations of a web browser based word processor and a spreadsheet, which by the fact that they are open source their source code itself as well as the developer's GitHub and various support forums for those open source tools are in the major LLM's training data.

Typically, I begin by creating a chatbot that I've seeded (told) it knows some open source tool because it was a contributor to that open source project, and I engage in a conversation with it to figure out what data structures and what APIs within that tool are useful for data retrieval and data injection of the application's active in use data. For a word processor, that would be the document itself, and for a spreadsheet that is the cells and their data and/or formulas.

Then, I examine the different ways to get data from and put data into the tool, and write a single purpose AI Agent for each different form of I/O. For example, when working with the word processor there are different things a person might want an agent to do, such as revise the layout of the document. That requires the HTML/CSS format of the document and there is an agent that handles those requests. It is easy to have a document whose HTML/CSS representation is larger than an LLM model's output, and that triggers the creation of two more agents: one that operates on the current selection, meaning it works on a subset of the larger document only, and another that breaks the document into chunks and processes it in parts small enough for the LLM's available output, which requires additional handling to insure the chunking of the document does not affect contextual flow of the transformed document text.

Other things a user might want to do with a document concern the content of the document, the words themselves. For example, a user might want a literary critic to assess how well they wrote something and how understandable it will be for an audience of some characteristics. That type of question does not require the HTML/CSS of the document, it only requires the words, and if delivering the HTML/CSS along with the words the HTML/CSS gets in the way and the LLM has to do extra work to filter the HTML/CSS away to even being looking at the writing quality. Plus, if only sending words, not the HTML/CSS, a lot more information can get delivered to the LLM to consider than can be send as HTML/CSS.

Yet another, different type of question that my system supports is using a document as the context seed to create a new chatbot that knows the subject of the document more than the document itself contains, and is able to identify incorrect portions and misleading or confusing portions of the original document. This feature is very useful with spreadsheets, because general knowledge of using a spreadsheet is weak in most people. The "spreadsheet discussion bot" I have will reverse engineer an unknown spreadsheet and explain how to use it to a person, as well as identify questionable formulas and methods that the spreadsheet may be using.

Each of these examples represent a different type of data representation and it's use from the same document. All I'm doing is figuring out each tools internal data representation and then taking it, changing it with an LLM, and then putting it back into the tool, which then uses that changed data unaware anything changed.

bsenftner 18 hours ago | root | parent |

Of course, what I do with the data using an LLM is separate and can be complex. I have written my own prompting framework I call "method actor promoting" that has two layers: first I tell the LLM that it is a method actor, using the formal terms method actors use. This creates an impersonating LLM that goes further, goes deeper in the impersonation than just a plain LLM. Then I tell that "method actor" the role they are playing is a subject matter expert in whatever it is that the user is trying to do. Then communicating with that LLM Agent requires the user to actually treat them as if the are that expert, and when interacting with them using the terms and language that expert would expect when discussing their professional vocation.

My prompts are larger than most, but my replies from the LLMs tends to be very high quality.

I've also written what I call "chatbotBot" a chatbot that will conversationally gleam from the user a new agent they need, and then chatbotBot writes that new agent, or modifies an existing agent to suit, and then integrates that into the system for immediate use. Then there's "agent morphing" where an agent can have their knowledge and skills morphed to another set of knowledge and skills - which is very useful for some of the more complex agents (the spreadsheet agents) that have complex prompts and are difficult to modify.

You can check this out for yourself at https://midombot.com/b1/home I'm building in public, more or less, with little to no fanfare. What you see was all hand coded by my, except for the word processor and spreadsheet tools, which as I've describe above, I've heavily modified. I do not use AI coding tools, I find they do not help. But I do have multiple coding 'bots I've written that I converse with all the time about the strategy of coding.

lunarcave 3 days ago | prev | next |

They work, just barely, and definitely not up to the marketing hype. I'm in the space, and it's really hard to "sell" a solution to someone who had bought the marketing bs and says "X said they can do Y".

Most "agent frameworks" are just workflow builders with LLMs as a node in the workflow. Zapier can do most of what "agent frameworks" do with having OpenAI / Anthropic as a node. IMHO, to be an "agent", you need agency. And a lot of that agency has to do with having control over the control flow. (Design its own graph at runtime)

Agents with true agency - that is, generating the control flow at runtime is a hard problem (we've spent months into it - and it's still not generalizable). We've got a long way there with good old engineering (deterministic guardrails, progressively enriching context etc). (This is on top of the usual distributed systems problems - for example - solving for invoking tools idempotently / at least once delivery etc)

But the path of least resistance here is claim "How an AI agent 3x'd my inbound and mowed my lawn" and make a youtube video about how agents are going to take over the world.

xzzzzzz 2 days ago | prev | next |

I also find agents not worth the effort. But autogen has a method of creating agents automatically.

thiago_fm 4 days ago | prev |

Agents are all useless at the moment for me. I couldn't find a single good use to assist me in coding.

Maybe they eventually find a way, but right now it's worthless imho.

The updates to the foundational models is what makes a big difference.