EP 010 - Truth About "Smart" Agents You NEED to Know (intelligent automation agents)
Show notes
Feeling overwhelmed by the buzz around AI agents? In this episode, we dive deep to clarify the real meaning of AI agents and how they differ from traditional automation tools.
You'll Learn:
- The difference between rule-based automation and intelligent automation powered by AI
- How AI agents can revolutionize workflows and achieve goals autonomously
- Real-world examples of AI agents in action (including a shopping assistant that makes phone calls!)
- Ethical considerations of using AI agents on social media platforms
Show transcript
00:00:01: Hello and welcome to our next episode of the AI Boardroom.
00:00:06: Today we have a pretty interesting topic which got a lot of buzz this week.
00:00:12: Serana, tell our audience what this is all about.
00:00:16: Yeah, so today we'll be talking about automated agents.
00:00:20: We'll be covering what they actually are, the types.
00:00:25: So there are different types of agents and what they mean for you, how to actually
00:00:30: implement them, and then what does it mean for your business.
00:00:34: And I think what Edgar was kind of alluding to as far as like being the buzz
00:00:37: of the week is because of Google Next.
00:00:40: And that was a hot topic that was discussed, it happened to be...
00:00:44: be able to join kind of the conference and observe all the events, talk to lots of
00:00:50: vendors, go to sessions and kind of learn, but it was a very strong theme, I think,
00:00:57: at the conference for sure.
00:01:01: So agents, I think in general, people are looking to automate their workflows.
00:01:05: They're looking to implement AI, not just in sales marketing and...
00:01:12: I want to say customer service agents, but we are seeing kind of a lot of promise of
00:01:19: using agents across different workflows.
00:01:21: And I think before we even joined the podcast, you know, Edgar and I were
00:01:25: talking about even for coding or automating workflows as it relates to your
00:01:29: day -to -day work.
00:01:30: You don't have to, again, use it for business necessarily, but how do you even
00:01:35: implement these agents in your day -to -day?
00:01:37: I mean, that is possible.
00:01:38: So I think...
00:01:39: Again, automation and efficiencies are something that can impact your day -to
00:01:46: -day work, but then you can also introduce them into your personal life.
00:01:49: So we'll focus on the business side, but just wanted to mention that it's
00:01:53: definitely something that's being talked about more this week, for sure.
00:01:59: Definitely, yeah.
00:02:00: Yeah, I was kind of curious.
00:02:02: I was also writing to you how it was in Vegas.
00:02:07: I think you still haven't told me the evening stories.
00:02:12: We have to get to this after the podcast.
00:02:16: Of course you didn't.
00:02:17: No, no, no, that's the version for your employer.
00:02:21: Yeah.
00:02:21: Save it.
00:02:26: Okay, no, and I was kind of curious because I've read a lot of headlines on
00:02:32: YouTube and agents were like the Google is all in agents and foreign agents and stuff
00:02:38: like that.
00:02:39: And then after watching the actual keynote and also I watched also the developer
00:02:44: keynote, I was kind of, yeah, yeah, maybe.
00:02:51: What is it?
00:02:52: Yeah, it directly came to me like what is an agent and it was kind of also the idea
00:02:59: of today's episode to dive more into what are agents in the AI world and also what
00:03:08: did Google mean and how did we solve stuff earlier and before.
00:03:13: So yeah, we have a bit of a roundup and deep dive today on what this actually
00:03:18: means.
00:03:20: So I think Solana, you brought it up.
00:03:23: So like we have, you said we have like, we have had already and still have today rule
00:03:30: -based agents.
00:03:31: And we now get into a realm where it's kind of slowly moves into not only rule
00:03:38: -based but also intelligence and intelligence agents, which you love to
00:03:43: elaborate on that.
00:03:44: Yeah, and so I think a lot of people when we talk about, or a lot of buzz that I
00:03:50: would say you would hear outside of these conferences is maybe LinkedIn, this is
00:03:55: where I hang out quite a lot, is automating workflows with Zapier.
00:04:01: And so I just wanted to kind of differentiate that with Intelligent Agent
00:04:06: because Zapier is a rule -based engine.
00:04:10: So and the way that you kind of define and...
00:04:14: differentiate between intelligence and rule base if you have to explicitly spell
00:04:18: out like when you experience this then you do this you're basically are spelling out
00:04:24: spelling out tasks and rules for it to follow and it's limited to the
00:04:31: integrations that it has so Zapier is really not that intelligent but a lot of
00:04:37: there again sales automation agents or i'm sorry the automated agent
00:04:41: experts in AI are selling you this as kind of automations that are intelligent.
00:04:47: And I've seen that quite a lot.
00:04:49: So I just want you to kind of emphasize that the Zapier and those types of tools
00:04:54: that are just automating when you have to spell out explicit rules, it's not
00:04:59: intelligence.
00:05:00: So.
00:05:00: also, like you have to, every branch has to be described as its own way to go.
00:05:06: Whereas if you have true intelligence, what does it actually mean?
00:05:10: Like we have, and that's basically also like, if you look at today's chatbots,
00:05:14: they look not that different, but they are a lot more flexible.
00:05:18: And that's because they are not pre -programmed to specific answers, but they
00:05:24: are programmed to a specific cause.
00:05:27: And...
00:05:28: following that course, they do what they have to and what they are able to with the
00:05:33: toolbox they have been given.
00:05:35: So that's basically like, that's another paradigm.
00:05:38: And that's the same for automation.
00:05:40: So we had Zapier, you also have on the Microsoft side, Power Automate.
00:05:45: I think make .com is also something that came up last year.
00:05:50: So these are all...
00:05:52: like automation workflow tools.
00:05:54: Like we had workflow tools basically from my whole career.
00:05:57: I started in IT in 2010 and a bit earlier like in school and stuff like that.
00:06:03: Like through all that time we had workflows and automations.
00:06:07: And what you mentioned the AI consultants that are running out there and AI
00:06:13: automation agencies, I think they're called.
00:06:19: What they...
00:06:20: just kind of emphasize maybe before you go on and the reason why they're calling them
00:06:23: AIs because you can actually integrate Zapier with AI tools, but that doesn't
00:06:28: make them intelligent.
00:06:29: Yeah, I'm sorry.
00:06:31: I stole your thunder, I'm sorry.
00:06:34: yeah, like Varna said, we have now the option to take these workflows and
00:06:40: automate parts of it.
00:06:42: One example would be I made a chatbot for, I think it was a website which needed
00:06:50: marketing, which needed to onboard marketing customers.
00:06:57: And through that onboarding, I had unstructured data in the form of...
00:07:02: They just typed in their CI and their basic company stuff, uploaded their logo,
00:07:11: that stuff.
00:07:12: So we had still a path to go, but instead of doing every path all the time or every
00:07:19: branch, I was able to integrate AI and significantly, significantly,
00:07:24: significantly, that's the word today, that's like significantly reduce.
00:07:31: reduce the amount of workflow, like parts I needed, because I could do it partly in
00:07:39: a loop.
00:07:39: So I replaced all the branches that were pre -programmed with one AI loop.
00:07:44: So there is intelligence in there, but it's not like the AI itself deciding where
00:07:51: to go and what to do, and maybe come up with new ways, which would be actually
00:07:55: intelligent.
00:07:56: And I like that definition.
00:07:58: I think hopefully for the audience, it's easier to like also decipher when you kind
00:08:01: of break down.
00:08:03: I think AI in general is better understood when you break these distill it down to
00:08:08: the essence.
00:08:08: So if you explicitly have to program rules and define workflows, very likely that
00:08:15: you're dealing with rule -based automation.
00:08:17: If there are things that you can rely on these services or tools that.
00:08:23: You don't have to explicitly spell out every possible scenario.
00:08:27: I think as Edgar was mentioning, that is AI enabled automation basically, or an AI
00:08:33: truly artificial intelligence automation tools.
00:08:36: And I think the other beauty behind it is that they learn based on the paths and
00:08:41: based on the workflows that you actually execute, and then they become better over
00:08:44: time.
00:08:44: And I think that's the intelligent piece is that there's some machine learning,
00:08:48: some enhancements and some learning mechanisms that are embedded in that
00:08:51: process.
00:08:52: So,
00:08:52: you don't have to explicitly spell out every possible scenario and that's where
00:08:57: the automation piece comes in.
00:09:00: right.
00:09:00: For example, I built one solution where the AI gets like a complete document and
00:09:08: makes a step -by -step tutorial from the document.
00:09:12: And there are no pre -programmed steps whatsoever.
00:09:14: It just decides what to give out, if anything, and also decides, okay, I
00:09:19: haven't found anything useful for you.
00:09:22: We have to solve it another way.
00:09:24: So, and that's where, and even that...
00:09:26: could debate if it's really intelligent, but yeah, it doesn't need tutorials, it
00:09:32: just needs a document describing the process and it makes a tutorial from it
00:09:36: automatically.
00:09:37: So that's kind of, but we could maybe take it later on the episode where I can
00:09:43: explain how you get this process intelligent because I thought already
00:09:47: about that.
00:09:48: But yeah, let's first of all talk again about Google.
00:09:53: Google was in the...
00:09:55: on the next keynote.
00:09:57: And as far as I understood, they basically call everything an agent because they have
00:10:01: customer support agents, which are basic chatbots with a knowledge base.
00:10:05: Like it's usually not called agent, it's a rack pipeline.
00:10:09: And you have a chatbot with retrieval argumented generation, which is fine
00:10:16: because the whole grounding is also a word which more and more gets into all the AI
00:10:22: stuff.
00:10:24: grounding is pretty pretty useful because it just grounds everything the chatbot
00:10:28: says on actual documents but it's not an agent at least not how I understand it
00:10:36: Yeah, I'm grounding just for the audience, I think, as you were kind of explaining
00:10:40: it.
00:10:40: So it avoids hallucination.
00:10:42: So if you're basically saying that, I think it's pretty big for businesses,
00:10:48: actually.
00:10:48: So if hallucination is a big risk for your organization, we've seen that where you
00:10:53: have airlines and, I want to say, automotive agents being, not agents, but
00:10:59: chat bots being customer facing, and they just make up stuff.
00:11:03: You want to make sure you limit.
00:11:05: their responses to whatever is available in your documents.
00:11:09: And so if you're, if it doesn't exist in the documents, they basically say, hey, I
00:11:14: don't have that knowledge, I can't respond.
00:11:16: So I think that's, that's what I think Edgar means.
00:11:19: But I think it's a huge consideration.
00:11:21: I heard a lot of that, especially if you're in the banking industries, or any
00:11:25: of the highly regulated industries, that's really huge.
00:11:28: So, and there was a lot of emphasis at the conference on that too.
00:11:33: Yeah, definitely.
00:11:34: So, and that's where kind of like the agent wording fell apart a bit on the
00:11:41: whole event because they use basically everything that integrates Gemini was an
00:11:47: agent.
00:11:48: So, no.
00:11:50: But if that's not an agent, what is?
00:11:53: And they had another really interesting example, which I found, which was shopping
00:11:59: assistant.
00:12:00: Mm -hmm.
00:12:00: And that was more of an agent to me than like just normal chatbot because they even
00:12:06: showed it.
00:12:07: They made a call to the call center, which also was an AI agent.
00:12:13: And it updated her chat during the call, which was, I think it was in the developer
00:12:20: keynote.
00:12:21: I found it fascinating because she was like, she uploaded a video and said, I
00:12:26: want the same shirt as the lead singer has.
00:12:30: And then it searched the shirt, which was looking like this.
00:12:33: And then she called to order, and it updated the stuff in the actual chat,
00:12:39: which I found, like, this was not only multimodal, it was also multi -channel.
00:12:45: And that's more of an agent, from my point of view, because there you have stuff
00:12:50: happening in the background, stuff is creating, stuff is searched, and it's more
00:12:54: of a system of...
00:13:00: AI tasks working together basically.
00:13:05: And I think that's also something I have to give Google props for.
00:13:10: They had this shop, which was the overarching theme or example they went
00:13:16: through.
00:13:16: And they had pretty good examples, I find, to how a business would use them.
00:13:21: And something maybe for people to test out because I did talk to someone on that
00:13:26: particular topic too, although I didn't see the developer keynote.
00:13:30: If you wanted to experience what Edgar is talking about as far as like being able to
00:13:36: shop for specific items, go on Pinterest.
00:13:38: I recently explored that.
00:13:40: Pinterest has that option.
00:13:41: So if you, let's say, you probably can't upload the image, but if you're browsing
00:13:45: something, and I thought it was pretty cool.
00:13:48: in your browsing of an image and you know sometimes it's not really that image is
00:13:52: not linked to a shopping page sometimes it's just an image that's associated like
00:13:56: a stock image with an article but you're like that's a really cool shirt and if you
00:14:00: can actually you can actually double tap it and it'll locate an item that looks
00:14:05: like that for you to shop.
00:14:07: it's nothing new in general.
00:14:09: Google had the stuff also implemented.
00:14:14: But now having...
00:14:18: I don't know.
00:14:19: That was not even their example from the demo that they had.
00:14:24: That was an actual customer of Google.
00:14:26: And they had their pictures indexed in a vector database.
00:14:32: And that was what I found interesting because...
00:14:34: I never thought about it, but it makes total sense if we have multimodal models
00:14:39: to also vectorize pictures and then find the nearest neighbor, basically.
00:14:44: So that was kind of cool.
00:14:47: I like that a lot.
00:14:51: But yeah, still not an agent for me.
00:14:58: So for me, AI agents, this is...
00:15:02: basically where the whole industry is heading into like in that direction.
00:15:08: And it's a topic which I follow like for like the first thing I wanted to build was
00:15:14: actually an agent in AI, but I wasn't, I was limited by the technology of my time
00:15:20: basically to quote Iron Man.
00:15:26: Yeah, to yeah, what I...
00:15:29: think an agent has to be something that works at least partly autonomous.
00:15:38: And there is a bit of autonomy in the search itself when you talk about the
00:15:45: shopping example.
00:15:47: But it's still a passive system.
00:15:50: I still have to say, hey, I have this request.
00:15:55: Please make something that solves this request.
00:16:00: Yes.
00:16:01: What I understand from an agent is, I have this goal, please reach this goal and do
00:16:10: everything you need on the way.
00:16:12: So it basically doesn't start with a problem, but it starts with a goal and all
00:16:19: the issues along the way which arise to reach that goal, it creates tasks for
00:16:26: itself and solves them.
00:16:28: And that's actually an agent for me.
00:16:31: Yes, and I totally agree.
00:16:32: So it would be multi -step, you define the goal.
00:16:35: And then, so for example, in the shopping example, it's actually, I want to buy the
00:16:40: shirt.
00:16:41: So the agent would be able to find based on the picture of whatever the prompt is,
00:16:47: being able to figure out which API to use to which store that is likely, and then
00:16:53: put that shirt into your cart and potentially take you to the...
00:16:58: basically checkout page, so all you have to do is just agree to pay for it.
00:17:02: So it resolved the issue of finding or browsing websites, finding the store,
00:17:09: finding all of these things.
00:17:11: So you don't have to check with a human along the way.
00:17:14: It basically takes you from what you want, from a problem to a solution, and in
00:17:21: however many steps it requires, and it just kind of figures out the rest.
00:17:25: And you can even bring it one step further if you have some cues or trigger which
00:17:34: actually, like the agent is able to like every day look at your emails and do the
00:17:40: work.
00:17:42: You know, like the goal is to keep your inbox clean and then the agent goes ahead
00:17:47: and does everything he needs to.
00:17:49: And I'm personally a big fan of...
00:17:54: partly autonomous agents which have some boundaries where they need human
00:17:59: intervention to keep going or to fulfill a task.
00:18:02: For example, if you have a social media agent, which every day scours your
00:18:09: internal company documents and looks for interesting stuff that happened and
00:18:13: creates a post, for example.
00:18:16: Let's just imagine it takes into consideration contents of meetings.
00:18:21: You might...
00:18:22: want to look over the post before it's posted.
00:18:26: So the last task, which is basically post on LinkedIn or something, there would be a
00:18:33: task where the boundary says only with human intervention, and it creates a task
00:18:37: where the human has to confirm for it to keep going.
00:18:41: Which doesn't mean it cannot work on other posts, but this one specific post is in
00:18:48: the line for being confirmed to then go on.
00:18:51: I think it's...
00:18:52: You have the same thing in the European systems where people change customer data,
00:18:57: for example.
00:18:58: And if it's finance data, then finance has to approve the change.
00:19:03: So stuff like that, it's common also for natural intelligence.
00:19:06: So why shouldn't it be for AI?
00:19:10: So go back to the social, I think, example, because I see a lot that maybe
00:19:13: this is going to be a hot topic for people to, that people are curious about.
00:19:18: So we do see a lot of agents being used on commenting.
00:19:23: And I think, you know, a lot of times people are looking to grow on LinkedIn,
00:19:27: and they try to make themselves visible on people's posts.
00:19:30: And so, and clearly, some of these comments are automated, like there's no,
00:19:37: I'm sure that there's no human,
00:19:39: in the loop involved because there's one post that was pretty funny that happened.
00:19:45: You know, someone, you know, I collaborated with someone and I think the
00:19:49: whole post was, you know, my collaborator basically kind of sharing his experience
00:19:56: and how thankful he was for the collaboration.
00:19:58: And then I think whoever posted the comment was like, oh, I too, I'm so glad
00:20:04: that you, Svetlana, have been, had a chance to collaborate with me.
00:20:08: So.
00:20:09: clearly got the context wrong.
00:20:11: And it was like completely.
00:20:15: So, so he didn't understand the context of like what was basically said or whatever,
00:20:22: but then there's also again, like a lot of discussions that are happening that people
00:20:27: are just kind of again, like there's like clear tell signs that this is automated,
00:20:33: like generation.
00:20:34: Can you maybe say a little bit more as to like how.
00:20:37: some of these agents actually work?
00:20:39: Because again, we see them a lot across social media platforms.
00:20:43: So how does it actually work?
00:20:46: So for me, it's all about task planning and execution.
00:20:51: So like one thing when talking about LLMs, oftentimes arises is that the systems
00:20:57: themselves are not able to plan.
00:21:00: That does not mean generally planning is out of the window.
00:21:05: It means just in a request, it's not planning ahead.
00:21:08: So it just generates the next token.
00:21:11: But what it can do,
00:21:13: And that's how you basically would start implementing it is first is how to, yeah,
00:21:19: like to implement a system, to tell the AI how to fill the task list and where to get
00:21:26: the tasks from.
00:21:28: Because like, like I always say, LLMs are text in, text out, and everything has to
00:21:34: be solved by language.
00:21:36: And you want basically to go from unstructured, your goal, to structured,
00:21:41: your task list.
00:21:42: And that's how it would start.
00:21:46: So you will go ahead and the first thing you have to solve is make one AI persona,
00:21:53: usually there are several AI personas working on the request to plan out all the
00:22:01: steps to get to the goal.
00:22:03: And then it goes ahead and basically with classical programming, you take one step.
00:22:09: after another, or maybe even you could do it with function calling and the AI gets
00:22:13: the next task itself.
00:22:16: But one way or another, you get one task, you execute on the task, you finish the
00:22:23: task, and then you go to the next task.
00:22:24: And that's basically the most basic implementation of that.
00:22:30: So if the context is write me LinkedIn comments, then one task would be, okay,
00:22:36: get the current LinkedIn feed.
00:22:39: Second task would be identify posts which are interesting and worth commenting on.
00:22:47: Third, generate the comments.
00:22:49: Fourth, ask the human if the comments are fine.
00:22:54: And then fifth would be post the task, post the comments.
00:22:59: And that's basically you use the AI as a brain for
00:23:08: planning out and doing all the text work, language work basically.
00:23:14: And then you go ahead and do basic classical programming to execute on that.
00:23:18: And you also can do, can prepare tools and tell the AI, hey, this is the tool you
00:23:25: have, you can execute on it.
00:23:27: But this basically you have to combine the thinking, LLM, with the execution, which
00:23:32: is like normal programming.
00:23:34: That's like the basic concept of how you would do it.
00:23:39: And I think you have to also focus on the tasks.
00:23:41: Again, I'll say this because I think I just see a little buzz also or kind of
00:23:47: talk on this on social media platforms.
00:23:49: I think using chat bots on social platforms is cheating.
00:23:56: You're taking claim for or leaving comments as if it's your presence and
00:24:02: making people feel like you were there to actually.
00:24:05: look at those comments and really think of an output to relationship build.
00:24:11: But then there's really none of you there.
00:24:15: And people, again, follow you and subscribe to you.
00:24:17: And this is why you kind of see these big followings.
00:24:21: But then people don't really communicate with them or don't relationship build with
00:24:27: these individuals because clearly they're their child.
00:24:30: So I feel like...
00:24:32: Again, there's use cases where chatbots should be included, but I don't think
00:24:37: chatbots on social media is one.
00:24:40: Yeah, I think it's cheating.
00:24:41: I'm always a bit...
00:24:46: Yeah, for example, a lot of times in my career, I had people helping me with
00:24:53: social media.
00:24:55: And because I was doing a lot of other stuff and I was just not putting in the
00:25:00: work for, or wasn't able most of the time to put in the work, create the post, craft
00:25:06: it, do a going camera, create the visuals for that.
00:25:11: I never did outsource commenting.
00:25:16: On that I'm kind of with you.
00:25:19: You either want to say something or you don't, but let AI comment on your behalf.
00:25:27: That has also been done by employees of influencers and companies and stuff, so
00:25:33: it's nothing new.
00:25:35: Is there actually a difference for it to be done by AI?
00:25:40: I think so.
00:25:41: I mean, even as you mentioned, I think that there's two schools of thoughts,
00:25:45: maybe one is you leave a comment because you truly are kind of reflecting on
00:25:50: someone's posts, like someone invested the time.
00:25:53: But why would even your employee comment on something that, you know, someone else
00:25:59: has written, right?
00:26:00: So like, what's what's kind of the point?
00:26:01: Like the only purpose, again, between using these.
00:26:05: automated agents basically that are running through social media platforms is
00:26:10: to gain visibility.
00:26:12: And so because you're into grow following, which again, I think it's again, a
00:26:18: cheating way to do that because there's really no authenticity behind it.
00:26:24: And so, and this is why I think I actually value even if you have a huge following
00:26:28: and we know that some of the...
00:26:33: bigger influencers, I think, in the industry.
00:26:35: They don't have the time to comment.
00:26:36: So guess what?
00:26:37: They don't release chat bots or their employees to comment on people's stuff.
00:26:42: Sometimes they'll leave a few comments, but I do think that authenticity is really
00:26:47: huge if you want to really stick to it.
00:26:49: So again, there's use cases.
00:26:50: And the reason why I'm saying this is there's use cases where you should
00:26:54: consider, is this something that I should be using AI for?
00:26:57: And this goes back to ethics.
00:27:01: But there are use cases where, I mean, it's game changing, right?
00:27:05: So things that are truly helping you build efficiencies into your workflows.
00:27:10: But again, when there is those questions like, should I be doing it?
00:27:14: Like ask a friend for an opinion sometimes.
00:27:18: Get a second opinion is all I'm basically hoping to inspire.
00:27:23: And one thing to always keep in mind, you always are in danger of posting stuff you
00:27:33: didn't want to post.
00:27:34: Back in the day, to be honest, I used Lampod, which is basically you are in the
00:27:41: community, and Lampod automates the interaction between the community members.
00:27:48: And I...
00:27:51: Yeah, you get likes and if you want to even get comments and you can even pre
00:27:56: -write the comments for your posts.
00:28:01: And you can comment on other people's posts automatically.
00:28:05: The thing is, I deactivated it after one day, I guess, or so.
00:28:10: Because if you comment on other people's posts and it's a post about right -wing
00:28:16: politics or something, you might not want to have your name under that.
00:28:20: It's like awesome!
00:28:27: Yeah, so that's definitely something to keep in mind or to have an eye on.
00:28:33: What are you actually commenting on?
00:28:37: Because we know how social media works.
00:28:42: You don't look properly and you're so quickly switching into social media
00:28:48: bubbles, which you didn't want to get in.
00:28:50: So yeah, I think there is a lot of risk involved too.
00:28:55: The same, like, yeah, you wouldn't do like completely automated sales emails without
00:29:00: actually looking over it.
00:29:02: So, or at least you shouldn't.
00:29:04: I think it's like kind of the same.
00:29:06: it's done.
00:29:07: And this is what I'm saying again, like before you automate these pipelines or
00:29:12: these workflows, just kind of consider the impact that those automations will create
00:29:19: as human in the loop, I think, a big portion of it.
00:29:23: And I would say maybe marketing sales, depending on industry, but I would say,
00:29:26: you know, some of these automations like human in the loop is like a must,
00:29:30: especially in highly regulated industries, you would never want to release.
00:29:35: an automated comment, nothing, I would say, like in healthcare and banking.
00:29:40: There's a human always involved, so make sure you consider that.
00:29:43: So don't release anything that's truly automated end to end without having kind
00:29:50: of human in the loop approved for at least that workflow.
00:29:52: Yeah.
00:29:54: There is also, we will come shortly to the challenges.
00:29:57: There's also another thing you have to keep in mind for that.
00:30:01: But before we get to that, I would love to just mention some of the frameworks.
00:30:06: So Google now created, or released, showed Vertex AI agent builder on their cloud
00:30:13: platform.
00:30:14: As far as I understood, and I didn't try it out, so it's like with a grain of salt,
00:30:19: it's more in the realm of GPTs.
00:30:22: Mm -hmm.
00:30:23: OpenAI, which also has to be mentioned here.
00:30:29: Yeah, like this is not an active system.
00:30:32: It's still more of a prompt builder kind of thing where you are able to argument
00:30:39: your context with actions, with third party sources, or with documents in form
00:30:44: of a rag pipeline.
00:30:45: So it's not as sophisticated.
00:30:48: It's even not as sophisticated as some of the demos they showed.
00:30:52: Mm -hmm.
00:30:53: cannot build that with the Vertex AI agent builder, which is kind of sad because it
00:31:00: would have been possible to put a bit more capability into that.
00:31:04: Yeah, I agree.
00:31:07: and then there's open source as always.
00:31:09: And there are two projects which I really, really want to emphasis, put a bit of an
00:31:14: emphasis on.
00:31:15: One is Autogen, which is a Microsoft framework which lets you build AI agents.
00:31:22: And one of the main things to be considered there is like you have
00:31:27: different roles and different personas working together.
00:31:32: And that's...
00:31:34: actually what the framework enables you to to easily or like more easily manage it to
00:31:40: not be the position to program everything yourself.
00:31:44: And there is another completely open source, I think at least there is no big
00:31:50: company behind that.
00:31:52: Or at least not released it because Autogen is basically released by
00:31:55: Microsoft.
00:31:56: And it's called Crew AI.
00:31:59: And I think Crew AI even has a like Pokemon like UI where
00:32:04: you see the people working together and stuff like that.
00:32:08: So yeah, Pokemon, yeah.
00:32:12: you if you like that, that may be a motivating factor for you to try it out.
00:32:18: Yeah, I think, if I'm not completely mistaken, but I think Crew AI was the one
00:32:23: agent platform which has also, might also be something else.
00:32:31: I might be mistaken on that one, I'm sorry.
00:32:34: But Crew AI is basically the latest and most interesting open source project I've
00:32:45: seen and it's...
00:32:47: If I see modern agent videos a lot of the times they use WhoAI because yeah, it's
00:32:53: just working well.
00:32:55: And I think we talked about GPTs also as some sort of, I guess, belonging to the
00:33:03: intelligent agent space.
00:33:06: So how so?
00:33:07: Because I thought it was, you know, a lot of the capabilities of GPTs right now are
00:33:13: basically glorified prompt agents, like prompted agents.
00:33:19: if we mentioned vertex, yeah, agent builder, it's just kind of like, to be
00:33:25: complete, we have to know the GPTs.
00:33:27: From our definition, it's not an agent, but they sell it like it, and people will
00:33:34: read agent in the context of that, so yeah.
00:33:39: Take it as you wish, I will not say like,
00:33:44: force anyone to use agents like we use it, but if you talk about AI agents, from my
00:33:52: point of view and my own opinion, you have to have at least partly autonomous
00:33:57: systems.
00:33:59: Yeah, I guess the playing the devil's advocate, you could potentially, so if
00:34:06: your workflow involves just creating content, right?
00:34:09: So you could potentially automate a big portion of your workflow of creating
00:34:15: content, if that's literally all you're kind of doing, which will probably involve
00:34:19: research, crafting an outline, and then generating.
00:34:22: So like there are a few steps in between for you to get from a blank sheet.
00:34:27: to a polished article, right?
00:34:31: So you could, again, depending on what your workflow is, but it's not going to be
00:34:34: able to automate a conversation.
00:34:37: You know, the GPT having a conversation with a client.
00:34:42: So there's certain tasks that I think it could be, and it's not even trained, but
00:34:47: like prompted or system prompted in order to kind of fulfill as far as multiple
00:34:52: steps.
00:34:53: But still, I think you are kind of...
00:34:55: providing it context, you're explicitly telling it what to do and how to
00:34:59: potentially achieve that task.
00:35:01: So again, depending on how you look at it, you could argue that there is some
00:35:05: intelligence that it's automate.
00:35:08: Some of the steps are automated, but still, it's not truly hands off.
00:35:14: Yeah, yeah, yeah.
00:35:16: And you have to activate it.
00:35:18: And that's where I would love to have its last topic, like what does it mean for you
00:35:24: and your business.
00:35:26: So we've said it at some point already, but we have passive stuff, which is
00:35:32: reacting to requests and try to solve the specific requests.
00:35:37: And then you have active stuff, which kind of is.
00:35:41: meant as autonomous agents which actively create their own tasks and solve them.
00:35:48: And like solve them one by one to reach a certain goal.
00:35:52: And this means basically you can do more complex stuff.
00:35:59: If you do it right, you can basically create virtual employees for that, which
00:36:06: are doing work and solving stuff for you.
00:36:11: And yeah, I think I don't have to tell any business owner how huge such an
00:36:16: opportunity might be.
00:36:19: But there are also challenges involved in that.
00:36:23: And yeah, would you elaborate on that?
00:36:28: Yeah.
00:36:29: So I would say cost and the precision, right?
00:36:34: So you want to make sure that if you're truly automating kind of workflows, that
00:36:41: depending on the solutions or what AI tools you've actually implemented, you can
00:36:48: run into some big inference costs or just costs in general, because again, you've
00:36:53: automated these workflows and you're kind of releasing them into the wild.
00:36:56: especially if you don't have human in the loop.
00:37:00: Some of the costs of these models, especially if you have Generative AI
00:37:03: plugged into those workflows, I think that's the biggest eye -opening kind of
00:37:10: situation is Gen .AI is quite expensive.
00:37:13: So you think the costs go up significantly depending on how many tasks you're
00:37:19: actually running.
00:37:20: So especially if you're doing that as part of your automation workflows.
00:37:24: they can run a big.
00:37:25: And so when you see your first bills, you want to make sure that whatever workflow
00:37:31: you're automating, it's actually worth the investment into the AI.
00:37:36: So the goal is for you to build efficiencies.
00:37:39: And efficiencies, the way they translate to business leaders, I think is ROI.
00:37:44: So what process are you actually trying to improve?
00:37:47: And what's the cost of that process end to end?
00:37:50: And then...
00:37:51: what would be kind of the projected cost for you to actually do this with agents.
00:37:55: So you want to make sure that you've introduced, you didn't introduce a process
00:38:00: that actually costs you more than to do it in kind of the old way.
00:38:04: So there's an ROI ballpark calculation that you'll need to do to make sure that,
00:38:09: you know, it's worth investing, not just for the sake of technology and saying,
00:38:14: hey, I have AI automating my workflows.
00:38:16: It actually has business sense for you to really implement.
00:38:20: And I think you want to make sure that you also look at precision.
00:38:26: And so, and I think, and maybe you'll kind of, I'd love to get your angle on this
00:38:31: too, but humans are really good at adopting.
00:38:35: And so one of the things that AI in general is looking to do is to kind of
00:38:41: replace how we think.
00:38:42: But it's not the same level of competence as we are.
00:38:46: So sometimes some of the processes...
00:38:48: that we run on a daily basis, yes, they're manual, but we are able to be flexible
00:38:53: enough to accommodate new situations and things that we would call edge cases.
00:38:58: We're just intelligent enough to be like, hey, this looks like, you know, this
00:39:02: process, let's do it.
00:39:03: So the accuracy, as you mentioned, like precision, sometimes with humans is higher
00:39:08: than these machines because they're still learning.
00:39:10: So again, depending on the maturity of your model and your process, you may
00:39:14: experience some.
00:39:15: issues as kind of these automated workflows encounters, especially like
00:39:21: these edge cases that they're not, have not been exposed to in kind of their, in
00:39:27: previous.
00:39:28: So yeah, I'd love to get your thoughts on that as well.
00:39:31: Yeah, the precision part is actually the one which is really hurtful because LLMs
00:39:40: are not 100%.
00:39:41: That's what we learned with all the usage of it over the time.
00:39:46: If you think about a system which is not 100 % and at some points even a bit lower,
00:39:57: like 85%, 80%, 60%, who knows?
00:40:01: on one task, and if you just add up enough tasks, and they are all a bit like, it's
00:40:08: like building a tower, and every bit is a bit misplaced.
00:40:13: And at some point, if the tower gets high enough, it just breaks.
00:40:17: And that's basically where we're at.
00:40:22: We are, like, this is something that will be solved with time.
00:40:26: So if you now implement an agent,
00:40:29: and it's maybe not as capable as you would love to have it because of that lack of
00:40:35: precision, this will solve itself by time.
00:40:40: We get newer models which will be more precise, which will do less mistakes and
00:40:44: less hallucinations, and then we get to it.
00:40:48: Another thing also connected to the complexity is context window, which at
00:40:56: least was an issue.
00:40:58: until this week or last week.
00:41:02: Because now we get to the models with a million context, but I think context is
00:41:07: not an issue anymore.
00:41:08: But that's something to be at least aware of that you have context that's building
00:41:15: up because like you create a task, the task had output, you have to consider the
00:41:19: output for the next task and to solve it and stuff like that.
00:41:23: So...
00:41:24: like splitting it up in separate tasks like reduces context sizes significantly,
00:41:28: what is it today?
00:41:31: Significantly, yeah, I don't know.
00:41:34: By a huge amount.
00:41:36: And that's really weird.
00:41:40: Yeah, it reduces it by a huge amount and but you still, if the task is complex
00:41:45: enough, for example, if you have like a lot of code which you wanna execute,
00:41:50: context might still be an issue, especially if you use.
00:41:53: like local models.
00:41:55: So yeah, that's that.
00:41:59: I'm...
00:42:00: note that I'll emphasize that the error being compounded, but I think it's not a
00:42:05: unique issue, which I'd love to talk about maybe one of the future episodes too is if
00:42:10: you're orchestrating multiple systems together, you have to look at the error
00:42:15: rates of each, not just tasks with automated agents, but those precision
00:42:21: rates are like these error rates compound.
00:42:25: And so you have to track kind of where...
00:42:29: and enhance each one of those steps.
00:42:30: So there's monitoring and observability that you have to kind of implement to make
00:42:34: sure that you have a good grasp as to what that's happening.
00:42:37: And so that you have an understanding, okay, well, what part of the process can
00:42:41: you really improve?
00:42:42: So there are things, again, we're not still at the level of maturity that these
00:42:46: automated agents are plug and play.
00:42:49: So there are some as you kind of build them, and I think this is where, Edgar,
00:42:53: you have a lot of experience.
00:42:55: There's some fine tuning that you have to do or some...
00:42:57: Yeah.
00:42:59: kind of molding of the system that you have to do so they're not completely out
00:43:03: of the box.
00:43:05: use fine tuning, stuff like that, yeah.
00:43:08: So do anything you can to avoid hallucinations, but it's the same for
00:43:14: every AI implementation whatsoever.
00:43:17: You have to keep the error rate down.
00:43:20: Like I always say, usually in classic programming, you had input A and you had
00:43:25: output B.
00:43:26: Now you have input A and you have output A to C or A to D.
00:43:31: a good play, I like that.
00:43:32: B, but you have to work with A to B.
00:43:34: But it's fine.
00:43:36: Error correction was something that we had at IT forever, and now we have just a
00:43:43: different way of error correction.
00:43:47: And yeah, it's solvable.
00:43:50: I think I still believe, and I'm a strong believer, and not everyone would agree
00:43:54: with that, but I'm a strong believer that we are...
00:43:58: really close to real intelligence just by engineering it right.
00:44:04: So I think it's not a matter, well, not completely the matter of only getting
00:44:10: better system, but it's also a matter of engineering the right implementation.
00:44:15: So that's why the correct integration of AI systems and knowing what's possible and
00:44:20: knowing the ins and outs is really valuable.
00:44:24: So before you start anything,
00:44:26: try to get this right and try to anticipate what issues you could face and
00:44:34: how to solve it.
00:44:36: And most of them are solvable.
00:44:38: So yeah, that's just a hint that I can give.
00:44:43: Yeah, I think value packed again, hopefully you've found something out of
00:44:51: this episode that you kind of stuck to you.
00:44:54: And we'd love to if there's anything that you want us to dive in a little a level
00:44:59: deeper, or if you want to come as a guest with us, please join us.
00:45:04: We always like to have different guests talking in different topics and speaking
00:45:07: of guests, we will have Mike.
00:45:10: joining us and we'll talk more about his background but on AI for recruiting next
00:45:17: week.
00:45:18: So really excited about that episode.
00:45:20: Yeah.
00:45:21: Because recruiting is a hard task and having it automated with AI is a sweet,
00:45:27: sweet dream.
00:45:30: Yeah.
00:45:31: And I think, I mean, he, we did meet with him.
00:45:34: So I think, again, something to look forward to.
00:45:36: He's developed some really cool tools in kind of his space that he's been able to
00:45:43: implement to support his workflow.
00:45:45: So speaking of agents, maybe if anyone's interested to understand how some of the
00:45:51: tools that we've talked about could be implemented and talk about specific use
00:45:55: cases, tune in on to next week's episode.
00:45:58: But other than that...
00:45:59: Please don't forget to subscribe.
00:46:01: We'd love your support.
00:46:03: It's validating for us to hear that they were on the right track and continue doing
00:46:06: what we're doing and we enjoy talking about this stuff.
00:46:09: So thank you for taking the time and joining us this week.
00:46:13: Yeah, also thank you from my side.
00:46:15: Thank you Lana again for your wonderful co -hosting.
00:46:20: And yeah, it's been 10 episodes now.
00:46:24: Plus we had even an extra video.
00:46:29: So talking about that, keep an eye on our YouTube channel.
00:46:33: We started uploading news episodes for covering the news.
00:46:37: You might notice that we didn't do that in the episode that was on purpose.
00:46:41: So yeah, keep your eyes open, follow us on YouTube and then you might get the latest
00:46:48: AI news in your beloved setting.
00:46:54: And yeah, and we also do screen shares and show stuff there, so stuff we cannot do on
00:46:59: the podcast.
00:47:00: So yeah, please go to YouTube, subscribe and leave a comment.
00:47:04: It helps us.
00:47:07: Thank you.
00:47:07: Thank you for tuning in.
00:47:09: We'll see each other next week.
00:47:10: Bye bye.
New comment