Menu
Sign In Pricing Add Podcast

Tool Use Host

Appearances

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

1186.552

Yeah, it makes a lot of sense. I do share the sentiment that evals are very underutilized. It's kind of remarkable to me how many, even like startups aren't using very many evals. I feel like probably less than 10% of startups are using like true eval suites. Kind of brings me to my next question of like, what mistakes do you see most often when businesses try to implement AI?

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

1205.686

I think there's obviously, you know, there's hallucination, maybe overengineering, but I'm curious what like are the primary mistakes you see when businesses are trying to implement agents?

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

148.593

We're super glad to have you on. I guess we can kind of kick things off. I think everyone kind of has their own definition of what an agent is, it seems like. There's not really a very good definition. I'm kind of curious how you define an agent and kind of what that means to you.

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

394.987

That's really smart. Yeah, we've played a little bit with like trying to use agents for kind of optimizing some of the podcast tasks. And I think we have so much experience with AI that we kind of think in workflows. We think of agents and we kind of know what they're capable of. Some people we talk to like have...

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

411.07

almost no experience in AI, other than like the one chat GPT conversation they've had. And so like, even just understanding where AI like fits into their equation into their business is kind of a difficult thing. Like, where do you start with someone that doesn't have a lot of experience in AI? Like, how do you kind of explain the benefit to them and how they can get started?

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

430.861

Like, what's one thing that they could start like this week with AI?

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

901.86

I've enjoyed deep research so far. I think there's limitations. I think what's also weird is some of the limitations are like when it hallucinates, it's hard to know that it actually hallucinated. Like it's like in these areas that I'm not an expert at, like it could just say something and then cite the reference. And I'm like, oh yeah, that's true because it read the thing, right?

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

NLW on the Future of AI Agents

921.116

And so it's like, it's so much harder to spot these hallucinations. And I feel like hallucinations are still an issue in the world of AI. And I feel like that's something that we're still trying to solve. How big of an issue is hallucinations, do you think? And is that like a primary complaint that you see with businesses?